EconPapers    
Economics at your fingertips  
 

A High-Dimensional Choice Model for Online Retailing

Zhaohui (Zoey) Jiang (), Jun Li () and Dennis Zhang ()
Additional contact information
Zhaohui (Zoey) Jiang: Tepper School of Business, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Jun Li: Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109
Dennis Zhang: John M. Olin Business School, Washington University in St. Louis, St. Louis, Missouri 63130

Management Science, 2025, vol. 71, issue 4, 3320-3339

Abstract: Online retailers are facing an increasing variety of product choices and diversified consumer decision journeys. To improve many operations decisions for online retailers, such as demand forecasting and inventory management and pricing, an important first step is to obtain an accurate estimate of the substitution patterns among a large number of products offered in the complex online environment. Classic choice models either do not account for these substitution patterns beyond what is reflected through observed product features or do so in a simplified way by making a priori assumptions. These shortcomings become particularly restrictive when the underlying substitution patterns get complex as the number of options increases. We provide a solution by developing a high-dimensional choice model that allows for flexible substitution patterns and easily scales up. We leverage consumer clickstream data and combine econometric and machine learning (graphical lasso, in particular) methods to learn the substitution patterns among a large number of products. We show our method offers more accurate demand forecasts in a wide range of synthetic scenarios when compared with classical models (e.g., the independent and identically distributed Probit model), reducing out-of-sample mean absolute percentage error by 10%–30%. Such performance improvement is further supported by observations from a real-world empirical setting. More importantly, our method excels in precisely recovering substitution patterns across products. Compared with benchmark models, it reduces the percentage deviation from the underlying elasticity matrix by approximately half. This precision serves as a critical input for enhancing business decisions such as assortment planning, inventory management, and pricing strategies.

Keywords: choice model; machine learning; retail management; high-dimensional; clickstream data (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://dx.doi.org/10.1287/mnsc.2020.02715 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:71:y:2025:i:4:p:3320-3339

Access Statistics for this article

More articles in Management Science from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().

 
Page updated 2025-04-19
Handle: RePEc:inm:ormnsc:v:71:y:2025:i:4:p:3320-3339