EconPapers    
Economics at your fingertips  
 

Analyzing browsing across websites by machine learning methods

Andreas Falke () and Harald Hruschka ()
Additional contact information
Andreas Falke: Universitat Regensburg Wirtschaftswissenschaftliche Fakultat Regensburg
Harald Hruschka: Universitat Regensburg Wirtschaftswissenschaftliche Fakultat Regensburg

Journal of Business Economics, 2022, vol. 92, issue 5, No 5, 829-852

Abstract: Abstract The increasing importance of online distribution channels is paralleled by a rising interest in gaining insights into the customer journey and browsing behavior. We evaluate several machine learning methods (latent Dirichlet allocation, correlated topic model, structural topic model, replicated softmax model) with respect to their ability to reproduce the browsing behavior of households across websites. In addition, we compare these machine learning methods to a related classical technique, singular value decomposition. In our study, the replicated softmax model outperforms latent Dirichlet allocation, but the correlated topic model attains the overall best performance. Compared to singular value decomposition both the correlated topic model and the replicated softmax model lead to a more efficient compression of web browsing data. On the other hand, singular value decomposition surpasses latent Dirichlet allocation. We interpret results of the correlated topic model and the replicated softmax model by determining combinations of topics or hidden variables that are heterogeneous with respect to visited websites. We show that decision makers should not rely on bivariate measures of site visits, as these do not agree with measures of interdependences between sites that can be inferred from the correlated topic model or the replicated softmax model. We investigate how well topics or hidden variables measured by these methods predict yearly household expenditures. The correlated topic model leads to the best predictive performance, followed by the replicated softmax model. We also discuss how the replicated softmax model can be used to support online marketing decisions of websites.

Keywords: Online marketing; Web browsing; Machine learning; Topic models; Restricted Boltzmann machine (search for similar items in EconPapers)
JEL-codes: C38 C45 M31 M37 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s11573-021-01067-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jbecon:v:92:y:2022:i:5:d:10.1007_s11573-021-01067-4

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/11573

DOI: 10.1007/s11573-021-01067-4

Access Statistics for this article

Journal of Business Economics is currently edited by Günter Fandel

More articles in Journal of Business Economics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:jbecon:v:92:y:2022:i:5:d:10.1007_s11573-021-01067-4