Customer Acquisition via Explainable Deep Reinforcement Learning
Yicheng Song, Wenbo Wang and Song Yao
Additional contact information
Yicheng Song: Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455
Wenbo Wang: Marketing Department, Hong Kong University of Science and Technology Business School, Clear Water Bay, Kowloon, Hong Kong
Song Yao: Olin Business School, Washington University in St. Louis, St. Louis, Missouri 63130
Information Systems Research, 2025, vol. 36, issue 1, 534-551
Abstract:
Effective customer acquisition heavily hinges on sequential targeting to ensure that appropriate marketing messages reach customers. Sequential targeting could guide customers through the acquisition process and thus, optimize long-term revenue for the firm. Toward this goal, reinforcement learning (RL) has demonstrated great potential in facilitating sequential targeting during user acquisition. However, decisions made by RL during this process often lack explainability. We introduce the deep recurrent Q-network with attention model, which optimizes the long-term reward of sequential targeting while enhancing the explainability of the decisions. The key idea of the proposed model is to revise Q-learning by adding an attention mechanism to create a bottleneck, forcing the model to focus on features of the next ad exposure that will lead to optimal long-term rewards. We estimate our model using a comprehensive data set from a digital bank. The empirical results show that the proposed model is explainable and also outperforms state-of-the-art methods in terms of long-term revenue optimization. Specifically, the attention mechanism within the model functions as forward planning. The forward planning can spot those features in the next ad exposure that are more likely to lead to the optimal outcome. We further demonstrate how the model makes targeting decisions of advertising channel choices by showing that the model can (1) learn optimal ad channels to target customers from different industries, (2) adjust advertising channels in response to dynamic customer behaviors, and (3) learn the seasonality of the customer’s industry and calibrate the ad channel correspondingly.
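The key idea in the abstract, a Q-value computed through an attention bottleneck over the features of the next ad exposure, can be illustrated with a small sketch. This is not the authors' DRQN-attention model. It is a minimal illustration that assumes a linear attention scorer, a linear Q read-out, and a standard one-step Q-learning target. All function names, parameter names, and weight shapes (`attention_q_value`, `attn_weights`, `q_weights`, `td_target`) are hypothetical and do not come from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_q_value(hidden, features, attn_weights, q_weights):
    """Q-value computed through an attention bottleneck (illustrative only).

    hidden:       summary of the customer's history, e.g. a recurrent state
    features:     features of one candidate next ad exposure
    attn_weights: one row per feature, scoring it against the hidden state
                  (hypothetical linear scorer, an assumption of this sketch)
    q_weights:    linear read-out applied to the attended features
    """
    # One relevance score per feature, conditioned on the history summary.
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in attn_weights]
    alpha = softmax(scores)  # attention weights, non-negative, sum to 1
    # Bottleneck: the Q-value only sees features scaled by their attention.
    attended = [a * x for a, x in zip(alpha, features)]
    q = sum(w * z for w, z in zip(q_weights, attended))
    return q, alpha


def td_target(reward, next_q_values, gamma=0.9):
    """Standard one-step Q-learning target: r + gamma * max over next Q-values."""
    return reward + gamma * max(next_q_values)
```

In a sketch like this, the returned `alpha` is what one would inspect for explainability: a larger weight means the corresponding feature of the next ad exposure contributed more to the estimated long-term value.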
Keywords: explainable reinforcement learning; customer acquisition; DRQN-attention; long-term revenue optimization; advertising channel choice
Date: 2025
Downloads: http://dx.doi.org/10.1287/isre.2022.0529
Persistent link: https://EconPapers.repec.org/RePEc:inm:orisre:v:36:y:2025:i:1:p:534-551