Learning Personalized Privacy Preference from Public Data
Wen Wang () and
Beibei Li ()
Additional contact information
Wen Wang: University of Maryland at College Park, Information System, College Park, Maryland 20742
Beibei Li: Carnegie Mellon University, Information Systems, Pittsburgh, Pennsylvania 15213
Information Systems Research, 2025, vol. 36, issue 2, 761-780
Abstract:
Learning consumers’ personalized privacy preferences is crucial for firms and policymakers to establish trust and compliance and guide effective policymaking. Existing approaches rely mostly on private information such as proprietary user behavior data and individual-level demographic and socio-economic factors, or require explicit user input, which can be invasive and burdensome, potentially leading to user dissatisfaction. Nowadays, individuals generate and share vast amounts of information about themselves in the public domain, which can provide a valuable multifaceted view of their behaviors, attitudes, and preferences. This information thus has the potential to provide valuable insights into individuals’ privacy preferences. In this study, we propose a novel framework to predict personalized privacy preference by leveraging a ubiquitous source of public data—social media posts. Deeply rooted in psychological and privacy theories, we use deep learning model and natural language processing algorithms to learn theory-driven psychosocial traits such as lifestyle, risk preference, personality, privacy-related economic preferences, linguistic styles, and more from social media posts. Interestingly, we find that psychosocial traits from public data provide greater predictive power than private information. Furthermore, we conduct multiple interpretability analyses to understand what drives the model’s performance. Finally, we demonstrate the practical value of our model and show that our framework can assist platforms and policymakers in forecasting the consequences of privacy policies. Overall, our framework provides managerial implications for enhancing consumer privacy control and trust, optimizing platform data management, and informing policymakers about better data privacy regulations.
Keywords: personalized privacy preference; public data source; deep learning; natural language processing; psychosocial traits (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://dx.doi.org/10.1287/isre.2023.0318 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:orisre:v:36:y:2025:i:2:p:761-780
Access Statistics for this article
More articles in Information Systems Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().