Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach
H Andrew Schwartz,
Johannes C Eichstaedt,
Margaret L Kern,
Lukasz Dziurzynski,
Stephanie M Ramones,
Megha Agrawal,
Achal Shah,
Michal Kosinski,
David Stillwell,
Martin E P Seligman and
Lyle H Ungar
PLOS ONE, 2013, vol. 8, issue 9, 1-16
Abstract:
We analyzed 700 million words, phrases, and topic instances collected from the Facebook messages of 75,000 volunteers, who also took standard personality tests, and found striking variations in language with personality, gender, and age. In our open-vocabulary technique, the data itself drives a comprehensive exploration of language that distinguishes people, finding connections that are not captured with traditional closed-vocabulary word-category analyses. Our analyses shed new light on psychosocial processes yielding results that are face valid (e.g., subjects living in high elevations talk about the mountains), tie in with other research (e.g., neurotic people disproportionately use the phrase ‘sick of’ and the word ‘depressed’), suggest new hypotheses (e.g., an active life implies emotional stability), and give detailed insights (males use the possessive ‘my’ when mentioning their ‘wife’ or ‘girlfriend’ more often than females use ‘my’ with ‘husband’ or 'boyfriend’). To date, this represents the largest study, by an order of magnitude, of language and personality.
Date: 2013
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (48)
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0073791 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 73791&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0073791
DOI: 10.1371/journal.pone.0073791
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().