EconPapers    
Economics at your fingertips  
 

Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Sanjiv Das () and Mike Y. Chen ()
Additional contact information
Mike Y. Chen: Ludic Labs, San Mateo, California 94401

Management Science, 2007, vol. 53, issue 9, 1375-1388

Abstract: Extracting sentiment from text is a hard semantic problem. We develop a methodology for extracting small investor sentiment from stock message boards. The algorithm comprises different classifier algorithms coupled together by a voting scheme. Accuracy levels are similar to widely used Bayes classifiers, but false positives are lower and sentiment accuracy higher. Time series and cross-sectional aggregation of message information improves the quality of the resultant sentiment index, particularly in the presence of slang and ambiguity. Empirical applications evidence a relationship with stock values--tech-sector postings are related to stock index levels, and to volumes and volatility. The algorithms may be used to assess the impact on investor opinion of management announcements, press releases, third-party news, and regulatory changes.

Keywords: text classification; index formation; computers-computer science; artificial intelligence; finance; investment (search for similar items in EconPapers)
Date: 2007
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (105) Track citations by RSS feed

Downloads: (external link)
http://dx.doi.org/10.1287/mnsc.1070.0704 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:53:y:2007:i:9:p:1375-1388

Access Statistics for this article

More articles in Management Science from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Matthew Walls ().

 
Page updated 2019-11-30
Handle: RePEc:inm:ormnsc:v:53:y:2007:i:9:p:1375-1388