EconPapers    
Economics at your fingertips  
 

Collecting event‐related tweets from twitter stream

Xin Zheng and Aixin Sun

Journal of the Association for Information Science & Technology, 2019, vol. 70, issue 2, 176-186

Abstract: Twitter provides a channel of collecting and publishing instant information on major events like natural disasters. However, information flow on Twitter is of great volume. For a specific event, messages collected from the Twitter Stream based on either location constraint or predefined keywords would contain a lot of noise. In this article, we propose a method to achieve both high‐precision and high‐recall in collecting event‐related tweets. Our method involves an automatic keyword generation component, and an event‐related tweet identification component. For keyword generation, we consider three properties of candidate keywords, namely relevance, coverage, and evolvement. The keyword updating mechanism enables our method to track the main topics of tweets along event development. To minimize annotation effort in identifying event‐related tweets, we adopt active learning and incorporate multiple‐instance learning which assigns labels to bags instead of instances (that is, individual tweets). Through experiments on two real‐world events, we demonstrate the superiority of our method against state‐of‐the‐art alternatives.

Date: 2019
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1002/asi.24096

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:jinfst:v:70:y:2019:i:2:p:176-186

Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=2330-1635

Access Statistics for this article

More articles in Journal of the Association for Information Science & Technology from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:jinfst:v:70:y:2019:i:2:p:176-186