Predicting the popularity of tweets using internal and external knowledge: an empirical Bayes type approach
Wai Hong Tan () and
Feng Chen ()
Additional contact information
Wai Hong Tan: Universiti Malaysia Kelantan
Feng Chen: UNSW Sydney
AStA Advances in Statistical Analysis, 2021, vol. 105, issue 2, No 6, 335-352
Abstract:
Abstract The problem of tweet popularity prediction, or forecasting the total number of retweets stemming from an ancestral tweet, has attracted considerable interest recently. The prediction can be accomplished by fitting a point process model to the sequence of retweet times up to a certain censoring time and project the fitted model to a future time point. However, models employing such approach tend to have inferior prediction accuracy when the censoring time is too short before sufficient information can accumulate. To overcome this, we propose an empirical Bayes type approach of parameter estimation to combine internal knowledge on the times of historical retweets up to the censoring time and external knowledge on complete retweet sequences in the training data. We demonstrate the approach using several point process models with finite-dimensional parameters, where the prior distribution for the parameter of each model is constructed based on the external knowledge, and the likelihood is calculated based on the internal knowledge. The mode of the posterior distribution is used as the estimator of the finite-dimensional parameter, and the mean of the predictive distribution for the number of retweets implied by each of the estimated models is used to predict the tweet popularity. Using a large Twitter data set, we reveal that the proposed methodology not only enables prediction at time zero before the arrival of any retweet event, but also substantially improves the prediction performances of existing models, especially at earlier censoring times.
Keywords: Empirical Bayes; Kernel smoothing; Maximum a posteriori (MAP) estimation; Nonparametric regression (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10182-021-00390-z Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:alstar:v:105:y:2021:i:2:d:10.1007_s10182-021-00390-z
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10182/PS2
DOI: 10.1007/s10182-021-00390-z
Access Statistics for this article
AStA Advances in Statistical Analysis is currently edited by Göran Kauermann and Yarema Okhrin
More articles in AStA Advances in Statistical Analysis from Springer, German Statistical Society
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().