Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/1571
DC FieldValueLanguage
dc.contributor.authorTan, Waihongen_US
dc.contributor.authorFeng Chenen_US
dc.date.accessioned2021-05-05T04:46:45Z-
dc.date.available2021-05-05T04:46:45Z-
dc.date.issued2021-02-
dc.identifier.issn1863-8171-
dc.identifier.urihttp://hdl.handle.net/123456789/1571-
dc.descriptionWeb of Science / Scopusen_US
dc.description.abstractThe problem of tweet popularity prediction, or forecasting the total number of retweets stemming from an ancestral tweet, has attracted considerable interest recently. The prediction can be accomplished by fitting a point process model to the sequence of retweet times up to a certain censoring time and project the fitted model to a future time point. However, models employing such approach tend to have inferior prediction accuracy when the censoring time is too short before sufficient information can accumulate. To overcome this, we propose an empirical Bayes type approach of parameter estimation to combine internal knowledge on the times of historical retweets up to the censoring time and external knowledge on complete retweet sequences in the training data. We demonstrate the approach using several point process models with finite-dimensional parameters, where the prior distribution for the parameter of each model is constructed based on the external knowledge, and the likelihood is calculated based on the internal knowledge. The mode of the posterior distribution is used as the estimator of the finite-dimensional parameter, and the mean of the predictive distribution for the number of retweets implied by each of the estimated models is used to predict the tweet popularity. Using a large Twitter data set, we reveal that the proposed methodology not only enables prediction at time zero before the arrival of any retweet event, but also substantially improves the prediction performances of existing models, especially at earlier censoring times.en_US
dc.language.isoenen_US
dc.publisherSpringer Verlagen_US
dc.relation.ispartofAStA Advances in Statistical Analysisen_US
dc.subjectEmpirical Bayesen_US
dc.subjectKernel smoothingen_US
dc.subjectMaximum a posteriori (MAP) estimationen_US
dc.subjectNonparametric regressionen_US
dc.titlePredicting the popularity of tweets using internal and external knowledge: an empirical Bayes type approachen_US
dc.typeNationalen_US
dc.identifier.doi10.1007/s10182-021-00390-z-
dc.description.fundingUMK-FUND [R/FUND/A0100/01348A/001/2020/00840]en_US
dc.description.typeArticleen_US
dc.description.impactfactor1.16en_US
dc.description.quartileQ3en_US
dc.contributor.correspondingauthorwai.hong@umk.edu.myen_US
item.languageiso639-1en-
item.openairetypeNational-
item.grantfulltextopen-
item.fulltextWith Fulltext-
Appears in Collections:Faculty of Entrepreneurship and Business - Journal (Scopus/WOS)
Show simple item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.