Identifying interesting Twitter contents using topical analysis
•We discover interesting tweets for a wide audience based on topic identification.•We model Trend Sensitive-LDA that reflects the current and popular trends.•We weight topics by exploiting their representative words.•We weight topics by analyzing spatial and temporal variation of their probabilities...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2014-07, Vol.41 (9), p.4330-4336 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •We discover interesting tweets for a wide audience based on topic identification.•We model Trend Sensitive-LDA that reflects the current and popular trends.•We weight topics by exploiting their representative words.•We weight topics by analyzing spatial and temporal variation of their probabilities.•The most interesting tweets contain latent topics that are assigned a high weight.
Social media platforms such as Twitter are becoming increasingly mainstream which provides valuable user-generated information by publishing and sharing contents. Identifying interesting and useful contents from large text-streams is a crucial issue in social media because many users struggle with information overload. Retweeting as a forwarding function plays an important role in information propagation where the retweet counts simply reflect a tweet’s popularity. However, the main reason for retweets may be limited to personal interests and satisfactions. In this paper, we use a topic identification as a proxy to understand a large number of tweets and to score the interestingness of an individual tweet based on its latent topics. Our assumption is that fascinating topics generate contents that may be of potential interest to a wide audience. We propose a novel topic model called Trend Sensitive-Latent Dirichlet Allocation (TS-LDA) that can efficiently extract latent topics from contents by modeling temporal trends on Twitter over time. The experimental results on real world data from Twitter demonstrate that our proposed method outperforms several other baseline methods. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2013.12.051 |