Description
Micro-blogging platforms like Twitter have become some of the most popular sites for people to share and express their views and opinions about public events like debates, sports events or other news articles. These social updates by people complement the written news articles or transcripts of events in giving the popular public opinion about these events. So it would be useful to annotate the transcript with tweets. The technical challenge is to align the tweets with the correct segment of the transcript. ET-LDA by Hu et al [9] addresses this issue by modeling the whole process with an LDA-based graphical model. The system segments the transcript into coherent and meaningful parts and also determines if a tweet is a general tweet about the event or it refers to a particular segment of the transcript. One characteristic of the Hu et al’s model is that it expects all the data to be available upfront and uses batch inference procedure. But in many cases we find that data is not available beforehand, and it is often streaming. In such cases it is infeasible to repeatedly run the batch inference algorithm. My thesis presents an online inference algorithm for the ET-LDA model, with a continuous stream of tweet data and compare their runtime and performance to existing algorithms.
Details
Title
- Online ET-LDA - joint modeling of events and their related tweets with online streaming data
Contributors
- Acharya, Anirudh (Author)
- Kambhampati, Subbarao (Thesis advisor)
- Davulcu, Hasan (Committee member)
- Tong, Hanghang (Committee member)
- Arizona State University (Publisher)
Date Created
The date the item was original created (prior to any relationship with the ASU Digital Repositories.)
2015
Resource Type
Collections this item is in
Note
-
thesisPartial requirement for: M.S., Arizona State University, 2015
-
bibliographyIncludes bibliographical references (page 32)
-
Field of study: Computer science
Citation and reuse
Statement of Responsibility
by Anirudh Acharya