44
From Event Detection to Storytelling on Microblogs Janani Kalyanam* Sumithra Velupillai Mike Conway Gert Lanckriet

From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

From Event Detection to Storytelling on Microblogs

Janani Kalyanam* Sumithra Velupillai Mike Conway Gert Lanckriet

Page 2: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Research PipelineInteresting question

Collect data Analyze data (fit statistical models)

Page 3: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Research PipelineInteresting question

Collect data Analyze data (fit statistical models)

Page 4: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Photo credit: https://marqana.com/wp-content/uploads/embed_toomany.jpg

How do we make sense of this?

Page 5: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Some tools

- Topic Modeling

- Event Detection

Page 6: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Some tools - limitations

- Topic Modeling - very short documents

- Event Detection - detect only the onset of events

Page 7: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Our focus in this work...

Page 8: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Our focus

- Topic Modeling - very short documents

- Event Detection - detect only the onset of events

Page 9: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Model

doc-term doc-top top-term

Page 10: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Model

Very sparse

Page 11: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Observation

The size of the vocabulary increases only marginally with increasing number of documents.

(Yan et. al. 2013)

Page 12: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

ModelThe term-by-term matrix is relatively denser.

Page 13: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

ModelThe term-by-term matrix is relatively denser.

Hence, decompose

term-term term-top top-term

Page 14: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event ProgressionAssume a set of documents arrive at every time step

Page 15: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event ProgressionAssume a set of documents arrive at every time step

Relate the current data to the past history

Page 16: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event ProgressionAssume a set of documents arrive at every time step

Relate the current data to the past history

Page 17: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event Progression

top-top top-term

Page 18: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Quick Aside:

The “tracking matrix” - helps connect the present to the past. This will help build the “timelines” for events.

Page 19: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Model

Page 20: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Model

KNOWN

Page 21: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Optimization- Collective Matrix Factorization (Singh and Gordon 2008)

- Details in the paper

- Implementation on github

https://github.com/kjanani/matrix_factorization/blob/master/matrix_factorization.py

Page 22: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Interesting Question

Ebola Outbreak 2014. What really happened?

Page 23: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Interesting Question: GoalsEbola Outbreak 2014. What really happened?

- Want list of all events that occurred.

- How did they progress?

Page 24: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Tasks

- Topic Detection

- Event timelines

Page 25: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Topic Detection

- Estimated topics: Estimate at every timestep. Each row is a distribution over words.

- Groundtruth topics: hashtags

Page 26: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Topic Detection Baselines

- Two baselines from the event detection literature

- A few classic topic modeling baselines (NMF, LDA e.t.c.)

Page 27: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Topic Detection Results

Page 28: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event Timelines

How to come up with timelines of events?

Page 29: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Event Timelines

How to come up with timelines of events?

Look at the tracking matrix

Page 30: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Tracking Matrix

- Tracking matrix is a square matrix of all positive entries

Page 31: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Tracking Matrix

- Tracking matrix is a square matrix of all positive entries- It show how the topics have changed from to - Row-i tells how topic-i at time is related to all the topics at time

Page 32: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines
Page 33: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

New events

Page 34: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Continuing events

Page 35: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Evolving events

Page 36: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Entropy H(X)

- Quantifies the amount of “randomness”

- Range [0, 2.32]

Page 37: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Based on Entropy

- < 1 → continuing events- 1 <= H(X) <= 2 → evolving events- > 2 → new events- Also ending events

Page 38: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Timeline Generation

- Look at heatmap- Connect dots to the previous timesteps (if possible)- Else, new events, or noise

Page 39: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Timeline

Page 40: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Example continuing events - memes

Page 41: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Example of evolving events

Page 42: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Thank you!

Questions?

Page 43: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Photo credit: http://www.realityblurred.com/realitytv/images/2015/03/mtv-the-real-world.jpg AND http://globalpse.org/en/wp-content/uploads/2016/04/Social-media-e1459493030525.jpg

Page 44: From Event Detection to Storytelling on Microblogsacsweb.ucsd.edu/~jkalyana/papers/ASONAM_presentation.pdf · 2016-08-21 · - Groundtruth topics: hashtags. Topic Detection Baselines

Social Media Era