Text Summarization via Semantic Representation

吳旻誠

2014/07/16

Gist-content Question

• Asks for the main idea of the talk.

• The correct answer is the option closest to the overall theme of the content; the distractors refer only to small portions of it.

Gist-content Question

• Q. Which of the following is closest to the main idea of this talk?

– (A) We've had three explanations for why we might sleep.

– (B) When you're tired and you lack sleep, you have poor memory, poor creativity, increased impulsiveness, and overall poor judgment.

– (C) If you have good sleep, it increases your concentration, attention, decision-making, creativity, social skills, and health.

– (D) You do not do anything much while you're asleep.

Gist-content question generation

• Treat the most important sentence as the main idea of the talk.

• Use LexRank to measure the importance of sentences.

LexRank

• Measures the importance of sentences.

• Graph-based model (undirected).

• The nodes represent the sentences.

• The edges are weighted by the cosine similarity between the sentences they connect (see the formulation below).
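For reference, the centrality these slides build up to is presumably the continuous LexRank formulation of Erkan and Radev (2004), where d is a damping factor, N is the number of sentences, and adj(u) is the set of sentences adjacent to u:

```latex
p(u) = \frac{d}{N} + (1 - d) \sum_{v \in \mathrm{adj}(u)}
       \frac{\cos(u, v)}{\sum_{z \in \mathrm{adj}(v)} \cos(z, v)} \, p(v)
```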

Conditions that should be satisfied

• Stochastic matrix.

• Irreducible.

• Aperiodic.
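Together, these three properties guarantee that the random walk over the sentence graph converges to a unique stationary distribution, which gives the importance scores. Below is a minimal Python sketch (my illustration, not the talk's code; the tf-idf vectorizer and the damping factor 0.85 are assumed choices):

```python
# Minimal LexRank sketch (illustrative). Damping makes the transition
# matrix stochastic, irreducible, and aperiodic, so power iteration
# converges to a unique stationary distribution = sentence importance.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

def lexrank(sentences, damping=0.85, tol=1e-6):
    tfidf = TfidfVectorizer().fit_transform(sentences)
    sim = (tfidf @ tfidf.T).toarray()   # cosine similarity: tf-idf rows are L2-normalized
    np.fill_diagonal(sim, 0.0)          # ignore self-similarity
    n = len(sentences)
    row_sums = sim.sum(axis=1, keepdims=True)
    # Row-normalize into a stochastic matrix; sentences with no neighbors
    # get a uniform row so every row still sums to one.
    P = np.divide(sim, row_sums, out=np.full_like(sim, 1.0 / n), where=row_sums > 0)
    # Mixing in a uniform jump (damping) makes the chain irreducible and aperiodic.
    M = damping * P + (1.0 - damping) / n
    p = np.full(n, 1.0 / n)
    while True:                         # power iteration
        p_next = p @ M
        if np.abs(p_next - p).sum() < tol:
            return p_next               # importance score per sentence
        p = p_next
```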

Similarity Between Sentences

• But… what is the similarity between the following sentences?

– I will fully support you.

– I'll back you up all the way.
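A quick hypothetical check makes the problem concrete: as bags of words, these two paraphrases share only the token "you", so their surface cosine similarity is small even though the meanings match. This is what motivates moving to semantic representations below.

```python
# Hypothetical illustration: bag-of-words cosine similarity
# of two sentences that are paraphrases of each other.
import math
import re
from collections import Counter

def bow_cosine(a, b):
    va = Counter(re.findall(r"[a-z']+", a.lower()))
    vb = Counter(re.findall(r"[a-z']+", b.lower()))
    dot = sum(va[w] * vb[w] for w in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b)

print(bow_cosine("I will fully support you.", "I'll back you up all the way."))
# ≈ 0.17 — only "you" overlaps, despite the identical meaning.
```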

Deep Learning

• A set of algorithms in the machine-learning field.

• Learns representations of data.

• Has been applied to fields such as computer vision, automatic speech recognition, and natural language processing.

Reduce the Dimensionality of Data with Neural Networks (Hinton & Salakhutdinov, 2006)

Word2Vec

• An open-source tool from Google.

• Computes vector representations of words.

• Provides efficient implementations of

– the Continuous Bag-of-Words (CBOW) architecture.

– the Skip-gram architecture.
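A minimal usage sketch with gensim's open-source word2vec implementation (assuming the gensim 4.x API; the toy corpus and hyperparameters are illustrative, not from the talk):

```python
# Training both word2vec architectures with gensim (assumed 4.x API).
from gensim.models import Word2Vec

corpus = [
    ["i", "will", "fully", "support", "you"],
    ["i", "will", "back", "you", "up", "all", "the", "way"],
]

cbow = Word2Vec(corpus, vector_size=100, window=5, min_count=1, sg=0)       # CBOW
skip_gram = Word2Vec(corpus, vector_size=100, window=5, min_count=1, sg=1)  # Skip-gram

vector = skip_gram.wv["support"]           # 100-dimensional word vector
print(skip_gram.wv.most_similar("you"))    # nearest words by cosine similarity
```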

Continuous Bag-of-Words Model (CBOW)

• Predicts the current word from its surrounding context words.

Skip-gram Model

• Predicts the surrounding context words from the current word.

Softmax
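This slide presumably shows the standard word2vec output layer: the probability of an output word w_O given an input word w_I is a softmax over all W vocabulary words, with v_w and v'_w the input and output vector representations of word w.

```latex
p(w_O \mid w_I) =
  \frac{\exp\left( {v'_{w_O}}^{\top} v_{w_I} \right)}
       {\sum_{w=1}^{W} \exp\left( {v'_{w}}^{\top} v_{w_I} \right)}
```

Computing the normalizer costs O(W) per training example, which is what hierarchical softmax avoids.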

Hierarchical Softmax

• Uses a binary tree representation of the output layer with the W words as its leaves.

• Each word w can be reached by an appropriate path from the root of the tree.
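In the standard formulation (Mikolov et al., 2013), shown here for reference, n(w, j) is the j-th node on the path from the root to w, L(w) is that path's length, ch(n) is an arbitrary fixed child of n, [[x]] is +1 if x is true and -1 otherwise, and σ is the logistic sigmoid:

```latex
p(w \mid w_I) =
  \prod_{j=1}^{L(w)-1}
  \sigma\left( [\![\, n(w, j+1) = \mathrm{ch}(n(w, j)) \,]\!]
               \cdot {v'_{n(w,j)}}^{\top} v_{w_I} \right)
```

This replaces the O(W) softmax sum with roughly log2(W) sigmoid evaluations per word.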

Sentence Representations

Now that we have representations of words, how can we represent sentences with them? A recursive deep-learning model was proposed by the Stanford Natural Language Processing Group.

Recursive Autoencoder
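A minimal numpy sketch of the idea (random untrained weights, so purely illustrative): an encoder composes two child vectors into a parent vector, a decoder reconstructs the children, and greedily merging the adjacent pair with the lowest reconstruction error builds a tree whose root vector represents the whole sentence.

```python
# Illustrative recursive autoencoder with untrained random weights.
import numpy as np

d = 100                                        # vector dimensionality (illustrative)
rng = np.random.default_rng(0)
W_e = rng.normal(scale=0.1, size=(d, 2 * d))   # encoder weights
b_e = np.zeros(d)
W_d = rng.normal(scale=0.1, size=(2 * d, d))   # decoder weights
b_d = np.zeros(2 * d)

def encode(c1, c2):
    """Compose two child vectors into one parent vector."""
    return np.tanh(W_e @ np.concatenate([c1, c2]) + b_e)

def reconstruction_error(c1, c2):
    """How badly the parent's decoding reconstructs its children."""
    c1_hat, c2_hat = np.split(np.tanh(W_d @ encode(c1, c2) + b_d), 2)
    return np.sum((c1 - c1_hat) ** 2) + np.sum((c2 - c2_hat) ** 2)

def sentence_vector(word_vectors):
    """Greedily merge the lowest-error adjacent pair until one vector remains."""
    nodes = list(word_vectors)
    while len(nodes) > 1:
        errors = [reconstruction_error(nodes[i], nodes[i + 1])
                  for i in range(len(nodes) - 1)]
        i = int(np.argmin(errors))
        nodes[i:i + 2] = [encode(nodes[i], nodes[i + 1])]
    return nodes[0]
```

In the actual model the weights are trained to minimize the summed reconstruction error over a corpus; this sketch only shows the wiring.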

Dynamic Pooling
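As described in Socher et al. (2011), dynamic pooling maps the variable-size matrix of pairwise node similarities between two sentences onto a fixed-size grid, so a downstream classifier always sees the same input shape. A sketch (my illustration; the grid size is an assumed parameter):

```python
# Illustrative dynamic min-pooling of a variable-size similarity matrix.
import numpy as np

def dynamic_min_pool(S, out_size=15):
    # Repeat rows/columns when the matrix is smaller than the target grid,
    # so that no pooling region is empty.
    reps = (-(-out_size // S.shape[0]), -(-out_size // S.shape[1]))  # ceil division
    S = np.tile(S, reps)
    rows = np.array_split(np.arange(S.shape[0]), out_size)
    cols = np.array_split(np.arange(S.shape[1]), out_size)
    pooled = np.empty((out_size, out_size))
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            pooled[i, j] = S[np.ix_(r, c)].min()
    return pooled
```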
