Semantic Frame Identification with Distributed Word Representations



Karl Moritz Hermann‡¹, Jason Weston†, Dipanjan Das†, Kuzman Ganchev†

†Google Inc., New York    ‡Department of Computer Science, University of Oxford

Baltimore, 25 June 2014

¹The majority of this research is the result of an internship at Google.

We investigate features and classifiers for frame-semantic parsing.


Frame-Semantic Parsing Intro

Frame-Semantic Parsing

The task of extracting semantic predicate-argument structures from text.

[Annotated example: "John sold Mary a car." The candidate sell.V evokes the COMMERCE_BUY frame, with roles Seller = John, Buyer = Mary, Goods = a car; dependency arcs: nsubj, iobj, dobj, det.]


Frame parsing as a two-stage task

Frame-Semantic Parsing is a combination of two tasks:

• Frame Identification

• Argument Identification

This paper focuses on Frame Identification.

However, we also present results on the full pipeline task.

Representing frame instances

Default Approach

• Frame instances represented by candidate and context

• Context: sparse features based on parse tree

Distributed Approach

• Replaces binary features with word embeddings

• Context: same features, but represented distributionally
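To make the contrast concrete, here is a toy sketch of the two feature styles; the vocabulary, slot names, and dimensions are invented for illustration:

import numpy as np

# Toy vocabulary and embeddings; purely illustrative.
VOCAB = {"john": 0, "sold": 1, "mary": 2, "a": 3, "car": 4}
EMB = np.random.RandomState(0).randn(len(VOCAB), 4)

def sparse_features(context):
    """Default approach: one binary indicator per (slot, word) pair."""
    return {f"{slot}={word}": 1.0 for slot, word in context}

def distributed_features(context):
    """Distributed approach: same slots, but each word becomes its embedding."""
    return {slot: EMB[VOCAB[word]] for slot, word in context}

context = [("nsubj", "john"), ("dobj", "car")]
print(sparse_features(context))       # {'nsubj=john': 1.0, 'dobj=car': 1.0}
print(distributed_features(context))  # {'nsubj': array(...), 'dobj': array(...)}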

Instances to Vectors

Step 1: Parsing

• Dependency parse of input sentence

• Paths are considered relative to candidate word

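For illustration, one way to obtain such a parse and the candidate's direct dependents, using spaCy as a stand-in for the paper's dependency parser:

# Illustrative only: spaCy stands in for the parser used in the paper.
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("John sold Mary a car.")
candidate = next(t for t in doc if t.text == "sold")

# Dependency arcs, each token relative to its head:
for token in doc:
    print(f"{token.text:5s} --{token.dep_}--> {token.head.text}")

# Direct dependents of the candidate word (context for the next step):
print([(t.dep_, t.text) for t in candidate.children])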

Instances to Vectors

Step 2: Context word selection strategy

• Direct dependents based on parse

• Argument dependency paths learned from gold data


Instances to Vectors

Step 3: Embeddings

• Replace words with embeddings


Instances to Vectors

Step 4: Single vector creation

• Merge embeddings into a unified vector representation

• Effectively concatenation; zeros for empty slots

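A minimal sketch of Steps 1-4 combined, assuming a fixed slot inventory and toy embeddings (the actual model uses 128-dimensional embeddings; the slot set here is invented):

import numpy as np

DIM = 4                                      # 128 in the actual model
SLOTS = ["self", "nsubj", "iobj", "dobj"]    # illustrative slot inventory
EMB = {"sold": np.full(DIM, 0.1), "john": np.full(DIM, 0.2),
       "mary": np.full(DIM, 0.3), "car": np.full(DIM, 0.4)}

def instance_vector(candidate, dependents):
    """Steps 3-4: look up an embedding per slot and concatenate;
    empty slots contribute a zero block."""
    blocks = []
    for slot in SLOTS:
        word = candidate if slot == "self" else dependents.get(slot)
        blocks.append(EMB[word] if word in EMB else np.zeros(DIM))
    return np.concatenate(blocks)

x = instance_vector("sold", {"nsubj": "john", "iobj": "mary", "dobj": "car"})
print(x.shape)   # (16,) -- len(SLOTS) * DIM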

Joint-space Model (Wsabie) — Learning

Joint-space Model

• Instances represented in R^d based on pre-trained embeddings.

• Labels represented as discrete values given lexicon (size F).

• Learn a linear mapping M : R^d → R^m.

• Learn a matrix Y ∈ R^{F×m} to represent labels.

• Objective function:

  ∑_x ∑_{ȳ≠y} L(rank_y(x)) max(0, γ + s(x, ȳ) − s(x, y)),

  where y is the gold label of instance x, ȳ ranges over the incorrect labels, s(·,·) is the joint-space score, γ is a margin, and L(·) converts the rank of the gold label into a loss weight.
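A toy WARP-style update for this objective, in the spirit of Wsabie (Weston et al., 2011): negatives are sampled until one violates the margin, and the number of draws needed yields the rank estimate. All sizes and learning rates are illustrative:

import numpy as np

rng = np.random.default_rng(0)
d, m, F = 16, 8, 5                        # toy sizes
M = rng.normal(scale=0.1, size=(m, d))    # mapping R^d -> R^m
Y = rng.normal(scale=0.1, size=(F, m))    # one joint-space row per label

def s(x, f):
    """Joint-space score of label f for instance x."""
    return Y[f] @ (M @ x)

def warp_step(x, y, gamma=1.0, lr=0.01):
    """One stochastic update: sample negatives until one violates the
    margin; needing k draws gives the rank estimate (F - 1) // k."""
    global M
    for k in range(1, F):
        ybar = int(rng.integers(F))
        if ybar == y or gamma + s(x, ybar) - s(x, y) <= 0:
            continue
        weight = sum(1.0 / j for j in range(1, (F - 1) // k + 1))  # L(rank)
        Mx, diff = M @ x, Y[ybar] - Y[y]
        Y[ybar] -= lr * weight * Mx       # push the violating label away
        Y[y]    += lr * weight * Mx       # pull the gold label closer
        M       -= lr * weight * np.outer(diff, x)
        return

warp_step(rng.normal(size=d), y=2)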

Joint-space Model (Wsabie) — Classification

Joint-Space R^m

• Project candidate into joint space

• Only consider label projections

• Restrict labels using lexicon

• Classify candidate using a suitable distance metric (see the sketch below)
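A minimal sketch of prediction, with the dot product standing in for the distance metric and a toy lexicon:

import numpy as np

rng = np.random.default_rng(1)
d, m, F = 16, 8, 5
M = rng.normal(size=(m, d))     # learned mapping (toy values here)
Y = rng.normal(size=(F, m))     # learned label representations

def predict(x, lemma, lexicon):
    z = M @ x                               # project candidate into R^m
    allowed = lexicon.get(lemma, range(F))  # lexicon restricts the label set
    return max(allowed, key=lambda f: Y[f] @ z)

lexicon = {"sell.V": [0, 3]}                # toy lexicon entry
print(predict(rng.normal(size=d), "sell.V", lexicon))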

Experiments

Learning Setup

• Neural language model (∼ Bengio et al., 2003) trained on over 100 billion tokens to learn 128-dimensional word embeddings

• FrameNet 1.5 and OntoNotes 4.0 (PropBank, WSJ-only) used for training the actual Wsabie models

• Hyperparameters optimised on development data

Baselines: Where is the power coming from?

Models Evaluated

• Log-Linear Words

• Log-Linear Embeddings

• Wsabie


Evaluation

Evaluation Settings

• We evaluate on FrameNet (here) and PropBank (see paper)

• FrameNet setup follows Das et al. (2014), with a restricted lexicon during training (Semafor)

• Multiple evaluation settings (construction sketched below):
  All: evaluates all frames
  Rare: predicates with frequency ≤ 11 in the training data
  Unseen: predicates not observed in the training data
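An illustrative construction of the three evaluation subsets from training-predicate counts (types and data are toy; the threshold is the one above):

from collections import Counter, namedtuple

Instance = namedtuple("Instance", "predicate frame")

def eval_subsets(train_predicates, test_instances):
    counts = Counter(train_predicates)
    return {
        "All": test_instances,
        "Rare": [x for x in test_instances if counts[x.predicate] <= 11],
        "Unseen": [x for x in test_instances if counts[x.predicate] == 0],
    }

train = ["sell.V"] * 20 + ["barter.V"] * 3
test = [Instance("sell.V", "Commerce_sell"),
        Instance("barter.V", "Exchange"),
        Instance("swap.V", "Exchange")]
print({k: len(v) for k, v in eval_subsets(train, test).items()})
# {'All': 3, 'Rare': 2, 'Unseen': 1}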

Frame Identification Results (FrameNet - All Predicates)

FrameNet (Semafor)

[Bar chart, F1-score: Das supervised 82.97, Das best 83.60, LL-Embeddings 83.94, LL-Words 84.53, Wsabie 86.49]

Figure: Frame identification results on the FrameNet dataset (all predicates). We restrict the training data to the Semafor lexicon for comparability with Das et al. (2014).

Frame Identification Results (FrameNet - Rare)

FrameNet (Semafor)

[Bar chart, F1-score: Das supervised 80.97, Das best 82.31, LL-Embeddings 81.03, LL-Words 81.65, Wsabie 85.22]

Figure: Frame identification results on the FrameNet dataset (rare predicates). We restrict the training data to the Semafor lexicon for comparability with Das et al. (2014).

Frame Identification Results (FrameNet - Unseen)

FrameNet (Semafor)

[Bar chart, F1-score: Das supervised 23.08, Das best 42.67, LL-Embeddings 27.97, LL-Words 27.27, Wsabie 46.15]

Figure: Frame identification results on the FrameNet dataset (unseen predicates). We restrict the training data to the Semafor lexicon for comparability with Das et al. (2014).

Full pipeline

Full frame-semantic analysis

In addition to the frame identification experiments, we also use our models as part of a full frame-semantic parsing setup together with a standard argument identification method.

Argument Identification System

• Standard set of discrete features

• Local log-linear model

• Global inference via hard constraints and ILP (see the sketch below)
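A minimal sketch of the global-inference step, using SciPy's MILP solver as an illustrative stand-in for the paper's ILP formulation; roles, spans, and scores are toy values in place of the local log-linear model:

import numpy as np
from scipy.optimize import milp, LinearConstraint, Bounds

# Toy problem: assign candidate spans to roles of one frame.
roles = ["Seller", "Buyer", "Goods"]
spans = [(0, 1), (2, 3), (3, 5)]          # half-open token ranges
score = np.array([[2.0, 0.1, 0.3],        # score[role][span] from the
                  [0.2, 1.5, 0.4],        # local model (toy numbers)
                  [0.1, 0.3, 1.8]])
R, S = score.shape
n_tokens = 5

rows = []
for r in range(R):                        # each role fills at most one span
    row = np.zeros(R * S)
    row[r * S:(r + 1) * S] = 1
    rows.append(row)
for t in range(n_tokens):                 # each token in at most one argument
    row = np.zeros(R * S)
    for r in range(R):
        for i, (a, b) in enumerate(spans):
            if a <= t < b:
                row[r * S + i] = 1
    rows.append(row)

res = milp(c=-score.ravel(),              # maximise the total score
           constraints=LinearConstraint(np.array(rows), ub=1),
           integrality=np.ones(R * S), bounds=Bounds(0, 1))
z = res.x.reshape(R, S).round().astype(bool)
print([(roles[r], spans[i]) for r in range(R) for i in range(S) if z[r, i]])
# [('Seller', (0, 1)), ('Buyer', (2, 3)), ('Goods', (3, 5))]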

Full pipeline results (FrameNet)

FrameNet (Semafor)

[Bar chart, F1-score: Das supervised 64.05, Das best 64.54, Log-Linear Words 67.06, Wsabie 68.69]

Figure: Full frame-structure prediction results for the FrameNet dataset (Semafor).

Conclusion

• Novel approach to frame identification

• Model outperforms prior state of the art on frame identification

• In a pipeline setting with a standard argument identification system, the model also sets a new state of the art on various semantic parsing tasks

• General approach, could easily be extended to alternative frame-semantic parsing frameworks

The End

Thank you! Questions?

Frame Identification Results

Model                         |  All  | Ambiguous | Rare  | Unseen
------------------------------|-------|-----------|-------|-------
Das et al. (2014), supervised | 82.97 |   69.27   | 80.97 | 23.08
Das et al. (2014), best       | 83.60 |   69.19   | 82.31 | 42.67
Log-Linear Words              | 84.53 |   70.55   | 81.65 | 27.27
Log-Linear Embeddings         | 83.94 |   70.27   | 81.03 | 27.97
Wsabie Embedding              | 86.49 |   73.39   | 85.22 | 46.15

Table: Frame identification results on the FrameNet test data (Semafor lexicon).

Frame Identification Results

Model                 |  All  | Ambiguous | Rare
----------------------|-------|-----------|------
Log-Linear Words      | 87.33 |   70.55   | 87.19
Log-Linear Embeddings | 86.94 |   70.26   | 86.56
Wsabie Embedding      | 88.41 |   73.10   | 88.93

Table: Frame identification results on the FrameNet test data (full lexicon).

Full Structure Prediction Results

Model                         | Precision | Recall |  F1
------------------------------|-----------|--------|------
Das et al. (2014), supervised |   67.81   | 60.68  | 64.05
Das et al. (2014), best       |   68.33   | 61.14  | 64.54
Log-Linear Words              |   71.21   | 63.37  | 67.06
Wsabie Embedding              |   73.00   | 64.87  | 68.69

Table: Full structure prediction results for the FrameNet test data (Semafor lexicon). We compare to the prior state of the art (Das et al., 2014).

Full Structure Prediction Results

Model            | Precision | Recall |  F1
-----------------|-----------|--------|------
Log-Linear Words |   73.31   | 65.20  | 69.01
Wsabie Embedding |   74.29   | 66.02  | 69.91

Table: Full structure prediction results for the FrameNet test data (full lexicon).

Argument-Only Results (CoNLL 2005 task)

PropBank

[Bar chart, F1-score: Collins 73.62, Charniak 76.29, Combined 78.69, Log-Linear Words 77.23, Wsabie 77.14]

Figure: Argument-only evaluation (argument identification metrics) using the CoNLL 2005 shared task evaluation script (Carreras and Màrquez, 2005). Results from Punyakanok et al. (2008) are taken from Table 11 of that paper.

Frame Identification Results (PropBank)

PropBank

[Bar chart, F1-score: Log-Linear Embeddings 94.04, Log-Linear Words 94.74, Wsabie 94.56]

Figure: Frame identification results on the PropBank dataset.

Full pipeline results (PropBank)

PropBank

[Bar chart, F1-score: Log-Linear Words 79.65, Wsabie 79.61]

Figure: Full frame-structure prediction results for the PropBank dataset.
