
HITS’ Cross-lingual Entity Linking System at TAC 2011

One Model for All Languages

Angela Fahrni, Michael Strube, Vivi Nastase

Heidelberg Institute for Theoretical Studies gGmbH, Heidelberg, Germany


Strategies for Cross-lingual Entity Disambiguation

Knowledge base

Mapping between the English and Chinese Wikipedia Versions

Coverage after Lexicon Lookup

Training Set   Coverage   Average Ambiguity
EN             0.928      16.92
ZH             0.867       5.56
ZH_Lex         0.930      22.86
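To make the trade-off in the table concrete, here is a minimal Python sketch of lexicon-augmented candidate lookup, under the assumption that the ZH_Lex condition simply unions additional lexicon entries into the candidate set; the dictionaries and the example term are illustrative, not system data.

```python
# Sketch of the ZH_Lex condition: merging an extra lexicon into candidate
# lookup raises coverage (0.867 -> 0.930 on the training set) but also
# raises the average ambiguity (5.56 -> 22.86 candidates per term).
base = {"海德堡": {"Heidelberg"}}
lexicon = {"海德堡": {"Heidelberg", "Heidelberg_University"}}  # toy entries

def candidates(term):
    """Union of base candidates and lexicon-derived candidates."""
    return base.get(term, set()) | lexicon.get(term, set())

print(candidates("海德堡"))  # {'Heidelberg', 'Heidelberg_University'}
```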


Different Techniques for Different Languages


Different Models for Different Languages


One Model for All Languages


Context Disambiguation Approach

• Training of a supervised model (SVM) using English training instances
• Training instances are derived from the internal hyperlinks in Wikipedia
  • Positive instances: linked Wikipedia articles
  • Negative instances: other Wikipedia articles the terms can refer to
• The model is used for both English and Chinese (see the sketch below)
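A minimal, self-contained sketch of this training regime using scikit-learn; `candidates`, `toy_features`, and `make_instances` are hypothetical names, and the feature values are placeholders rather than the system's real features.

```python
# Sketch: one Wikipedia hyperlink ("Java" linked to
# Java_(programming_language)) yields one positive instance; the other
# articles "Java" can refer to yield negative instances.
from sklearn.svm import SVC

candidates = {
    "Java": ["Java_(programming_language)", "Java_(island)", "Java_(coffee)"],
}

def toy_features(term, article):
    # Placeholder for the real language-independent features.
    return [len(article) % 7 / 7.0, hash((term, article)) % 100 / 100.0]

def make_instances(term, gold_article):
    """Positive instance for the linked article, negatives for the rest."""
    X, y = [], []
    for article in candidates[term]:
        X.append(toy_features(term, article))
        y.append(1 if article == gold_article else 0)
    return X, y

X, y = make_instances("Java", "Java_(programming_language)")
model = SVC().fit(X, y)            # trained on English instances only
print(model.decision_function(X))  # the same model scores EN and ZH input
```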


Entity Disambiguation


Entity Disambiguation Approach

• Training instances are derived from the training data provided by TAC
  • Positive instances: correct entity for the query terms
  • Negative instances: other candidate entities for the query terms
• Training of a supervised model (SVM) based on
  • English and Chinese training instances
  • Chinese training instances
  • English training instances
• Application of the models to both languages (see the sketch below)
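A hedged sketch of the pooled EN+ZH variant: one SVM trained on instances from both languages and used to rank a query's candidate entities by margin. All feature values are placeholders.

```python
# Sketch: one SVM trained on pooled English + Chinese TAC instances and
# applied to queries in either language.
from sklearn.svm import LinearSVC

train_X = [[0.9, 0.8], [0.1, 0.3],   # instances from English queries
           [0.7, 0.9], [0.2, 0.1]]   # instances from Chinese queries
train_y = [1, 0, 1, 0]               # 1 = correct entity for the query
model = LinearSVC().fit(train_X, train_y)

def best_candidate(candidate_feats):
    """Pick the candidate entity with the largest SVM margin."""
    scores = model.decision_function(candidate_feats)
    return max(range(len(scores)), key=scores.__getitem__)

print(best_candidate([[0.8, 0.7], [0.3, 0.2]]))  # -> 0
```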


One-Model-for-All-Languages: Features

• Abstraction from particular languages, i.e. no lexical features
• Relatedness measures (one possible instance is sketched below)
• Concept-based features
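The slides do not spell out which relatedness measures are used; as one plausible instance, here is a link-based measure in the Milne & Witten style, computed over incoming-link sets.

```python
# Sketch of a link-based relatedness measure (Milne & Witten style),
# one possible instance of the slide's "relatedness measures".
from math import log

def relatedness(in_links_a, in_links_b, n_articles):
    """1 minus a normalised distance over shared incoming links."""
    a, b = set(in_links_a), set(in_links_b)
    shared = a & b
    if not shared:
        return 0.0
    dist = (log(max(len(a), len(b))) - log(len(shared))) / \
           (log(n_articles) - log(min(len(a), len(b))))
    return max(0.0, 1.0 - dist)

print(relatedness({"A", "B", "C"}, {"B", "C", "D"}, n_articles=4_000_000))
```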


One-Model-for-All-Languages: Features

• Prior probability
• Distance between the term and the name of the respective Wikipedia article
• Context fit on the conceptual level
  • Vector-based features based on concepts, categories, lists, portals
  • Average, maximum and minimum derived from pairwise calculated relatedness measures
  • Relatedness measures based on outgoing links, incoming links, categories
• Context fit on the token level (see the sketch below): cosine similarity based on nouns, verbs, adjectives between
  • the whole text and the respective Wikipedia article
  • the surrounding context and the local contexts of hyperlinks pointing to the respective page in Wikipedia
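A minimal sketch of the token-level context-fit feature, with tokenisation and POS filtering stubbed out; the word lists are toy data.

```python
# Sketch: token-level context fit as cosine similarity between bags of
# content words (restriction to nouns/verbs/adjectives omitted here).
from collections import Counter
from math import sqrt

def cosine(a, b):
    num = sum(a[t] * b[t] for t in a.keys() & b.keys())
    den = sqrt(sum(v * v for v in a.values())) * \
          sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

doc = Counter("bank river water flow bank".split())
article = Counter("bank money account loan bank".split())
print(cosine(doc, article))  # overlap only on "bank"
```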


Results (Micro-Average): Different Training Approaches

                      Micro-Average
HITS_Trained_EN_ZH    0.785
HITS_Trained_SEP      0.789
HITS_Trained_ZH       0.786


Language-independent Concept-based Representation

Graph


Clustering


Learning the Edge Weights

• Binary classifier (SVM) with the classes
  • In the same cluster
  • Not in the same cluster
• Confidence values as edge weights (see the sketch below)
• Features: cosine similarities between local contexts and whole texts based on
  • Identified concepts
  • Identified concepts extended by incoming and outgoing links
  • Categories associated with the concepts
  • Lists associated with the concepts
  • Portals associated with the concepts
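A sketch of how classifier confidences could become edge weights, assuming scikit-learn; the logistic squashing of the margin stands in for a proper confidence estimate (e.g. Platt scaling), and the pairwise features are placeholders.

```python
# Sketch: a binary "same cluster?" SVM over mention pairs; its squashed
# margin becomes the weight of the edge between the two mentions.
from math import exp
from sklearn.svm import SVC

pair_X = [[0.9, 0.8], [0.8, 0.7], [0.2, 0.1], [0.1, 0.3]]  # pairwise cosines
pair_y = [1, 1, 0, 0]                                      # 1 = same cluster
clf = SVC().fit(pair_X, pair_y)

def edge_weight(feats):
    """Squash the SVM margin into (0, 1); stand-in for Platt scaling."""
    margin = clf.decision_function([feats])[0]
    return 1.0 / (1.0 + exp(-margin))

graph_edges = {("mention_1", "mention_2"): edge_weight([0.85, 0.75])}
print(graph_edges)
```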


Shortcomings of the Graph-based Clustering Approach

• Construction of the graph is inefficient
  • Pairwise comparisons using different similarity measures
• Little data to optimize the parameters
• Temporary solution: string match heuristic (see the sketch below)
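A sketch of the string-match fallback: NIL queries with the same normalised surface form end up in one cluster, so no pairwise graph is needed. The exact normalisation is an assumption here.

```python
# Sketch: string-match heuristic for clustering NIL queries. Grouping by
# normalised surface form needs one pass, not pairwise comparisons.
from collections import defaultdict

def string_match_clusters(queries):
    clusters = defaultdict(list)
    for query_id, surface in queries:
        clusters[surface.casefold().strip()].append(query_id)
    return list(clusters.values())

print(string_match_clusters([("Q1", "ABC Corp"),
                             ("Q2", "abc corp"),
                             ("Q3", "XYZ Inc")]))
# -> [['Q1', 'Q2'], ['Q3']]
```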


Results

               Micro-Average   Precision (B3)   Recall (B3)   F1 (B3)
Best System    0.788           –                –             –
Median         0.675           –                –             –
HITS           0.785           0.700            0.763         0.730


Future Work

• Experiments with other languages
  • We participated in the cross-lingual link discovery task organized by NTCIR: good results for all subtasks (EN-to-Chinese, EN-to-Korean, EN-to-Japanese)
• Efficient way to cluster terms: circumventing pairwise comparisons


Thank you!

Angela Fahrni
angela.fahrni@h-its.org
http://www.h-its.org
