35
SPATIO-TEMPORAL VERACITY ASSESSMENT LABORATOIRE DE RECHERCHE EN INFORMATIQUE (LRI) {MALAVERRI, FATIHA.SAIS,GIANLUCA.QUERCINI}@LRI.FR JOURNEE ROD JOANA GONZALES MALAVERRI (LAHDAK) FATIHA SAÏS (LAHDAK) GIANLUCA QUERCINI (MODHEL)

SPATIO-TEMPORAL VERACITY ASSESSMENT

  • Upload
    others

  • View
    9

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SPATIO-TEMPORAL VERACITY ASSESSMENT

SPATIO-TEMPORAL VERACITY ASSESSMENT

LABORATOIRE DE RECHERCHE EN INFORMATIQUE (LRI)

{MALAVERRI, FATIHA.SAIS,GIANLUCA.QUERCINI}@LRI.FR

JOURNEE ROD

JOANA GONZALES MALAVERRI (LAHDAK)

FATIHA SAÏS (LAHDAK)

GIANLUCA QUERCINI (MODHEL)

Page 2: SPATIO-TEMPORAL VERACITY ASSESSMENT

MOTIVATION

2

Extracted from https://goo.gl/B2i5aG

facts

facts

facts

facts

facts

knowledgeBase

Page 3: SPATIO-TEMPORAL VERACITY ASSESSMENT

REAL-WORLD SCENARIO: WHAT IS THE BARACK OBAMA’S CITIZENSHIP

3

American

Kenyan

Indonesian

British

I don’t know! But the probability that all errors in the certificate were inadvertent is 1 in 75 quadrillion.

https://goo.gl/5YTBxK

Page 4: SPATIO-TEMPORAL VERACITY ASSESSMENT

4

Extracted from https://goo.gl/B2i5aG

facts

facts

facts

facts

facts

knowledgeBase

Page 5: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT APPROACHES

5

• Majority voting

• Basic approaches

• Extended approaches

• Extra knowledge (ontologies) • Source dependency detection

[Beretta et al. 16]

Page 6: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT

6

What is the nationality of those US presidents?

d1. Edward Dickinson Baker

d2. Barack Obama

d3. Bill Clinton

British Kenyan

American

American American

American

American

British

American

USA USA

S1S2

S3

S4

d1:nat

S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

d2:nat d3:nat

American

USA

USA

Page 7: SPATIO-TEMPORAL VERACITY ASSESSMENT

American

USA

USA

VERACITY ASSESSMENT

7

What is the nationality of those US presidents?

d1. Edward Dickinson Baker

d2. Barack Obama

d3. Bill Clinton

British Kenyan

American

American American

American

American

British

AmericanS1

S2

S3

S4

S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

Source reliability

Fact confidence

+

d1:nat d2:nat d3:nat

Page 8: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT APPROACHES: LIMITATIONS

8

Single-Truth

S1: <d1:nat, British> is also true. S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

d1:nat d2:nat d3:nat

Page 9: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT APPROACHES: LIMITATIONS

9

Single-Truth

S1: <d1:nat, British> is also true.

• No contextual information

S1: <d1:nat, British> is true in the temporal context [1811-1816]

S1: <d1:birthdate, 1811>

S1: <d1:birthPlace, London>

S1: <d1:immigrationDate, 1816>

S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

d1:nat d2:nat d3:nat

Page 10: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT APPROACHES: LIMITATIONS

10

Single-Truth

S1: <d1:nat, British> is also true.

• No contextual information

S1: <d1:nat, British> is true in the temporal context [1811-1816]

S1: <d1:birthdate, 1811>

S1: <d1:birthPlace, London>

S1: <d1:immigrationDate, 1816>

• Some sources mainly created from data extracted from Wikipedia

How reliable S1: <d1:nat, British> is?

S1: <d1:birthPlace, London>

S1: <d1:immigrationDate, 1816>

S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

d1:nat d2:nat d3:nat

Page 11: SPATIO-TEMPORAL VERACITY ASSESSMENT

VERACITY ASSESSMENT APPROACHES: LIMITATIONS

11

• No explanations

Why S1: <d1:nat, British> is true?

S1: <d1:birthPlace, London> S1 <British, Kenyan, American>

S2 <American, British, American>

S3 <American, American, American>

S4 <USA, USA, American>

d1:nat d2:nat d3:nat

Page 12: SPATIO-TEMPORAL VERACITY ASSESSMENT

GOAL

Build an approach to assess the veracity of facts taken from knowledge bases based on spatio-temporal information.

12

Page 13: SPATIO-TEMPORAL VERACITY ASSESSMENT

YAGO KNOWLEDGE BASE General purpose semantic knowledge

base (KB)

Integrates information extracted from Wikipedia infoboxes, WordNet, and GeoNames

> 10 million entities (persons, cities, organizations),

> 120 million facts about these entities

Attaches temporal and spatial dimensions to many of its facts and entities – meta facts.

13

Yago structure

Page 14: SPATIO-TEMPORAL VERACITY ASSESSMENT

APPROACH:RULE BASED TEMPORAL META FACTS GENERATION

14

Page 15: SPATIO-TEMPORAL VERACITY ASSESSMENT

RULE BASED TEMPORAL META FACTS GENERATION Focus on facts that may change over time:

Brad Pitt acted in the Fight Club in 1999 the Curious Case of Benjamin Button in 2008

Paul McCartney was/is married with Heather Mills from 2002 to 2008 Nancy Shevell since 2011

15

Page 16: SPATIO-TEMPORAL VERACITY ASSESSMENT

BASIC ALGORITHM TO INFER META FACTS

16

Page 17: SPATIO-TEMPORAL VERACITY ASSESSMENT

TIMESTAMP GENERATION RULE

17

Page 18: SPATIO-TEMPORAL VERACITY ASSESSMENT

TIMESTAMP GENERATION RULE

18

Page 19: SPATIO-TEMPORAL VERACITY ASSESSMENT

TIMESTAMP GENERATION RULE

19

Page 20: SPATIO-TEMPORAL VERACITY ASSESSMENT

TIMESTAMP GENERATION RULE

20

Page 21: SPATIO-TEMPORAL VERACITY ASSESSMENT

TEMPORAL INTERVAL RULE

21

Page 22: SPATIO-TEMPORAL VERACITY ASSESSMENT

TEMPORAL INTERVAL RULE

22

Page 23: SPATIO-TEMPORAL VERACITY ASSESSMENT

23

TEMPORAL INTERVAL RULE

validAfter

Page 24: SPATIO-TEMPORAL VERACITY ASSESSMENT

24

TEMPORAL INTERVAL RULE

Page 25: SPATIO-TEMPORAL VERACITY ASSESSMENT

25

TEMPORAL INTERVAL RULE

Page 26: SPATIO-TEMPORAL VERACITY ASSESSMENT

26

TEMPORAL INTERVAL RULE

Page 27: SPATIO-TEMPORAL VERACITY ASSESSMENT

CASE STUDY:MOVIE DOMAIN (YAGO)

27

Page 28: SPATIO-TEMPORAL VERACITY ASSESSMENT

Yago:

# of films: 151427

# of actors in Yago: 47800

# of release dates available: 136234

MOVIE DOMAIN (YAGO)

28

Page 29: SPATIO-TEMPORAL VERACITY ASSESSMENT

RESULTS: actedIn

29

Page 30: SPATIO-TEMPORAL VERACITY ASSESSMENT

RESULTS: actedIn

30- 8 MFs not inferred because some facts don’t have release date associated.

- 23 share the same information (Yago MFs & IMDb)- Results vs Yago MFs & IMDb:

- 16 have the same IMDb information

Notice: Yago MFs are more accurate.

Page 31: SPATIO-TEMPORAL VERACITY ASSESSMENT

RESULTS: actedIn

- 8 MFs not inferred because some facts don’t have release date associated.

- 23 share the same information (Yago MFs & IMDb)- Results vs Yago MFs & IMDb:

- 16 have the same IMDb information

Notice: Yago MFs are more accurate.

31

Page 32: SPATIO-TEMPORAL VERACITY ASSESSMENT

RESULTS: wroteMusicFor

32

Page 33: SPATIO-TEMPORAL VERACITY ASSESSMENT

RESULTS: wroteMusicFor

- 118 MFs not inferred because some facts don’t have release date associated.

- 3 share the same information (Yago MFs & IMDb)

- Results vs Yago MFs & IMDb:

33

Page 34: SPATIO-TEMPORAL VERACITY ASSESSMENT

ONGOING WORK:

Qualitative evaluation of the meta facts inferred.

Creating new set of rules.

Extend the approach to reason more globally on the whole graph while inferring meta facts.

Spatial reasoning.

E.g.: Film release dates are associated to specific locations (country, city).

(Semi-)automatic approach for rule generation.

FUTURE WORK:

34

Page 35: SPATIO-TEMPORAL VERACITY ASSESSMENT

MERCI !

35