1
Soft Cardinality + ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment Sergio Jimenez Claudia Becerra Alexander Gelbukh Task Results Soft Cardinality Conclusions A= , , B= , , |A| =3 |B|=3 Classical cardinality crisp count Soft cardinality soft count | A|’=2 .9 |B|’=1.3 Sets of Features n i n j p j i a a a sim w A i 1 1 ' , 1 Text A Spanish + English A t Transla tion Text A Spanish Text B English + Spanish B t Transla tion Text B English Translat e Lemmatizer -Stop- words Gold standard Cardinality features A t B t B A SVM classifier forwar d backwa rd no entailment bidirecti onal )) ( ), ( max( ) , ( _ 1 ) , ( b len a len b a dist edit b a sim t A B B A t SEMEVAL 2012 OFFICIAL RESULTS (accuracy) FEATURES spa- eng ita- eng fra- eng deu- eng AVERAG E n2 2nd.HDU.ru n1 0.630 0.554 0.564 0.558 0.577 3rd.Softca rd 0.552 0.566 0.570 0.550 0.560 FEATURES spa- eng ita- eng fra- eng deu- eng AVERAG E Sym.simSco res 0.404 0.410 0.410 0.410 0.409 Asym.LCS.s im 0.490 0.492 0.482 0.474 0.485 Classic.ca rd 0.560 0.534 0.570 0.542 0.552 SimScores 0.600 0.562 0.568 0.572 0.576 Classic.ca rd.w 0.584 0.576 0.588 0.590 0.585 Soft.card. w 0.598 0.602 0.624 0.604 0.607 • Cardinalities as features performs better than similarity scores. • Soft cardinality performs better than classical cardinality. • Soft cardinality approach obtained better results than the best official Given a pair of topically related text fragments (T1 and T2) in different languages, the CLTE task consists of automatically annotating it with one of the following entailment judgments: Bidirectional (T1 ->T2 & T1 <- T2): the two fragments entail each other (semantic equivalence) Forward (T1 -> T2 & T1 !<- T2): unidirectional entailment from T1 to T2 Backward (T1 !-> T2 & T1 <- T2): unidirectional entailment from T2 to T1 No Entailment (T1 !-> T2 & T1 !<- T2): there is no entailment between T1 and T2 Sym.simScores: scores of the following symmetric similarity functions: Jaccard, Dice, and cosine coefficients using classical cardinality and soft cardinality (edit-distance as auxiliar sim. function). In addition, cosine similarity, softTFIDF (Cohen et al., 2003) and edit distance (total 18 features). Asym.LCS.sim: scores of the following asymmetric similarity functions: sim(T1; T2) = lcs(T1;T2)=len(T1) and sim(T1; T2) = lcs(T1;T2)=len(T2) at character level (4 features). Classic.card: cardinalities using classical set cardinality (12 features). SimScores: combined features sets from Sym.SimScores, Asym.LCS.sim and the generalized Monge-Elkan measure (Jimenez et al., 2009) using p = 1; 2; 3 (30 features). Classic.card.w: Same as Classic.card but using idf weights. Soft.card.w: soft cardinality using idf weights as described in Section 2.3 using p = 1; 2; 3; 4; 5 (60 features).

Soft Cardinality + ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

Embed Size (px)

Citation preview

Page 1: Soft Cardinality + ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

Soft Cardinality + ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

Sergio Jimenez Claudia Becerra Alexander Gelbukh

Task

Results

Soft Cardinality

Conclusions

A=, ,

B=, ,

|A|=3

|B|=3

Classical cardinality crisp count

Soft cardinality soft count

|A|’=2.9

|B|’=1.3

Sets of Features

n

in

j

pji

a

aasimwA

i1

1

'

,

1

Text ASpanish +

English At

Translation

Text ASpanish +

English At

Translation

Text ASpanish

Text ASpanish

Text BEnglish +

Spanish Bt

Translation

Text BEnglish +

Spanish Bt

Translation

Text BEnglish

Text BEnglish

Translate

Lemmatizer -Stop-words

Goldstandard

Cardinality features

A tB tBA

SVM classifier

forward

backward

no entailment

bidirectional

))(),(max(),(_

1),(blenalenbadistedit

basim

tA B BAt

SEMEVAL 2012 OFFICIAL RESULTS (accuracy)FEATURES spa-eng ita-eng fra-eng deu-eng AVERAGE

1st.HDU.run2 0.632 0.562 0.570 0.552 0.5792nd.HDU.run1 0.630 0.554 0.564 0.558 0.5773rd.Softcard 0.552 0.566 0.570 0.550 0.560

FEATURES spa-eng ita-eng fra-eng deu-eng AVERAGESym.simScores 0.404 0.410 0.410 0.410 0.409Asym.LCS.sim 0.490 0.492 0.482 0.474 0.485Classic.card 0.560 0.534 0.570 0.542 0.552SimScores 0.600 0.562 0.568 0.572 0.576Classic.card.w 0.584 0.576 0.588 0.590 0.585Soft.card.w 0.598 0.602 0.624 0.604 0.607

• Cardinalities as features performs better than similarity scores.

• Soft cardinality performs better than classical cardinality.• Soft cardinality approach obtained better results than the

best official SemEval result (after debugging).

Given a pair of topically related text fragments (T1 and T2) in different languages, the CLTE task consists of automatically annotating it with one of the following entailment judgments:

• Bidirectional (T1 ->T2 & T1 <- T2): the two fragments entail each other (semantic equivalence)

• Forward (T1 -> T2 & T1 !<- T2): unidirectional entailment from T1 to T2• Backward (T1 !-> T2 & T1 <- T2): unidirectional entailment from T2 to T1• No Entailment (T1 !-> T2 & T1 !<- T2): there is no entailment between T1 and T2

Sym.simScores: scores of the following symmetric similarity functions: Jaccard, Dice, and cosine coefficients using classical cardinality and soft cardinality (edit-distance as auxiliar sim. function). In addition, cosine similarity, softTFIDF (Cohen et al., 2003) and edit distance (total 18 features).

Asym.LCS.sim: scores of the following asymmetric similarity functions: sim(T1; T2) = lcs(T1;T2)=len(T1) and sim(T1; T2) = lcs(T1;T2)=len(T2) at character level (4 features).

Classic.card: cardinalities using classical set cardinality (12 features).

SimScores: combined features sets from Sym.SimScores, Asym.LCS.sim and the generalized Monge-Elkan measure (Jimenez et al., 2009) using p = 1; 2; 3 (30 features).

Classic.card.w: Same as Classic.card but using idf weights.

Soft.card.w: soft cardinality using idf weights as described in Section 2.3 using p = 1; 2; 3; 4; 5 (60 features).