24
Poster at Session P2 A Three-Way Model for Collective Learning on Multi-Relational Data 28th International Conference on Machine Learning Maximilian Nickel 1 Volker Tresp 2 Hans-Peter Kriegel 1 1 Ludwig-Maximilians Universität, Munich 2 Siemens AG, Corporate Technology, Munich June 30th, 2011 Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 1 / 17

A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

A Three-Way Model for Collective Learning onMulti-Relational Data

28th International Conference on Machine Learning

Maximilian Nickel1 Volker Tresp2 Hans-Peter Kriegel1

1Ludwig-Maximilians Universität, Munich

2Siemens AG, Corporate Technology, Munich

June 30th, 2011

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 1 / 17

Page 2: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

Outline

1 Introduction

2 RESCAL

3 Experiments

4 Summary

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 2 / 17

Page 3: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

IntroductionMulti-Relational Data

Multi-relational data is a part of many different important fields ofapplication, such as Computational Biology, Social Networks, theSemantic Web, the Linked Data cloud (shown below) and many more

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 3 / 17

Page 4: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

IntroductionMotivation to use Tensors for Relational Learning

Why Tensors?Modelling simplicity: Multiple binary relations can be expressedstraightforwardly as a three-way tensorNo structure learning: Not necessary to have information aboutindependent variables, knowledge bases, etc. or to infer it from dataExpected performance: Relational domains are high-dimensionaland sparse, a setting where factorization methods have shown verygood results

Problem: Tensor factorizations like CANDECOMP/PARAFAC (CP) orTucker can not perform collective learning or in the case of DEDICOMhave unreasonable constraints for relational learning.

(For an excellent review on tensors see (Kolda and Bader, 2009))

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 4 / 17

Page 5: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

IntroductionModelling and Terminology

Modelling binary relations as a tensor:Two modes of a tensor refer to theentities, one mode to the relations.The entries of the tensor are 1 whena relation between two entities existsand 0 otherwiseWe use the RDF formalism to modelrelations as (subject, predicate,object) triples

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 5 / 17

Page 6: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

Outline

1 Introduction

2 RESCAL

3 Experiments

4 Summary

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 6 / 17

Page 7: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALTensor Factorizaion

RESCAL takes the inherent structure of dyadic relational data intoaccount, by employing the tensor factorization

Xk ≈ ARkAT

A is a n × r matrix, representing the global entity-latent-componentspaceRk is an asymmetric r × r matrix that specifies the interaction of thelatent components per predicate

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 7 / 17

Page 8: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALTensor Factorizaion

RESCAL takes the inherent structure of dyadic relational data intoaccount, by employing the tensor factorization

Xk ≈ ARkAT

A is a n × r matrix, representing the global entity-latent-componentspaceRk is an asymmetric r × r matrix that specifies the interaction of thelatent components per predicate

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 7 / 17

Page 9: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALSolving canonical relational learning tasks

Link Prediction: To predict the existence of a relation between twoentities, it is sufficient to look at the rank-reduced reconstruction ofthe appropriate slice ARkAT

Collective Classification: Can be cast as a link prediction problemby including the classes as entities and adding a classOf relation.Alternatively, standard classification algorithms could be applied tothe entites’ latent-component representation ALink-based Clustering: Since the entities latent-componentrepresentation is computed considering all relations, Link-basedclustering can be done by clustering the entities in thelatent-component space A

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 8 / 17

Page 10: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALComputing the factorization

To compute the factorization, we solve the optimization problem

minA,Rk

loss(A,Rk) + reg(A,Rk)

where loss is the loss function

loss(A,Rk) = 12∑

k‖Xk −ARkAT‖2F

and reg is the regularization term

reg(A,Rk) = 12λ(‖A‖2F +

∑k‖Rk‖2F

)

Efficient alternating-least squares algorithm based on ASALSAN(Bader et al., 2007)Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 9 / 17

Page 11: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkaj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 12: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkaj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 13: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkaj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 14: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkaj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 15: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkaj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 16: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Bill

Party X

party

Al

vicePresidentOf

party

Johnparty

Lyndon

vicePresidentOf

party

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkbj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 17: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning Example

Predict party membership of US (vice) presidents

Billobject

Billsubject

Party X

party

AlvicePresidentOf party

Johnsubject

Johnobject

party

LyndonvicePresidentOfparty

Helpful to consider element-wise version of the loss function f

f (A,Rk) = 12∑i,j,k

(Xijk − aT

i Rkbj)2

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 10 / 17

Page 18: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

RESCALCollective Learning with RESCAL

Collective learning is performed via the entities’ latent-componentrepresentationImportant aspect of the model: Entities have a uniquelatent-component representation, regardless of their occurrence assubjects or objects

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 11 / 17

Page 19: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

Outline

1 Introduction

2 RESCAL

3 Experiments

4 Summary

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 12 / 17

Page 20: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

ExperimentsPredicting the party membership of US (vice) presidents

Task: Predict party membership of US (vice) presidentsNo other information included in the data other than the partymembership and who is (vice) president of whom

Random CP

DEDICOMSUNS

SUNS+AG

RESCAL0.0

0.2

0.4

0.6

0.8

1.0

AU

C0.16

0.44

0.64

0.48

0.74 0.78

Prediction of party membership

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 13 / 17

Page 21: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

ExperimentsComparison to state-of-the-art approaches

Task: Perform link prediction on the IRM datasets Kinships, UMLSand NationsComparison to MRC (Kok & Domingos, 2007), IRM (Kemp et al.,2007) and BCTF (Sutskever et al., 2009) as well as CP andDEDICOM

CP

DEDICOMBCTF

IRMMRC

RESCAL0.0

0.2

0.4

0.6

0.8

1.0

AU

C

0.94

0.69

0.90

0.66

0.850.95

Kinships

CP

DEDICOMBCTF

IRMMRC

RESCAL0.0

0.2

0.4

0.6

0.8

1.0

AU

C

0.95 0.95 0.98

0.70

0.98 0.98

UMLS

CP

DEDICOM IRMMRC

RESCAL0.0

0.2

0.4

0.6

0.8

1.0

AU

C

0.83 0.810.75 0.75

0.84

Nations

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 14 / 17

Page 22: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

ExperimentsRuntime and Implementation

RESCAL-ALS algorithm features very fast training times

Dataset Entities Relations Total Runtime in seconds

Rank 10 20 40

Kinships 104 26 1.1 3.7 51.2

Nations 125 57 1.7 5.3 54.4

UMLS 135 49 2.6 4.9 72.3

Cora 2497 7 364 348 680

Table: Average runtime to compute a rank-r factorization in RESCAL

Implementation uses only standard matrix operations

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 15 / 17

Page 23: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

Outline

1 Introduction

2 RESCAL

3 Experiments

4 Summary

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 16 / 17

Page 24: A Three-Way Model for Collective Learning on Multi-Relational …nickel/data/slides-icml2011.pdf · MaximilianNickel1 VolkerTresp2 Hans-PeterKriegel1 1Ludwig-Maximilians Universität,

Poster at Session P2

Summary

RESCAL is an tensor-based relational learning approach capable ofcollective learningCollective learning mechanism works through information propagationvia the entities’ latent-component representationsGood performance compared to current state-of-the-art relationallearning approachesFast training times and simple ImplementationCode available at http://www.cip.ifi.lmu.de/~nickel

Thank you!

Nickel, Tresp, Kriegel A Three-Way Model for Collective Learning June 30th, 2011 17 / 17