
Expectation-Propagation for the Generative Aspect Model

Tom Minka and John Lafferty

Carnegie Mellon University

Main points

• Aspect model is important and difficult

– PCA for discrete data

• How variational methods can fail

• Extensions of Expectation-Propagation

• Using EP for maximum likelihood

Generative aspect model

Each document mixes aspects in different proportions

[Figure: three documents, each mixing Aspect 1 and Aspect 2 in proportions $\lambda_1$ and $1-\lambda_1$, $\lambda_2$ and $1-\lambda_2$, $\lambda_3$ and $1-\lambda_3$]

(Hofmann 1999; Blei, Ng, & Jordan 2001)

First interpretation

[Figure: a document draws each word from Aspect 1 with probability $\lambda$ or from Aspect 2 with probability $1-\lambda$]

$p(\text{word } w) = \lambda\, p(w \mid a{=}1) + (1-\lambda)\, p(w \mid a{=}2)$

$p(\lambda) \sim \mathrm{Dirichlet}(\alpha_1, \ldots, \alpha_J)$

Multinomial sampling
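A minimal sketch of this first interpretation (the vocabulary, aspect distributions, and document length below are hypothetical, chosen only for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical aspects: row a is p(word | aspect a) over a 4-word vocabulary.
aspects = np.array([[0.7, 0.1, 0.1, 0.1],
                    [0.1, 0.1, 0.1, 0.7]])
alpha = np.ones(len(aspects))  # Dirichlet hyperparameters (uniform)

def generate_document(n_words):
    # Draw mixing proportions once per document, then sample every word
    # from the single mixed distribution p(w) = sum_a lambda_a p(w | a).
    lam = rng.dirichlet(alpha)
    word_probs = lam @ aspects
    return rng.choice(aspects.shape[1], size=n_words, p=word_probs)

print(generate_document(10))
```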

Second interpretation

[Figure: document $i$ contains Word 1, Word 2, Word 3; each word $j$ carries a hidden label $z_{ij}$ ("Aspect 1" or "Aspect 2") and is sampled from that aspect]

Sample from aspect
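The second interpretation makes the per-word aspect labels $z_{ij}$ explicit; a sketch under the same hypothetical aspects, equivalent in distribution to the block above:

```python
import numpy as np

rng = np.random.default_rng(0)
aspects = np.array([[0.7, 0.1, 0.1, 0.1],   # hypothetical p(word | aspect)
                    [0.1, 0.1, 0.1, 0.7]])
alpha = np.ones(len(aspects))

def generate_document_with_labels(n_words):
    # Per word: first pick an aspect z from lambda, then sample the word
    # from that aspect's own distribution.
    lam = rng.dirichlet(alpha)
    z = rng.choice(len(aspects), size=n_words, p=lam)
    words = np.array([rng.choice(aspects.shape[1], p=aspects[zi]) for zi in z])
    return words, z

print(generate_document_with_labels(10))
```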

Two tasks

Inference:

• Given aspects and document $i$, what is (the posterior for) $\lambda$?

Learning:

• Given some documents, what are (maximum likelihood) aspects?

Variational method

• Blei, Ng, & Jordan, NIPS’01

• Uses interpretation with explicit $z$ (the second one): latent variables $(\lambda, z)$

• Inference: factored variational distribution $q(\lambda)\,q(z)$

• Learning:

– E-step: infer $(\lambda, z)$ per document

– M-step: update aspects $p(w \mid a)$, aspect weights $\alpha$

Expectation Propagation

• Uses interpretation with $\lambda$ only (the first one)

• Inference: EP of Dirichlet posterior $q(\lambda)$

• Learning:

– E-step: infer $\lambda$ per document

– M-step: update aspects $p(w \mid a)$, aspect weights $\alpha$

• Fewer latent variables is better

Geometric interpretation

• Likelihood is composed of terms of the form

$t_w(\lambda) = p(w \mid \lambda)^{n_w} = \Big(\sum_a \lambda_a\, p(w \mid a)\Big)^{n_w}$

• Want Dirichlet approximation:

$\tilde{t}_w(\lambda) = \prod_a \lambda_a^{\beta_{aw}}$

Variational method

• Bound each term via Jensen's inequality:

$p(w \mid \lambda) = \sum_a \lambda_a\, p(w \mid a) \;\ge\; \mathrm{const} \times \prod_a \lambda_a^{\beta_{aw}}$

• Coupled equations:

$\beta_{aw} \propto \exp\!\big(\langle \log \lambda_a \rangle\big)\, p(w \mid a)$
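A sketch of these coupled updates for a single document. The Dirichlet update $\gamma_a = \alpha_a + \sum_w n_w \beta_{aw}$ is not spelled out on the slide but is the standard companion to the $\beta$ equation; all inputs below are hypothetical:

```python
import numpy as np
from scipy.special import digamma

def vb_posterior(n_w, p_w_given_a, alpha, iters=100):
    # Coupled fixed point: beta[a, w] ∝ exp(<log lambda_a>) * p(w | a),
    # where <log lambda_a> = digamma(gamma_a) - digamma(sum(gamma)) under
    # the induced Dirichlet posterior gamma_a = alpha_a + sum_w n_w beta[a, w].
    A, W = p_w_given_a.shape
    gamma = alpha + n_w.sum() / A            # uninformative start
    for _ in range(iters):
        elog = digamma(gamma) - digamma(gamma.sum())
        beta = np.exp(elog)[:, None] * p_w_given_a
        beta /= beta.sum(axis=0, keepdims=True)   # normalize over aspects
        gamma = alpha + beta @ n_w
    return gamma  # Dirichlet parameters of q(lambda)

p = np.array([[0.4, 0.6],     # hypothetical p(w | a) for two aspects
              [0.3, 0.7]])
print(vb_posterior(np.array([6, 4]), p, alpha=np.ones(2)))
```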

Moment matching

• Context function: all but one occurrence

$q^{\setminus w}(\lambda) = t_w(\lambda)^{\,n_w - 1} \prod_{w' \ne w} t_{w'}(\lambda)^{\,n_{w'}}$

• Moment match:

$t_w(\lambda)\, q^{\setminus w}(\lambda) \;\leftrightarrow\; \tilde{t}_w(\lambda)\, q^{\setminus w}(\lambda)$

• Fixed point equations for $\beta$
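A minimal EP sketch for the two-aspect case, where the Dirichlet reduces to a Beta over $\lambda$. The per-token cavity plays the role of the context function, and the tilted moments follow from standard Beta identities; the data and function names are hypothetical:

```python
import numpy as np

def ep_beta_posterior(tokens, p1, p2, a0=1.0, b0=1.0, sweeps=50):
    # tokens: word ids; p1[w] = p(w | aspect 1), p2[w] = p(w | aspect 2).
    # Exact term per token: t(lam) = lam * p1[w] + (1 - lam) * p2[w].
    # Approximate term per token: lam**e1[i] * (1 - lam)**e2[i], so the
    # overall approximation is q(lam) = Beta(a0 + sum e1, b0 + sum e2).
    n = len(tokens)
    e1, e2 = np.zeros(n), np.zeros(n)
    for _ in range(sweeps):
        for i, w in enumerate(tokens):
            # Cavity: q with token i's approximate term removed.
            a = a0 + e1.sum() - e1[i]
            b = b0 + e2.sum() - e2[i]
            # Moments of the tilted distribution t(lam) * Beta(lam; a, b):
            # a two-component mixture of Beta(a+1, b) and Beta(a, b+1).
            z1 = p1[w] * a / (a + b)
            z2 = p2[w] * b / (a + b)
            m1 = (z1 * (a + 1) + z2 * a) / ((z1 + z2) * (a + b + 1))
            m2 = (z1 * (a + 1) * (a + 2) + z2 * a * (a + 1)) \
                 / ((z1 + z2) * (a + b + 1) * (a + b + 2))
            # Match a Beta to (mean, variance), then divide out the cavity.
            s = m1 * (1 - m1) / (m2 - m1 ** 2) - 1
            e1[i] = m1 * s - a
            e2[i] = (1 - m1) * s - b
    return a0 + e1.sum(), b0 + e2.sum()

# Hypothetical two-word vocabulary and a ten-token document.
p1, p2 = np.array([0.4, 0.6]), np.array([0.3, 0.7])
print(ep_beta_posterior([0] * 10, p1, p2))
```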

One term

$t_w(\lambda) = 0.4\,\lambda + 0.3\,(1-\lambda)$

Ten word document

Moments for ten word doc

             Mean    Variance
Exact        0.393   0.0613
EP           0.393   0.0612
Variational  0.365   0.0178
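The exact row in tables like this one can be reproduced by brute force, since $\lambda$ is one-dimensional: integrate the product of terms on a grid. A sketch, assuming a hypothetical ten-word document in which every token has the term $0.4\lambda + 0.3(1-\lambda)$ from the previous slide (the actual document behind the table is not given):

```python
import numpy as np

# Per-token term probabilities p(w | aspect 1), p(w | aspect 2); hypothetical.
p1 = np.full(10, 0.4)
p2 = np.full(10, 0.3)

lam = np.linspace(0.0, 1.0, 100001)
post = np.prod(p1[:, None] * lam + p2[:, None] * (1 - lam), axis=0)

# Trapezoid weights for the integrals (uniform prior on lambda).
w = np.ones_like(lam); w[0] = w[-1] = 0.5
norm = (post * w).sum()
mean = (lam * post * w).sum() / norm
var = ((lam - mean) ** 2 * post * w).sum() / norm
print(mean, var)  # exact posterior moments; will differ from the table,
                  # which used a different (unspecified) document
```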

1000 word document

Moments for 1000 word doc

             Mean    Variance
Exact        0.250   0.0015
EP           0.252   0.0015
Variational  0.252   0.0002

General behavior

• For long documents, VB recovers correct mean, but not correct variance

• Optimizes global criterion, only accurate locally

• What about learning?

Learning

• Want MLE for aspects

• Requires marginalizing $\lambda$

• $p(\mathrm{doc})$ is the probability of generating the doc from a random $\lambda$

• Consider an extreme approximation:

– Probability of generating from the most likely $\lambda$

– Ignores uncertainty in $\lambda$ (the "Occam factor")

– Used by Hofmann (1999)

Toy problem

• Two aspects over two words

– $p(w{=}1 \mid a)$ describes each

• One aspect fixed to $p(w{=}1 \mid a{=}2) = 1$

• $\alpha$ is uniform, i.e. $\alpha = 1$

$p(w{=}1) = \lambda\, p(w{=}1 \mid a{=}1) + (1-\lambda)\, p(w{=}1 \mid a{=}2)$

[Figure: $p(w{=}1)$ lies on the interval from 0 to 1 between Aspect 1 (reached at $\lambda = 1$) and Aspect 2 (reached at $\lambda = 0$)]
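A sketch of the toy problem's two likelihood surfaces: the exact likelihood marginalizes $\lambda$ per document, while the Max approximation plugs in the most likely $\lambda$. The synthetic data below is hypothetical, generated from the model itself:

```python
import numpy as np

rng = np.random.default_rng(1)
lam_grid = np.linspace(0.0, 1.0, 2001)

def doc_lik(k, n, p, lam):
    # p(doc | lambda) for k occurrences of word 1 out of n tokens, with
    # p(w=1 | lambda) = lam * p + (1 - lam) * 1  (aspect 2 fixed at 1).
    q = lam * p + (1.0 - lam)
    return q ** k * (1.0 - q) ** (n - k)

def exact_loglik(docs, p):   # integrate lambda under the uniform prior
    return sum(np.log(np.mean(doc_lik(k, n, p, lam_grid))) for k, n in docs)

def max_loglik(docs, p):     # Hofmann-style: most likely lambda only
    return sum(np.log(np.max(doc_lik(k, n, p, lam_grid))) for k, n in docs)

# 10 hypothetical documents of length 10 with true p(w=1 | a=1) = 0.2.
docs = []
for _ in range(10):
    lam = rng.uniform()
    docs.append((rng.binomial(10, lam * 0.2 + (1.0 - lam)), 10))

p_grid = np.linspace(0.0, 0.99, 100)  # avoid the degenerate p = 1 endpoint
print("exact argmax:", p_grid[np.argmax([exact_loglik(docs, p) for p in p_grid])])
print("max   argmax:", p_grid[np.argmax([max_loglik(docs, p) for p in p_grid])])
```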

Exact likelihood vs. Max

[Figure: exact and Max likelihood curves as functions of $p(w{=}1 \mid a{=}1)$, for 10 docs; dotted lines mark the observed frequencies $n_{i1}/n_i$]

EP likelihood vs. VB

[Figure: EP and VB likelihood curves as functions of $p(w{=}1 \mid a{=}1)$]

VB not much different from Max

100 documents

[Figure: likelihood curves vs. $p(w{=}1 \mid a{=}1)$ with 100 documents]

EP better, VB worse

Longer documents

[Figure: likelihood curves vs. $p(w{=}1 \mid a{=}1)$ with longer documents]

VB not helped

Why VB fails

• Doesn't capture posterior variance of $\lambda$

– No Occam factor

• Gets worse with more documents

• Doesn't matter if posterior is sharp!

– Longer documents

– More distinct aspects

Both sharpen the posterior, but do not help VB

Larger vocabulary

[Figures: likelihood comparisons at increasing vocabulary sizes; 10 docs, length 10]

More documents

[Figure: 100 docs, length 10]

TREC documents

Perplexity

• EP and VB models have nearly same perplexity (!)

• Perplexity is document probability, normalized by length

– Dilutes the penalty of extreme aspects

• Better measure: classification error
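For concreteness, the usual definition (the slides do not spell out the exact evaluation protocol): perplexity exponentiates the average per-word negative log probability, so one badly modeled word is averaged over the whole document's length.

```python
import numpy as np

def perplexity(doc_log_probs, doc_lengths):
    # exp of the negative total log probability divided by total word count:
    # normalizing by length is what dilutes the penalty of extreme aspects.
    return np.exp(-np.sum(doc_log_probs) / np.sum(doc_lengths))

# Hypothetical test set: per-document log p(doc) and lengths.
print(perplexity(np.array([-35.0, -40.2]), np.array([10, 12])))
```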

Summary & Future

• Approximate inference can lead to biased learning

– Must preserve "Occam factor" terms

• Moment matching does well

• Simpler methods may also work

– Area of aspect simplex

• Make visualizations!
