7
ESTIMATING GAUSSIAN MIXTURE DENSITIES VIA A MATLAB IMPLEMETATION OF THE EXPECTATION MAXIMIZATION ALGORITHM DR. ASOKA KORALE, C.ENG. MIET & MIESL

Estimating Gaussian Mixture Densities via an implemetation of the Expectaation Maximization Algorithm

Embed Size (px)

Citation preview

Page 1: Estimating Gaussian Mixture Densities via an implemetation of the Expectaation Maximization Algorithm

ESTIMATING GAUSSIAN MIXTURE DENSITIES

VIA A MATLAB IMPLEMETATION OF THE EXPECTATION MAXIMIZATION ALGORITHM

DR. ASOKA KORALE, C.ENG. MIET & MIESL

Page 2: Estimating Gaussian Mixture Densities via an implemetation of the Expectaation Maximization Algorithm

APPLICATIONS FOR GAUSSIAN MIXTURE DECOMPOSITION MODELING ANALYSIS

Slide | 2

Cluster Analysis – Data mapped to a set of Normal Densities – with

specified degree of membership – a model based clusteringCustomer Profiling – Characterizing the

Distributions encountered – Age, ARPU, Net Stay…

leading to a probabilistic description / modeling of the data Sentiment Analysis via Independent Term

Matching where each word is drawn from a specified Normal Distribution – combined by

their sum to determine overall sentiment score A model based approach to data

analysis

Goal: model arbitrary distributions as sums of Gaussian densities (with parameters estimated via expectation

maximization algorithm)– so that each data point is characterized with

respect to a distribution from which it is expected to have originated

Page 5: Estimating Gaussian Mixture Densities via an implemetation of the Expectaation Maximization Algorithm

RESULTS – ESTIMATING THE COMPONENT GAUSSIAN DENSITIES

Slide | 5

II. Standardize the Data and estimate empirical Probability

Density Function

I. Histogram of original Data – (which composite densities to

be estimated)

III. Estimate Gaussian Component Densities

(fx1/2/3) via EM Algorithm and their scaled Sum (fx)

IV. fx: Sum of the individual component densities scaled by their mixing probabilities (for comparison with II the empirical PDF of Data)

Page 7: Estimating Gaussian Mixture Densities via an implemetation of the Expectaation Maximization Algorithm

RESULTS – INTERPRETATION OF CLUSTER MEMBERSHIP

Slide | 7

Test with one dimensional data, through EM algorithm can estimate parameters for sums of “D” dimensional data

*Applicable for multi dimensional data and need to explore correlated random variables