61
Maximum Likelihood Estimation for Allele Frequencies Biostatistics 666 Lecture 7

Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

  • Upload
    lykien

  • View
    236

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Maximum Likelihood Estimation for Allele Frequencies

Biostatistics 666Lecture 7

Page 2: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Last Three Lectures:Introduction to Coalescent Models

Computationally efficient framework• Alternative to forward simulations

Predictions about sequence variation• Number of polymorphisms• Frequency of polymorphisms

Page 3: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Coalescent Models: Key Ideas

Proceed backwards in time

Genealogies shaped by• Population size• Population structure• Recombination rates

Given a particular genealogy ...• Mutation rate predicts variation

Page 4: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Next Series of Lectures

Estimating allele and haplotype frequencies from genotype data• Maximum likelihood approach• Application of an E-M algorithm

Challenges• Using information from related individuals• Allowing for non-codominant genotypes• Allowing for ambiguity in haplotype assignments

Page 5: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Objective: Parameter Estimation

Learn about population characteristics• E.g. allele frequencies, population size

Using a specific sample• E.g. a set sequences, unrelated individuals, or

even families

Page 6: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Maximum Likelihood

A general framework for estimating model parameters

Find the set of parameter values that maximize the probability of the observed data

Applicable to many different problems

Page 7: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Example: Allele Frequencies

Consider…• A sample of n chromosomes• X of these are of type “a”• Parameter of interest is allele frequency…

XnX ppXn

XnpL −−⎟⎟⎠

⎞⎜⎜⎝

⎛= )1(),|(

Page 8: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Evaluate for various parameters

0.0000.01.0

0.0060.20.8

0.1110.40.6

0.2510.60.4

0.0880.80.2

0.0001.00.0

L1-pp

Page 9: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Likelihood Plot

0

0.1

0.2

0.3

0.4

0.0 0.2 0.4 0.6 0.8 1.0

Allele Frequency

Like

lihoo

d

For n = 10 and X = 4

Page 10: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

In this case

The likelihood tells us the data is most probable if p = 0.4

The likelihood curve allows us to evaluate alternatives…• Is p = 0.8 a possibility?• Is p = 0.2 a possibility?

Page 11: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Example: Estimating 4Nµ

Consider S polymorphisms in sample of n sequences…

Where Pn is calculated using the Qn and P2 functions defined previously

)|(),|( θθ SPSnL n=

Page 12: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Likelihood Plot

4Nµ

Like

lihoo

d

With n = 5, S = 10

MLE

Page 13: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Maximum Likelihood Estimation

Two basic steps…

In principle, applicable to any problem where a likelihood function exists

)|( maximizes that ˆ of valueFind b)

)|()|( function likelihooddown Writea)

xL

xfxL

θθ

θθ ∝

Page 14: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

MLEs

Parameter values that maximize likelihood• θ where observations have maximum probability

Finding MLEs is an optimization problem

How do MLEs compare to other estimators?

Page 15: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Comparing Estimators

How do MLEs rate in terms of …• Unbiasedness• Consistency• Efficiency

For a review, see Garthwaite, Jolliffe, Jones (1995) Statistical Inference, Prentice Hall

Page 16: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Analytical Solutions

Write out log-likelihood …

Calculate derivative of likelihood

Find zeros for derivative function

)|(ln)|( dataLdata θθ =l

θθd

datad )|(l

Page 17: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Information

The second derivative is also extremely useful

The speed at which log-likelihood decreasesProvides an asymptotic variance for estimates

θθ

θ θθ

IV

ddatadEI

1

)|(

ˆ

2

2

=

⎥⎦

⎤⎢⎣

⎡−=

l

Page 18: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Allele Frequency Estimation …

When individual chromosomes are observed this does not seem tricky…

What about with genotypes?

What about with parent-offspring pairs?

Page 19: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Coming up …

We will walk through allele frequency estimation in three distinct settings:

• Samples single chromosomes …• Samples of unrelated Individuals …• Samples of parents and offspring …

Page 20: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

I. Single Alleles Observed

Consider…• A sample of n chromosomes• X of these are of type “a”• Parameter of interest is allele frequency…

XnX ppXn

XnpL −−⎟⎟⎠

⎞⎜⎜⎝

⎛= )1(),|(

Page 21: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Some Notes

The following two likelihoods are just as good:

For ML estimation, constant factors in likelihood don’t matter

∏=

−=

−⎟⎟⎠

⎞⎜⎜⎝

⎛=

n

i

xxn

XnX

ii ppnxxxpL

ppXn

nXpL

1

121 )1(),...,;(

)1(),;(

Page 22: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Analytic Solution

The log-likelihood

The derivative

Find zero …

)1ln()(lnln),|(ln pXnpXXn

XnL −−++⎟⎟⎠

⎞⎜⎜⎝

⎛=θ

pXn

pX

dpXpLd

−−

−=1

)|(ln

Page 23: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Samples of Individual Chromosomes

The natural estimator (where we count the proportion of sequences of a particular type) and the MLE give identical solutions

Maximum likelihood provides a justification for using the “natural” estimator

Page 24: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

II. Genotypes Observed

Genotype A1A1 A1A2 A2A2 TotalObserved n11 n12 n22 n=n11+n12+n22

Frequency p11 p12 p22 1.0

Genotype A1 A2 TotalObserved n1=2n11+n12 n2=2n22+n12 2n=n1+n2

Frequency p1=n1/2n p2=n2/2n 1.0

Genotypes

Alleles

Page 25: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Consider a Set of Genotypes…

Use notation nij to denote the number of individuals with genotype i / j

Sample of n individuals

Genotype A1A1 A1A2 A2A2 TotalObserved n11 n12 n22 n=n11+n12+n22

Frequency p11 p12 p22 1.0

Genotypes

Page 26: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Allele Frequencies by Counting…

A natural estimate for allele frequencies is to calculate the proportion of individuals carrying each allele

Genotype A1 A2 TotalObserved n1=2n11+n12 n2=2n22+n12 2n=n1+n2

Frequency p1=n1/2n p2=n2/2n 1.0

Alleles

Page 27: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

MLE using genotype data…

Consider a sample such as ...

The likelihood as a function of allele frequencies is …

Genotype A1A1 A1A2 A2A2 TotalObserved n11 n12 n22 n=n11+n12+n22

Genotypes

( ) ( ) ( ) 221211 ²2²!!!

!);(221211

nnn qpqpnnn

nnpL =

Page 28: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Which gives…

Log-likelihood and its derivative

Giving the MLE as …

( ) ( )

)1(22

)1ln(2ln2ln

1

1222

1

1211

1

1122211211

pnn

pnn

dpd

CpnnpnnL

−+

−+

=

+−+++==l

l

( )( )221211

12111 2

2ˆnnn

nnp++

+=

Page 29: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Samples ofUnrelated Individuals

Again, natural estimator (where we count the proportion of alleles of a particular type) and the MLE give identical solutions

Maximum likelihood provides a justification for using the “natural” estimator

Page 30: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

III. Parent-Offspring Pairs

N pairsa5+a7a2+a4+a6a1+a3

a6+a7a7a60A2A2

a3+a4+a5a5a4a3A1A2

a1+a20a2a1A1A1

A2A2A1A2A1A1ParentChild

Page 31: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Probability for Each Observation

1.0

A2A2

A1A2

A1A1

A2A2A1A2A1A1ParentChild

Page 32: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Probability for Each Observation

1.0p222p1p2p1

2

p22p2

3p1p220A2A2

2p1p2p1p22p1p2p1

2p2A1A2

p120p1

2p2p13A1A1

A2A2A1A2A1A1ParentChild

Page 33: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Which gives…

( ) ( )( ) ( )

( )CBBp

aaaaaaCaaaaaaB

pp

+=

+++++=+++++=

−=

=

1

765432

654321

12

ˆ

3223

1

Lln

Page 34: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Which gives…( ) ( ) ( )

( ) ( )

( ) ( )( ) ( )

( )CBBp

aaaaaaCaaaaaaB

pp

pCpBpappaa

ppappaapa

+=

+++++=+++++=

−=

−+=++++

+++=

1

765432

654321

12

11

327

22165

21422132

311

ˆ

3223

1

)1ln(lnconstantlnln

lnlnlnLln

Page 35: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Samples ofParent Offspring-Pairs

The natural estimator (where we count the proportion of alleles of a particular type) and the MLE no longer give identical solutions

In this case, we expect the MLE to be more accurate

Page 36: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Comparing Sampling Strategies

We can compare sampling strategies by calculate the information for each one

Which one to you expect to be most informative?

θθ

θ θθ

IV

ddatadEI

1

)|(

ˆ

2

2

=

⎥⎦

⎤⎢⎣

⎡−=

l

Page 37: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

How informative is each setting?

Single chromosomes

Unrelated individuals

Parent offspring trios43

)(

2)(

)(

aNpqpVar

NpqpVar

NpqpVar

−=

=

=

Page 38: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Other Likelihoods

Allele frequencies when individuals are…• Diagnosed for Mendelian disorder• Genotyped at two neighboring loci• Phenotyped for the ABO blood groups

Many other interesting problems…… but some have no analytical solution

Page 39: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Today’s Summary

Examples of Maximum Likelihood

Allele Frequency Estimation• Allele counts• Genotype counts• Pairs of Individuals

Page 40: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Take home reading

Excoffier and Slatkin (1996)

Introduces the E-M algorithmWidely used for maximizing likelihoods in genetic problems

Page 41: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Properties of Estimators

For Review

Page 42: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Unbiasedness

An estimator is unbiased if

Multiple unbiased estimators may existOther properties may be desirable

θθθ

θθ

−=

=

)ˆ()ˆ(

)ˆ(

Ebias

E

Page 43: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Consistency

An estimator is consistent if

for any ε

Estimate converges to true value in probability with increasing sample size

( ) ∞→→>− nP as 0|ˆ| εθθ

Page 44: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Mean Squared Error

MSE is defined as

If MSE → 0 as n → ∞ then the estimator must be consistent• The reverse is not true

{ }2

2

)ˆbias()ˆvar(

)()ˆ()ˆ(

θθ

θθθθθ

+=

⎟⎠⎞⎜

⎝⎛ −+−= EMSE

Page 45: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Efficiency

The relative efficiency of two estimators is the ratio of their variances

Comparison only meaningful for estimators with equal biases

efficient more is ˆ then1)ˆvar()ˆvar( if 1

1

2 θθθ

>

Page 46: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Sufficiency

Consider…• Observations X1, X2, … Xn

• Statistic T(X1, X2, … Xn)

T is a sufficient statistic if it includes all information about θ in the sample• Distribution of Xi conditional on T is independent of θ• Posterior distribution of θ conditional on T is independent

of Xi

Page 47: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Minimal Sufficient Statistic

There can be many alternative sufficient statistics.

A statistic is a minimal sufficient statistic if it can be expressed as a function of every other sufficient statistic.

Page 48: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Typical Properties of MLEs

Bias• Can be biased or unbiased

Consistency• Subject to regularity conditions, MLEs are consistent

Efficiency• Typically, MLEs are asymptotically efficient estimators

Sufficiency• Often, but not always

Cox and Hinkley, 1974

Page 49: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Strategies for Likelihood Optimization

For Review

Page 50: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Generic Approaches

Suitable for when analytical solutions are impractical

BracketingSimplex MethodNewton-Rhapson

Page 51: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Bracketing

Find 3 points such that • θa < θb < θc

• L(θb) > L(θa) and L(θb) > L(θc)

Search for maximum by• Select trial point in interval• Keep maximum and flanking points

Page 52: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Bracketing

1 2

3

4

5

6

Page 53: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

The Simplex Method

Calculate likelihoods at simplex vertices• Geometric shape with k+1 corners• E.g. a triangle in k = 2 dimensions

At each step, move the high vertex in the direction of lower points

Page 54: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

The Simplex Method II

highlow

Original Simplex

reflection

reflection andexpansion

contraction

multiplecontraction

Page 55: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

One parameter maximization

Simple but inefficient approach

Consider• Parameters θ = (θ1, θ2, …, θk)• Likelihood function L (θ; x)

Maximize θ with respect to each θi in turn• Cycle through parameters

Page 56: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

The Inefficiency…

θ1

θ2

Page 57: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Steepest Descent

Consider• Parameters θ = (θ1, θ2, …, θk)• Likelihood function L (θ; x)

Score vector

• Find maximum along θ + δS⎟⎟⎠

⎞⎜⎜⎝

⎛==

kdLd

dLd

dLdS

θθθ)ln(...,,)ln()ln(

1

Page 58: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Still inefficient…

Consecutive steps are perpendicular!

Page 59: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Local Approximations to Log-Likelihood Function

matrixn informatio observed theis)(² vectorscore theis )(

function oodloglikelih theis)(ln)(where

)()(21)()()(

of oodneighboorh In the

i

i

it

iii

i

dd

L

S

θIθSθθ

θθIθθθθθθ

θ

l

l

l

ll

−===

−−−−+≈

θ

θ

Page 60: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Newton’s Method

SIθθ

0θθIS

θθIθθθθSθθ

11

point trialnew aget and)(

zero... toderivative its settingby

)()(21)()()(

ionapproximat theMaximize

−+ +=

=−−

−−−−+≈

ii

i

it

iiill

Page 61: Maximum Likelihood Estimation for Allele Frequenciescsg.sph.umich.edu/abecasis/class/666.07.pdf · Next Series of Lectures zEstimating allele and haplotype frequencies from genotype

Fisher Scoring

Use expected information matrix instead of observed information:

2

2

2

2

)|(of instead

)(

θθ

θθ

ddatad

ddE

l

l

⎥⎦

⎤⎢⎣

⎡− Compared to Newton-Rhapson:

Converges faster when estimatesare poor.

Converges slower when close toMLE.