55
Construction of Substitution matrices BLOSUM BLOCKS SUBSTITUTION MATRIX • PAM POINT ACCEPTED MUTATIONS

Construction of Substitution matrices BLOSUM BLO CKS SU BSTITUTION M ATRIX PAM

  • Upload
    yen

  • View
    44

  • Download
    0

Embed Size (px)

DESCRIPTION

Construction of Substitution matrices BLOSUM BLO CKS SU BSTITUTION M ATRIX PAM P OINT A CCEPTED M UTATIONS. Substitution matrices - PowerPoint PPT Presentation

Citation preview

Page 1: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Construction of Substitution matrices

• BLOSUM

• BLOCKS SUBSTITUTION MATRIX

• PAM

• POINT ACCEPTED MUTATIONS

Page 2: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Substitution matrices

• Substitution matrix contains values proportional to the probability that amino acid A mutates into amino acid B for all pairs of amino acids through a period of evolution

• Substitution matrices are constructed from a large and diverse sample of sequence alignments

Page 3: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

How to construct substitution matrices

• Multiple alignment of well studies gene sequences from different species

• use orthologs: functionally similar

• observed substitutions tend to preserve functions

• minimal gaps

Page 4: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

How to construct substitution matrices ?

• Tabulate substitutions

• A to A: 9867 times

• A to R: 2 times

•A to N: 9 times

• etc….

Page 5: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

How to construct substitution matrices ?

Page 6: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Construction of Substitution matrices

• BLOSUM

Page 7: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Construction of Substitution matrices

• BLOSUM

Page 8: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

How to construct substitution matrices ?

Substitution matrix score =

Log Observed mutation rate in alignmentExpected random mutation rate

Page 9: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

How do we find the random mutation rate?

Page 10: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

The random mutation rate

• compute the overall occurrence of an amino acid in a protein database

Page 11: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

The random mutation rate

• compute the overall occurrence of an amino acid in a protein database

http://www.ebi.ac.uk/swissprot/sptr_stats/index.html

Page 12: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

The random mutation rate

Example:

Expected random mutation rate is 1 in 10000 and observed mutation rate of W to R is 1 in 10

Score = log (0.1/0.0001) = log (1000) = +3

Page 13: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 14: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 15: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 16: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 17: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 18: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 19: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 20: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 21: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 22: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 23: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 24: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 25: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 26: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Calculating BLOSUM62 scores

Page 27: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

PAM matrices

• Point Accepted Mutations

[1 point mutation per 100 amino acids]

• does not take into account different evolutionary rates between conserved and non-conserved regions

Page 28: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

• PAM1 is 1% average change in amino acids

• PAM 250:??

Page 29: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 30: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Why use substitution matrices?????

Page 31: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Why use substitution matrices?

• Database searches

Page 32: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Database searching

Page 33: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Database searching

Page 34: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Database searching

• Query Sequence; Database sequences

Page 35: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Database searching: Filtering

• Dynamic programming is computationally expensive

• Apply DP to sequence pairs that are likely to be similar

• find short words: query-database

• DNA 7-28bases (BLAST?)• PROTEIN 3 amino acids (BLAST?)

Page 36: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

BLAST

• Basic Local Alignment Search Tool

• Heuristic method?

Page 37: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 38: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 39: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 40: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 41: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 42: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 43: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 44: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 45: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 46: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 47: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

Blast output parameter

E value

Page 48: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM

E value

• number of alignments one can expect see by chance.

• Number of alignments having the same or greater score.

• Dependent on size of database and length of query seq.

Page 49: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 50: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 51: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 52: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 53: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 54: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM
Page 55: Construction of Substitution matrices BLOSUM BLO CKS  SU BSTITUTION  M ATRIX  PAM