Dot Matrices Global Alignments Local Alignment

Dot MatricesGlobal AlignmentsLocal Alignment

HOW Can we Align Two Sequences ?

Dot Matrices

QUESTION

What are the elements shared by two sequences ?

Dot Matrices

>Seq1THEFATCAT>Seq2THELASTCAT

T H E F A T C A T

Window

Stringency

Dot Matrices

Sequences Window size

Stringency

Dot MatricesStrigency

Window=1Stringency=1

Dot Matrices

Dot MatricesLimits

-Visual aid

-Best Way to EXPLORE the Sequence Organisation

-Does NOT provide us with an ALIGNMENT

wheat --DPNKPKRAMTSFVFFMSEFRSEFKQKHSKLKSIVEMVKAAGER | | |||||||| || | ||| ||| | |||| ||||????? KKDSNAPKRAMTSFMFFSSDFRS----KHSDL-SIVEMSKAAGAA

Using Dynamic Programming To Align

Sequences

Our Scope

Coding a Global and a Local Algorithm

Understanding the DP concept

Aligning with Affine gap penalties

Sophisticated variants…

Saving memory

Outline

-Coding Dynamic Programming with Non-affine Penalties

-Adding affine penalties

-Turning a global algorithm into a local Algorithm

-Using A Divide and conquer Strategy

-The repeated Matches Algorithm

-Double Dynamic Programming

-Tailoring DP to your needs:

Global Alignments Without Affine Gap penalties

Dynamic Programming

How To align Two Sequences With a Gap Penalty, A Substitution

matrix and Not too Much Time

Dynamic Programming

A bit of History…

-DP invented in the 50s by Bellman

-Programming Tabulation

-Re-invented in 1970 by Needlman and Wunsch

-It took 10 year to find out…

The Foolish Assumption

The score of each column of the alignment is independent from the rest of the alignment

It is possible to model the relationship between two sequences with:-A substitution matrix-A simple gap penalty

The Principal of DP

If you extend optimally an optimal alignment of two sub-sequences, the result remains an optimal alignment

X- XXXXXX

Deletion

Alignment

Insertion

Finding the score of i,j

-Sequence 1: [1-i]-Sequence 2: [1-j]

-The optimal alignment of [1-i] vs [1-j] can finish in three different manners:

1… i1…j-1

1…i-11…j-1

1…i-11… j

Three ways to buildthe alignment 1…i

1…i-11…j-1

1….i1…j-1

1…i-11….j

In order to Compute the score of 1…i1…j

All we need are the scores of:

Formalizing the algorithm

F(i,j)= best

F(i-1,j) + Gap

F(i-1,j-1) + Mat[i,j]

F(i,j-1) + Gap X-

1….i1…j-1

1…i-11…j-1

1…i-11…j

Arranging Everything in a Table

- F A-FAS

1…I-11…J-1

1…I1…J-1

1…I-11…J

1…I 1…J

Taking Care of the Limits

In a Dynamic Programming strategy, the most delicate part is to take care of the limits:

-what happens when you start-what happens when you finish

The DP strategy relies on the idea that ALL the cells in your table have the same environment…

This is NOT true of ALL the cells!!!!

Filing Up The Matrix

Taking Care of the Limits

- F A-FAS

T -4Match=2MisMatch=-1Gap=-1

FAT---

-2FA--

-3FAS---

-2 +2-3

+1 +1-4

-3 +10

0 +4-1

+3 +30

-4 0+3

-1 +3+2

+2 +3-1

-5 -1+2

-2 +2+2

Match=2MisMatch=-1Gap=-1

Delivering the alignment: Trace-back

Score of 1…3 Vs 1…4

Optimal Aln Score

Trace-back: possible implementation

while (!($i==0 && $j==0)) { if ($tb[$i][$j]==$sub) #SUBSTITUTION

{ $alnI[$aln_len]=$seqI[--$i]; $alnJ[$aln_len]=$seqJ[--$j];

} elsif ($tb[$i][$j]==$del) #DELETION

{ $alnI[$aln_len]='-'; $alnJ[$aln_len]=$seqJ[--$j]; }

elsif ($tb[$i][$j]==$ins) #INSERTION{ $alnI[$aln_len]=$seqI[0][--$i]; $alnJ[$aln_len]='-'; }

$aln_len++; }

Local Alignments

Dynamic Programming

The Smith and Waterman Algorithm

F(i,j)= best

F(i-1,j) + Gep

F(i-1,j-1) + Mat[i,j]

F(i,j-1) + Gep X-

1…i1…j-1

1…i-11…j-1

1…i-11…j

Filing Up a SW Matrix

Filling up a SW matrix: borders

* - A N I C E C A T - 0 0 0 0 0 0 0 0 0C 0A 0T 0A 0N 0 D 0O 0G 0

Easy:Local alignments NEVER start/end with a gap…

Filing Up a SW Matrix

Filling up a SW matrix

* - A N I C E C A T - 0 0 0 0 0 0 0 0 0C 0 0 0 0 2 0 2 0 0 A 0 2 0 0 0 0 0 4 0T 0 0 0 0 0 0 0 2 6A 0 2 0 0 0 0 0 0 4N 0 0 4 2 0 0 0 0 2D 0 0 2 2 0 0 0 0 0O 0 0 0 0 0 0 0 0 0G 0 0 0 0 0 0 0 0 0

Best Local score

Beginning of the trace-back

Match=2MisMatch=-1Gap=-1

for ($i=1; $i<=$len0; $i++) { for ($j=1; $j<=$len1; $j++)

{ if ($res0[0][$i-1] eq $res1[0][$j-1]){$s=2;}

else {$s=-1;} $sub=$mat[$i-1][$j-1]+$s; $del=$mat[$i ][$j-1]+$gep; $ins=$mat[$i-1][$j ]+$gep; if ($sub>$del && $sub>$ins && $sub>0)

{$smat[$i][$j]=$sub;$tb[$i][$j]=$subcode;} elsif($del>$ins && $del>0 )

{$smat[$i][$j]=$del;$tb[$i][$j]=$delcode;} elsif( $ins>0 )

{$smat[$i][$j]=$ins;$tb[$i][$j]=$inscode;} else {$smat[$i][$j]=$zero;$tb[$i][$j]=$stopcode;}

if ($smat[$i][$j]> $best_score) { $best_score=$smat[$i][$j]; $best_i=$i; $best_j=$j; }

PrepareTraceback

TurningNW

Dot Matrices Global Alignments Local Alignment

Documents

Sequence Analysis Alignments dot-plots scoring scheme Substitution matrices Search algorithms (BLAST)

Hierarchical Translation Equivalence over Word Alignments · Hierarchical Translation Equivalence over Word Alignments ... alignments at the lexical level. Perhaps learning hierarchical

Alignments Jmcinerney

Pairwise Sequence Alignments

Whole genome alignments

Alignments Lecture

Sequence Alignments

E2 Alignmentspave.niaid.nih.gov/lanl-archives/compendium/94PDF/2/e2.pdf · 2019. 5. 7. · II-E2-1 SEP 94 E2 Alignments E2 Alignments E2 Alignments E2 Alignments E2 Alignments E2

Rapid Global Alignments

Multiple Sequence Alignments

Radical re-alignments (?) - Theatre Futurestheatrefutures.org.uk/.../files/2013/07/RadicalRe-alignments-1.pdfRadical re-alignments (?) innovations in curriculum design / assessment

Pairwise alignments Introduction Introduction Why do alignments? Why do alignments? Definitions Definitions Scoring alignments Scoring alignments Alignment

Dot Plots, Path Matrices, Score Matrices

Bioinfo 7 Alignment - TeachEnGteacheng.illinois.edu/LectureNotes/SequenceAlignment1... · 2017-07-26 · • Dot plots for graphic analysis • Local or global alignments for residue/residue

Sequence Comparison With Dot Matrices

Geometric Intuition Randy Gaul. Vectors, Points and Basis Matrices Rotation Matrices Dot product and how it’s useful Cross product and how it’s useful

L2 Alignments - National Institutes of Health · II-L2-1 SEP 94 L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2

L2 Alignments - PaVE: Papilloma virus genome database · II-L2-1 OCT 95 L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments L2 Alignments

of Alignments

34 fronts-alignments