38
Protein modelling Protein structure is the key to understanding protein function Protein structure Topics in modelling and computational methods Comparative/homology modelling Fold recognition Fold prediction Dynamics of proteins

Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Embed Size (px)

Citation preview

Page 1: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Protein modelling

● Protein structure is the key to understanding protein function

● Protein structure● Topics in modelling and computational methods

– Comparative/homology modelling– Fold recognition– Fold prediction– Dynamics of proteins

Page 2: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Motivation

● Protein structure determines protein function● For the majority of proteins the structure is not

known

structures

sequences

0 250000 500000 750000 1000000 1250000 1500000

Structural coverage

Page 3: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Correlation structure & sequence

● Chothia & Lesk (1986): Correlation between structural divergence and sequence similarity

Fold space

Tim

e

Fold 1 Fold 2

Evolution

Page 4: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Comparative/homology modelling

Template sequenceTemplate structure

Target sequence

Alignment

Model

Page 5: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

The crucial importance of the alignment

● An alignment defines structurally equivalent positions!

Template sequence

Template structure

Target sequence

Alignment

Model

Page 6: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Steps in comparative modelling

● Find suitable template(s)● Build alignment between target and template(s)● Build model(s)

– Replace sidechains– Resolve conflicts in the structure– Model loops (regions without an alignment)

● Evaluate and select model(s)

Page 7: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

State of the art in homology modelling

● Template search

– (iterative) sequence database searches (PSIBLAST)● Alignment step

– multiple alignment of close to fairly distant homologues● Modelling step

– rigid body assembly– segment matching– satisfaction of spatial constraints

Page 8: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Modelling by spatial restraints

● Generate many constraints:– Homology derived constraints

● Distances and angles between aligned positions should be similar

– Stereochemical constraints● Bond lengths, bond angles, dihedral angles, nonbonded

atom-atom contacts

● Model derived by minimizing restraints

Modeller: Sali & Blundell (1993)

Page 9: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Loop modelling

● Exposed loop regions usually more variable than protein core

● Often very important for protein function● Loops longer than 5 residues difficult to built● Mini-protein folding problem

Page 10: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Model evaluation

● Check of stereochemistry– bond lengths & angles, peptide bond planarity, side-

chain ring planarity, chirality, torsion angles, clashes● Check of spatial features

– hydrophobic core, solvent accessibility, distribution of charged groups, atom-atom-distances, atomic volumes, main-chain hydrogen bonding

● 3D profiles/mean force potentials– residue environment

Page 11: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Knowledge-based mean force potentials

Melo & Feytmanns (1997)

● Compute typical atomic/residue environments based on known protein structures

Page 12: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

● Sequence from different species

● Is binding to ligand conserved?

Ligand

DNA

Modelling a transcription factor

Page 13: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Ligand binding domain

hydrogen bonds to ligand homo-serine lactone moiety binding acyl moiety binding

Page 14: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

DNA binding domain

Linker DNA binding domain

Page 15: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Template

Target

Variable loops

New Loop

MODELLER output

Page 16: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Ligand binding pocket

Page 17: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Errors in comparative modelling

Marti-Renom et al. (2000)

a)Side chain packing

b)Distortions and shifts

c)Loops

d)Misalignments

e)Incorrect template

TemplateModel

True structure

Page 18: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Modelling accuracy

Marti-Renom et al. (2000)

Page 19: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Applications of homology modelling

Marti-Renom et al. (2000)

Page 20: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Structural genomics

● Post-genomics:– many new sequences, no function

● Aim: a structure for every protein● High-throughput structure determination

– robotics– standard protocols for

cloning/expression/crystallization

Page 21: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Structural coverage

Vitkup et al. (2001)

high quality models

Complete models

Total = 43 %

Page 22: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Target selection

Page 23: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Protein modelling

● Protein structure is the key to understanding protein function

● Protein structure● Topics in modelling and computational methods

– Comparative/homology modelling– Fold recognition– Fold prediction– Dynamics of proteins

Page 24: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Fold recognition

● Structure is more conserved than sequence

Limit of sequence similarity searches

Structural similarity

Fold space

Target

Protein structures

Page 25: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Fold recognition / Threading

● Is a sequence compatible with a structure?● The idea: evolutionary related proteins share

common folding motifs● Contact matrix = motif● Mean-force potentials

to score every contact● Optimize alignment to

minimize pseudo-energy

AAGGT YAAT YAAGGTYAATY

Page 26: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Protein modelling

● Protein structure is the key to understanding protein function

● Protein structure● Topics in modelling and computational methods

– Comparative/homology modelling– Fold recognition– Fold prediction– Dynamics of proteins

Page 27: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Fold prediction – Rosetta method

● Knowledge based scoring function

P(structure) * P(sequence|structure)

P(sequence)P(structure|sequence) =

P(structure) = probability of a protein-like structure(no clashes, globular shape)

P(sequence|structure) = f(residue contacts in native structures)

Simons et al. (1997)

Bayes' law:

protein-likestructures

sequence consistentlocal structure

near-native structures

Page 28: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Environment specific scoring function

● Environment Ei specific interactions

● Environment – defined by the number of neighbours– implicitely distinguishes between buried and exposed

residuesP žaa1, aa2,‹ , aan#structureŸ= P žaai#E iŸ

P žaa i , aa j#rij , E i , E jŸP žaai#rij , E i , E jŸ P žaa j#r ij , E i , E jŸi i<j

cf. mean force potential

Simons et al. (1997)

Page 29: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Collection of putative backbone conformations

Protein sequence

Library of small segments

sequences structures

... ...For each window of 9 residues:

lookup 25 closest (sequence) neighbours in library

...

Simons et al. (1997)

Page 30: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

MC-SA optimization

Simons et al. (1997)

● for each random position– pick a random neighbour– replace backbone conformation– calculate probability of new structure

● MC: Monte-Carlo– accept up-hill moves with a certain probability

● SA: simulated annealing– first allow many changes, later less changes

Page 31: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Results

● Small molecules: ok● Proteins with mostly

α-helices: ok● Proteins with mostly

β-sheets: not so ok

Simons et al. (1997)

Page 32: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Dynamics of proteins

● Protein structure is the key to understanding protein function

● Protein structure● Topics in modelling and computational methods

– Comparative/homology modelling– Fold recognition– Fold prediction– Dynamics of proteins

Page 33: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Dynamics of proteins

● Local Motions (0.01 to 5 Å, 10-15 to 10-1 s)

– Atomic fluctuations

– Sidechain Motions

– Loop Motions

● Rigid Body Motions (1 to 10Å, 10-9 to 1s)

– Helix Motions

– Domain Motions (hinge bending)

– Subunit motions

● Large-Scale Motions (> 5Å, 10-7 to 104 s)

– Helix coil transitions

– Dissociation/Association

– Folding and Unfolding

Page 34: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Molecular dynamics/molecular modelling

● Molecular mechanics● Normal mode analysis● Quantum mechanical simulations● ...

Page 35: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Molecular mechanics● Atom representation

– sphere– charge– topology

● Forces– Bonded interactions– Non-bonded interactions

● Electrostatic interactions● Van-der-Waals interactions

– Forcefields: AMBER, GROMOS, ...● Newton's law of mechanics

http://cmm.info.nih.gov/modeling/guide_documents/molecular_mechanics_document.html

Page 36: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Molecular mechanics

● Molecular mechanics simulations take long!– because of the size of the system

● Proteins are large ● Water molecules to consider solvent effects● 10.000 to millions of atoms

– because of the number of iterations● update atom positions according to time-scale of fastest

fluctuations: bond vibrations ca. 1 fs● movements of interest frequently have long time-scale,

e.g. folding● 1s => 1015 iterations!

Page 37: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

Benefit of simulations

● Result is an ensemble of structures– Time-averaged statistical quantities– e.g., relative free energies of different conformations

● Protein engineering– e.g., relative free energies of different mutants

● Physical accuracy of models?– chemical reactions?– cutoff and long-range interactions? – dielectric constant?

movie from: C. Letner, G. Alter Journal of Molecular Structure (Theochem) 368 (1996) 205–212

Page 38: Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods

The end

Proteins are beautiful!

www.holmgroup.org