113
Garima Garima Kushwaha Kushwaha BBI-VIII BBI-VIII BBI-03005 BBI-03005

BBI-03005(Garima Kushwaha) Version1

Embed Size (px)

Citation preview

Page 1: BBI-03005(Garima Kushwaha) Version1

Garima Garima KushwahaKushwaha

BBI-VIIIBBI-VIIIBBI-03005BBI-03005

Page 2: BBI-03005(Garima Kushwaha) Version1

Agenda

HYDROGENASE Molecular Dynamic Simulation NAMD GOMACS PYMOL VMD CAVER

Page 3: BBI-03005(Garima Kushwaha) Version1

Periodic oil crises have become the norm for today’s market.

……………….. S o l u t i o n ……….... S o l u t i o n ………..

The use of The use of Molecular HydrogenMolecular Hydrogen as an as an alternative to fossil fuel as an active area alternative to fossil fuel as an active area

of research.of research.

Page 4: BBI-03005(Garima Kushwaha) Version1

The long term goal for this project is to develop efficient and economical technology

for the biological conversion of solar energy into molecular hydrogen.

Page 5: BBI-03005(Garima Kushwaha) Version1

HYDROGENASE

Hydrogenase enzymes were discovered in enteric bacteria by Stephenson and Strickland.

They are found in a wide array of organisms, ranging from aerobes to anaerobes, autotrophs to heterotrophs, prokaryotes to eukaryotic photosynthetic organisms, fermentative organisms, sulfate reducers, and others.

The current research focuses on prokaryotes.

Hydrogenase enzymes catalyze the reversible oxidation of molecular hydrogen to protons and electrons

[H2 <=> 2H + + 2e - ]

Page 6: BBI-03005(Garima Kushwaha) Version1

Classification of hydrogense enzymes are determined by their metal center.

The two most studied classes are:1. Ni-Fe hydrogenases 2. Fe-only hydrogenases

In genernal, Ni-Fe hydrogenase enzymes consume molecular

hydrogen as a fuel source, and Fe-only hydrogense or “iron-only” [FeFe]-hydrogenases

enzymes produce molecular hydrogen.

Current attempts to identify the structure of the active site and to formulate a mechanism have been fruitful. The x-ray crystal structures of two Fe-only hydrogenases have been obtained thus far.

HYDROGENASE (Cont..)

Page 7: BBI-03005(Garima Kushwaha) Version1

Much of the recent scientific interest in [FeFe]-hydro-genases, however, concerns a different role entirely: the H2 production properties of [FeFe]-hydrogenases offer the promise of a means for affordable large-scale pro-duction of H2 as a source of renewable energy.

New developments in the research of Fe-only hydrogenase has peaked interest for use of enzymes in the production of hydrogen.

HYDROGENASE (Cont..)

Page 8: BBI-03005(Garima Kushwaha) Version1

Simple and short pathway for water photolysis in a biological organism that may

therefore deliver an attractive conversion efficiency.

Page 9: BBI-03005(Garima Kushwaha) Version1

Hydrogenase fromHydrogenase from Clostridium pasteurianupasteurianumm

The Fe-only hydrogenases have been identified in a small group of microbes.

One such enzyme, the soluble, monomeric hydrogenase isolated from the Gram-positive anaerobe Clostridium pasteurianum, has been purified and extensively characterized both bio-chemically and spectroscopically.

Results from various spectroscopic methods have suggested that the Fe and S are organized into five distinct metal clusters. One of these, termed the H-cluster (hydrogen cluster), is proposed to be the site of hydrogen activation. X-ray crystallo-graphic methods are used to determine the structure of the C. pasteurianum Fe-only hydrogenase (CpI) to 1.8 Å resolution, revealing the structure of the active-site cluster.

Page 10: BBI-03005(Garima Kushwaha) Version1

Overall Structure

The overall structure of CpI resembles a mushroom, with a large cap connected to a stem.

This overall structure can be subdivided into four distinct nonoverlapping domains.

• The largest of the four domains, designated the active-site domain, makes up the mushroom cap;

• the re-maining three smaller domains constitute the stem, which contains the accessory [Fe-S] clusters termed FS4A, FS4B, FS4C, and FS2 .

• The active-site domain contains about two-thirds of the total protein (amino acid residues 210 to 574).

The fold of the active-site domain consists of two four-stranded twisted b sheets, each flanked by a number of a helices that appear to be two nearly equivalent lobes, with one b sheet and associated helices contained within each lobe.

Page 11: BBI-03005(Garima Kushwaha) Version1

(C1) Nomenclature for the [Fe–S] clusters of Fe-only hydrogenase CpI.

(C2) GRASP representation of the relative charge on the surface of CpI, with regions of acidic residues (aspartic acid and glutamic acid) indicated in red and regions of basic residues (lysine and arginine) indicated in blue [3].

Page 12: BBI-03005(Garima Kushwaha) Version1

Hydrogen production in CpI happens at the H cluster, a metallic cluster bound to and embedded in-side the CpI protein matrix.

Page 13: BBI-03005(Garima Kushwaha) Version1

H-Cluster

Page 14: BBI-03005(Garima Kushwaha) Version1

The promise of Cheap Renewable energy

So,cultures of hydrogenase-containing microorganisms have the ability to produce a

constant output of hydrogen gas (H2) from justsunlight and water.

If harnessed properly, hydrogenase and/orhydrogenase-containing organisms could be usedto supply affordable and renewable H2 to be used

as an energy fuel, and thus solve the "supply“aspect of the future hydrogen economy.

Page 15: BBI-03005(Garima Kushwaha) Version1

This idealistic picture is not without problems….

Notably, hydrogenase's H-cluster is extremely Notably, hydrogenase's H-cluster is extremely sensitive to the presence of oxygen gas (O2), sensitive to the presence of oxygen gas (O2),

which will bind to it permanently. which will bind to it permanently.

In the presence of O2, hydrogen production is In the presence of O2, hydrogen production is maintained for only a few minutes before the maintained for only a few minutes before the

hydrogenases become deactivated. hydrogenases become deactivated.

An anaerobic environment is required, making An anaerobic environment is required, making hydrogenase a costly and impractical source of hydrogenase a costly and impractical source of

H2. H2.

Page 16: BBI-03005(Garima Kushwaha) Version1

If we identify the pathways through which O2 reaches the H-cluster.

Then we can create an engineered version of hydrogenase in which these O2 pathways are blocked.

Thus decreasing hydrogenase's sensitivity towards O2.

Page 17: BBI-03005(Garima Kushwaha) Version1

Molecular Dynamic Simulation

This is a computational method that calculates the time dependent behavior of a molecular system.

 Itprovides a detailed information on the fluctuationsand conformational changes of proteins and nucleicacids.

Molecular dynamics (MD) is a form of computersimulation, wherein atoms and molecules areallowed to interact for a period of time under knownlaws of physics.

Page 18: BBI-03005(Garima Kushwaha) Version1

Because molecular systems generally consist of avast number of particles, it is impossible to find theproperties of such complex systems analytically;

M.D. simulation circumvents this problem by using numerical methods.

Molecular dynamics is a multidisciplinary field. Itslaws and theories stem from mathematics, physics,and chemistry, and it employs algorithms fromcomputer science and information theory. In the broadest sense, molecular dynamics isconcerned with molecular motion.

Page 19: BBI-03005(Garima Kushwaha) Version1

The driving force for chemical processes is describedby thermodynamics. The mechanism by whichchemical processes occur is described by kinetics.

Thermodynamics dictates the energetic relationshipsbetween different chemical states, whereas thesequence or rate of events that occur as moleculestransform between their various possible states isdescribed by kinetics:

Page 20: BBI-03005(Garima Kushwaha) Version1

Conformational transitions and local vibrations are the usual subjects of molecular dynamics studies. Molecular dynamics alters the intramolecular degrees of freedom in a step-wise fashion, analogous to energy minimization.

The individual steps in energy minimization are merely directed at establishing a down-hill direction to a minimum. The steps in molecular dynamics, on the other hand, meaningfully represent the changes in atomic position, ri, over time (i.e. velocity).

Newton's equation is used in the molecular dynamics formalism to simulate atomic motion:

The rate and direction of motion (velocity) are governed by the forces that the atoms of the system exert on each other as described by Newton's equation.

Page 21: BBI-03005(Garima Kushwaha) Version1

The force on an atom can be calculated from thechange in energy between its current position and itsposition a small distance away.

This can be recognized as the derivative of the energywith respect to the change in the atom's position:

Page 22: BBI-03005(Garima Kushwaha) Version1

Knowledge of the atomic forces and masses can then be used to solve for the positions of each atom along a series of extremely small time steps (on the order of femtoseconds = 10^-15 seconds).

The resulting series of snapshots of structural changes over

time is called a trajectory. The use of this method to compute trajectories can be more easily seen when Newton's equation is expressed in the following form:

Page 23: BBI-03005(Garima Kushwaha) Version1

In practice, trajectories are not directly obtainedfrom Newton's equation due to lack of an analyticalsolution.

First, the atomic accelerations are computed from the

forces and masses. The velocities are next calculated

from the accelerations based on the followingrelationship:

Lastly, the positions are calculated from the velocities:

Page 24: BBI-03005(Garima Kushwaha) Version1

A trajectory between two states can be subdividedinto a series of sub-state separated by a smalltime step, "delta t" (e.g.1 femtosecond)

Page 25: BBI-03005(Garima Kushwaha) Version1
Page 26: BBI-03005(Garima Kushwaha) Version1

NAMDNAMD, is a parallel molecular dynamics codedesigned for high-performance simulation of largebiomolecular systems. NAMD uses the popularmolecular graphics program VMD for simulationsetup and trajectory analysis, but is also file-compatible with AMBER, CHARMM, and X-PLOR.You can build NAMD yourself or download binariesfor a wide variety of platforms.

It has been developed by the joint collaboration ofThe Theoretical and Computational BiophysicsGroup (TCB) and the Parallel ProgrammingLaboratory (PPL) at the University of Illinois atUrbana-Champaign.

Page 27: BBI-03005(Garima Kushwaha) Version1

NAMD requires four files for any MDNAMD requires four files for any MD

Simulation run. These are as follows:-Simulation run. These are as follows:-

1. PDB file.1. PDB file.

2. PSF file.2. PSF file.

3. PARAMETER file.3. PARAMETER file.

4. CONFIGURATION file.4. CONFIGURATION file.

NAMD (Cont …)

Page 28: BBI-03005(Garima Kushwaha) Version1

Files in the PDB include information such as the name of the compound, the species and tissue from which is was obtained, authorship, revision history, journal citation, references, amino acid sequence, stoichiometry, secondary structure locations, crystal lattice and symmetry group, and finally the ATOM and HETATM records containing the coordinates of the protein and any waters, ions, or other heterogeneous atoms in the crystal. Some PDB files include multiple sets of coordinates for some or all atoms.

NAMD and VMD ignore everything in a PDB file except for the ATOM and HETATM records, and when writing PDB files the ATOM record type is used for all atoms in the system, including solvent and ions.

PDB File Format

Page 29: BBI-03005(Garima Kushwaha) Version1

A PSF file, also called a protein structure file, contains all of the molecule specific information needed to apply a particular force field to a molecular system. The CHARMM force field is divided into a topology file, which is needed to generate the PSF file, and a parameter file, which supplies specific numerical values for the generic CHARMM potential function. The PSF file contains five main sections of interest: atoms, bonds, angles, dihedrals, and impropers.

The X-ray structure of a protein from the Protein Data Bank does not contain information about the hydrogen atoms of that protein. NAMD provides the psfgen utility, which is capable of generating the required PSF and PDB files by merging PDB files and guessing coordinates for missing atoms. So the pdb file which will generated with psfgen along with the psf will contain guessed coordinates for hydrogen atoms of the structure.

Protein Structure File

Page 30: BBI-03005(Garima Kushwaha) Version1

A force field is a mathematical expression of the potential which atoms in the system experience. CHARMM, X-PLOR, AMBER, and GROMACS are four types of force fields, and NAMD is able to use all of them.

The CHARMM force field was used containing topology and parameter information. It contains all of the numerical constants needed to evaluate forces and energies, given a PSF structure file and atomic coordinates.

The current versions of the CHARMM forcefield are CHARMM22 for proteins and CHARMM27 for lipids and nucleic acids. The Individual parameter files are named, respectively, par_all22_prot.inp, par_all27_lipid.inp, and par_all27_na.inp. To enable hybrid systems, combinations are also provided, named par_all27_na_lipid.inp, par_all27_prot_lipid.inp, and par_all27_prot_na.inp.

The parameter fie used was par_all27_prot_lipid.inp.

Forcefield Parameter File

Page 31: BBI-03005(Garima Kushwaha) Version1

NAMD uses a file referred to as the configuration file. This file specifies what dynamics options and values that NAMD should use, such as the number of timesteps to perform, initial temperature, etc. The options and values in this file control how the system will be simulated. A NAMD configuration file contains a set of options and values. The options and values specified determine the exact behavior of NAMD, what features are active or inactive, how long the simulation should continue, etc.

NAMD Configuration File

Page 32: BBI-03005(Garima Kushwaha) Version1

At first the PDB file of the desired protein is downloaded from Protein Data Bank.

2. Then the information of water molecules is deleted from that PDB file to create a pdb file of the protein alone.

3. After that a file is created with .pdb extension which contains the coordinates of the protein alone without hydrogense by typing the following commands in the VMD TkConsole window:-

set protein [atomselect top protein]$protein writepdb proteinp.pdb

Steps Involved in Making the Protein Structure File

Page 33: BBI-03005(Garima Kushwaha) Version1

4. Now the PSF file of protein is created using the psfgen package of VMD.In order to create a psf, firslty a pgn file is created which will be the target of psfgen.

5.Open a terminal window and typed the following commands:-package require psfgentopology top all27 prot lipid.inppdbalias residue HIS HSEpdbalias atom ILE CD1 CDsegment U {pdb proteinp.pdb}coordpdb proteinp.pdb Uguesscoordwritepdb protein.pdbwritepsf protein.psf

Page 34: BBI-03005(Garima Kushwaha) Version1

6. After typing save the file by .pgn extension(i.e. protein.pgn).

7. In the terminal window type the following command:-

vmd -dispdev text -e protein.pgn

This run the package psfgen on the file protein.pgn and generate the psf and pdb file of protein with hydrogens.

Page 35: BBI-03005(Garima Kushwaha) Version1

A CHARMM forcefield topology file contains all of the information needed to convert a list of residue names into a complete PSF structure file. It also contains internal coordinates that allow the automatic assignment of coordinates to hydrogens and other atoms missing from a crystal PDB file.

The current versions of the CHARMM forcefield top_all22_prot.inp, top_all27_lipid.inp, and top_all27_na.inp. To enable computation on hybrid systems, combinations are also provided, named top_all27_na_lipid.inp, top_all27_prot_lipid.inp, and top_all27_prot_na.inp.

Topology Files

Page 36: BBI-03005(Garima Kushwaha) Version1
Page 37: BBI-03005(Garima Kushwaha) Version1

The protein needs to be solvated, i.e., put insidewater, to more closely resemble the cellularenvironment. It is done so in two ways, placingprotein in:

• A water sphere in surrounding vacuum, in preparation for minimization and equilibration without periodic boundary conditions.

• A water box, in preparation for minimization and equilibration with periodic boundary conditions.

Solvating the Protein

Page 38: BBI-03005(Garima Kushwaha) Version1

This is to examine the minimization and equilibration of hydrogenase in a water box with periodic boundary conditions.

The configuration file, 1feh_wb eq.conf is opened by typing nedit 1feh_wb eq.conf. In this the simulation parameters and the outputsection is modified.

The command used for running the simulation on parallel machine is as follows

nohup (full path of NAMD executable files)/charmrun+p(number of processors) (full path of NAMD executablefiles)/namd2 (name of configuration file) > (name ofoutput file) </dev/null

Simulation with Periodic Boundary Conditions

Page 39: BBI-03005(Garima Kushwaha) Version1
Page 40: BBI-03005(Garima Kushwaha) Version1

GROMACS Groningen Machine For Chemical Simulation

Gromacs is a molecular dynamics simulation package originally developed in the University of Groningen, now maintained and extended at different places, including University of Uppasal and University of Stokholm and the Max Planck Institute for Polymer Research.

Page 41: BBI-03005(Garima Kushwaha) Version1

The GROMACS project was originally started to construct a dedicated parallel computer system for molecular simulations, based on a ring architecture. The molecular dynamics specific routines were rewritten in the C programming language from the Fortran 77-based program GROMOS, which had been developed in the same group.

The highly optimized code makes GROMACS the fastest program for molecular simulations to date. Besides, the support of different force fields and the open source (GPL) character make GROMACS very flexible.

GROMACS is a high-end, high performance research tool designed for the study of protein-dynamics using classical molecular dynamics theory.

You may download it from http://www.gromacs.org GROMACS:runs on linux, unix, and on Windows(a recent development).

Page 42: BBI-03005(Garima Kushwaha) Version1

In order to start a molecular dynamics simulation using GROMACS we need three input files:

1. The atomic coordinates (and, optionally, velocities) are stored in a file that is conventionally called conf.gro

2. The molecular topology file (conventionally called topol.top) that describes the chemical composition of the system, including    information on the force field parameters, bond lengths, etc.

3. The molecular dynamics parameter file (conventionally called grompp.mdp) which holds parameters like the number of integration steps, treatment of cut-offs and so on.

Gromacs Input File

Page 43: BBI-03005(Garima Kushwaha) Version1

Steps for creating GROMACS Simulation

Page 44: BBI-03005(Garima Kushwaha) Version1

1. Creating a Gromacs topology from the PDB file

Processing of the pdb file is done with pdb2gmx command. The pdb2gmx command converts your pdb file to a gromacs file and writes the topology for you. This file is derived from an NMR structure which contains hydrogen atoms.

The x-y-z co-ordinates of the atoms are stored in a .gro file, and the atomic masses, charges, and bonds are stored in a .top file.

Command used pdb2gmx_d -ignh -ff G43al -f 1FEH.pdb -o feh.pdb -

p feh.top

Page 45: BBI-03005(Garima Kushwaha) Version1

2. Adding solvent water around the protein

• editconf takes the .gro file, and the dimensions of the box which you specify, and append the box dimensions on the last line of the .gro file.

Command used editconf -bt cubic -f feh.pdb -o feh.pdb -c -d 0.9

Page 46: BBI-03005(Garima Kushwaha) Version1

Now that the box dimensions have been specified, it is possible to add more than one of the molecules found in the .pdb file, and to fill the rest of the box with solvent molecules. Both of these operations are handled by the genbox command.

Command usedgenbox_d -cp feh.pdb -cs spc216.gro -o feh_b4em.pdb -p

feh.top

(On periodic images: GROMACS by default uses periodic boundary conditions. ) )

Page 47: BBI-03005(Garima Kushwaha) Version1

Ions are added to the system by the use of the genion

utility. However, genion does not work directly on.gro files. You must first translate the text-

based .grofile into a binary file with the use of the grompputility.

Command usedgenion –f em.mdp –c feh_b4em.pdb –feh.top –o feh_em.tpr

Page 48: BBI-03005(Garima Kushwaha) Version1

• Once the ions are in place, it is time to start running simulations. There are three types of simulation, although the first two are optional.

All three types are executed the same way by the use of first grompp and then mdrun.

grompp takes the .gro, .top and .mdp files, and produces a .tpr file, which is the input to mdrun.

mdrun outputs a large binary file containing the state of the system at regular time intervals (.trr) and also outputs a .gro file which contains the state of the system at the last time step.

Page 49: BBI-03005(Garima Kushwaha) Version1

3. Running energy minimization

All it does is nudge the atoms in the solute molecules

only, until the bond lengths and angles are in theirminimum potential energy configuration

(completelyignoring the other atoms in the system).

Command usedgrompp_d -f em.mdp -c feh_ion.pdb -p feh.top -o

feh_em.tpr mpirun -np 6

/home/class/gromacs-3.3.1/src/kernel/mdrun_d -v –s feh_em.tpr –o feh_em.trr –c feh_b4md.pdb –g em.log –e em.edr

Page 50: BBI-03005(Garima Kushwaha) Version1

4. Carefully equilibrating the water around the protein.

The second simulation is the * “position restraints†simulation. This �ones fixes the solute molecules in place, and allows the solvent and ions to “relax†into their minimum potential �energy positions.

Commands usedgrompp_d -np 6 –f pr.mdp –c feh_b4pr.pdb –r

feh_b4pr.pdb –p feh.top –o feh_pr.tpr

mpirun -np 6 /home/class/gromacs-3.3.1/src/kernel/mdrun_d -v –s feh_pr.tpr –o feh_pr.trr –c feh_b4md.pdb –g pr.log –e pr.edr &

Page 51: BBI-03005(Garima Kushwaha) Version1

5. Running the production simulation

The third simulation is the * “fullâ€simulation, �and is the one in which the full molecular dynamics is calculated. When running GROMACS on a high performance computer, this is the step which will be parallelised.

Command usedgrompp_d -np 6 –f md.mdp –c feh_b4md.pdb –r

feh_b4md.pdb –p feh.top –o feh_md.tpr

mpirun -np 6 /home/class/gromacs-3.3.1/src/kernel/mdrun_d -v –s 1sk8_md.tpr –o 1sk8_md.trr –c 1sk8_pmd.pdb –g md.log –e md.edr

Page 52: BBI-03005(Garima Kushwaha) Version1

A normal movie uses roughly 30 frames/second, so a10-second movie requires 300 simulation trajectoryframes. To make a smooth movie the frames shouldnot be more 1-2 ps apart, or it will just appear toshake nervously. Export a short trajectory from thefirst 2.5 ns in PDB format (readable by PyMOL) as

trjconv –s confout.gro -f traj.xtc -e 2500.0 –o movie.pdb.

Choose the protein group for output rather than theentire system. If you open this trajectory in PyMOL youcan immediately play it using the VCR-style controls onthe bottom right, adjust visual settings in the menus,and even use photorealistic ray-tracing for all images

inthe movie.

Making a movie

Page 53: BBI-03005(Garima Kushwaha) Version1
Page 54: BBI-03005(Garima Kushwaha) Version1

Pymol

PyMOL is a molecular visualization and manipulation system.

The program is designed to meet a variety of the molecular graphics, animation, and presentation needs of research scientists in academia and industry, including display of structure information.

Page 55: BBI-03005(Garima Kushwaha) Version1

Introduction to PyMol

What is PyMol for?– looking at pdb files (protein, nucleic acid, ligands etc.)– making publication quality figures (of models and maps)– NOT for model building

System requirements?– computer (faster is better): PC (Windows/Linux), Mac (OS

X)– a 3-button scroll mouse– works with hardware stereo

Where can I get it?– pymol.sourceforge.net– current version: 0.99– pymol.sourceforge.net/html/ - for the manual

Page 56: BBI-03005(Garima Kushwaha) Version1

How to start the program?

Double-click the application icon (?)or

From a terminal window, type “pymol”You should see a command window anda graphics window (may be combined)

Page 57: BBI-03005(Garima Kushwaha) Version1

Demo

Page 58: BBI-03005(Garima Kushwaha) Version1

Display > Background > white ----set background colour----set background colourDisplay > orthoscopic view ----no perspective ----no perspective

distortiondistortion

Useful Display Settings

Page 59: BBI-03005(Garima Kushwaha) Version1
Page 60: BBI-03005(Garima Kushwaha) Version1

Using mouse in graphics window

• • Unmodified controlsUnmodified controls

– – Left - rotate molecule (x, y and, at edges, z)Left - rotate molecule (x, y and, at edges, z)

– – Middle - translate molecule (x, y)Middle - translate molecule (x, y)

– – Right - zoom (=MovZ)Right - zoom (=MovZ)

– – Wheel - slab/clip Menu at bottom rightWheel - slab/clip Menu at bottom right

• • With shift keyWith shift key

– – Right - up/down: clip Right - up/down: clip

frontfront

- left/right: clip back- left/right: clip back

Page 61: BBI-03005(Garima Kushwaha) Version1
Page 62: BBI-03005(Garima Kushwaha) Version1
Page 63: BBI-03005(Garima Kushwaha) Version1
Page 64: BBI-03005(Garima Kushwaha) Version1
Page 65: BBI-03005(Garima Kushwaha) Version1
Page 66: BBI-03005(Garima Kushwaha) Version1
Page 67: BBI-03005(Garima Kushwaha) Version1
Page 68: BBI-03005(Garima Kushwaha) Version1
Page 69: BBI-03005(Garima Kushwaha) Version1
Page 70: BBI-03005(Garima Kushwaha) Version1
Page 71: BBI-03005(Garima Kushwaha) Version1

VMD

VMD is designed for the visualization and analysis of biological sytemssuch as proteins, nucleic acids, lipid bilayer assemblies, etc.

VMD can read standard Protein Data Bank (PDB) files and display the contained structure.

VMD provides a wide variety of methods for rendering and coloring a molecule: simple points and lines, CPK spheres and cylinders, licorice bonds, backbone tubes and ribbons, cartoon drawings, and others.

VMD can also be used to animate and analyze the trajectory

of a molecular dynamics (MD) simulation.

In particular, VMD can act as a graphical front end for an external MD program by displaying and animating a molecule undergoing simulation on a remote computer.

Page 72: BBI-03005(Garima Kushwaha) Version1

11 Choose the File New Molecule... menu item Fig. 2(a) in the VMD Main window. Another window, the Molecule File Browser (b), will appear in your screen.

2 Use the Browse... (c) button to find the file .

Note that when you select the file, you will be back in the Molecule File Browser window. In order to actually load the file you have to press Load (d).

Loading a Molecule

Page 73: BBI-03005(Garima Kushwaha) Version1

In order to see the 3D structure of our protein we use the mouse in multiple modes.

1. In the OpenGL Display, pressing first (left) mouse button down and moving the mouse is the rotation mode of the mouse and allows to rotate the molecule around an axis parallel to the

screen Fig. 3(a).

2. If you press the second (right) button and repeat the previous step, the rotation is be done around an axis perpendicular to your screen (b)

Displaying the Protein

Page 74: BBI-03005(Garima Kushwaha) Version1

3 In the VMD Main window, look at the Mouse menu (Fig. 4). Here, you will be able to switch the mouse mode from Rotation to Translation or Scale modes.

4 The Translation mode will allow you to move the molecule around the screen while holding the first (left) button down.

5 The Scale mode will allow you to zoom in or out by moving the mouse horizontally while holding the first (left) button down.

Page 75: BBI-03005(Garima Kushwaha) Version1

1 Choose the Graphics Representations... Menu item. A window called Graphical Representations will appear and you will see highlighted in yellow Fig. 5(a) the current graphical representation used to display your molecule.

2 In the Draw Style tab (b) we can change the style(d) and color (c) of the representation. In this section we will focus in the drawing style (the default is Lines).

3 Each Drawing Method has its own parameters. For instance, change the Thickness of the lines by using the controls on the right bottom part (e) of the Graphical Representations window.

4 Now, choose VDW (van der Waals) from Drawing Method. Each atom is now represented by a sphere. In this way you can see more easily the volumetric distribution of the protein.

Exploring Different Drawing Styles

Page 76: BBI-03005(Garima Kushwaha) Version1

5 To see the arrangements of atoms in the interior of the protein, use the new controls on the right bottom part of the window (e) to change the Sphere Scale to 0.5 and the Sphere Resolution to 13.

6 Note in Coloring Method Name, each atom has its own color, i.e: O is red, N is blue, C is cyan and S is yellow.

7 Press the Default button. This allows you to return to the default properties of the drawing method.

8 Choose the Tube style under Drawing Method and observe the

backbone of your protein. Set the Radius at 0.8.

9 By looking at your protein in the tube mode, can you distinguish how many helices, sheets and coils are present in the protein?

The last drawing method we will explore is NewCartoon. It gives a simplified representation of a protein based in its secondary structure. 10 Choose Drawing Method NewCartoon.

Page 77: BBI-03005(Garima Kushwaha) Version1
Page 78: BBI-03005(Garima Kushwaha) Version1

11 Choose Choose Coloring Method ResTypeColoring Method ResType Fig. Fig. 55(c). This (c). This allows you to distinguish non-polar residues allows you to distinguish non-polar residues (white), basic residues (blue), acidic residues (white), basic residues (blue), acidic residues (red) and polar residues (green). (red) and polar residues (green).

22 Select Select Coloring Method StructureColoring Method Structure (c) and confirm (c) and confirm that the NewCartoon representation displays that the NewCartoon representation displays

colors consistent with secondary structure.colors consistent with secondary structure.

Exploring Different Coloring Methods

Page 79: BBI-03005(Garima Kushwaha) Version1

1 In the Selected Atoms text entry Fig. 5(f) of the Graphical Representations window delete the word all, type helix and press the Apply button or hit the Enter/Return key on your keyboard (do this every time you type something). VMD will show just the helices present in our molecule.

2 In the Graphical Representations window choose the Selections tab Fig. 7(a). In section Singlewords (b) you will find a list of possible selections you can type. For instance, try to display sheets instead of helices by typing the appropriate word in the Selected Atoms text entry.

Exploring Different Selections

Page 80: BBI-03005(Garima Kushwaha) Version1

Combinations of boolean operators can also be used when writing a selection.

3 In order to see the molecule without helices and sheets, type the following in Selected Atoms: (not helix)and(not betasheet)

4 In the section Keyword (c) of the Selections tab (a) you can see properties that can be used to select parts of a protein with their possible values. Look at possible values of the Keyword resname (d). Display all the Lysines and Glycines presents in the protein by typing (resname LYS)or(resname GLY). Lysines play a fundamental role in the configuration of polyubiquitin chains.

5 In order to see which water molecules are closer to the protein you can use the command within. Type water and within 3 of protein. This selects all the water molecules that are within a distance of 3 angstroms of the protein.

Page 81: BBI-03005(Garima Kushwaha) Version1

The button Create Rep Fig. 8(a) in the Graphical Representations window allows you to create multiple representations.

1 For the current representation, set the Drawing Method to New Cartoon and the Coloring Method to Structure.

2 In Selected Atoms type protein.

3 Press the Create Rep button (a). Now, using the menu items of the Draw Style tab and the Selected Atoms text entry, modify the new representation in order to get VDW as the Drawing Method, ResType as the Coloring Method, and resname LYS typed in as the current selection.

Multiple RepresentationsMultiple Representations

Page 82: BBI-03005(Garima Kushwaha) Version1

5 Create a final representation by pressing again the Create Rep button. Select Drawing Method Surf, the Coloring Method Molecule and type protein in the Selected Atoms entry. For this last representation choose in the Material section (c) the Transparent menu item.

6 Note, that with the mouse, you can select the different representations you have created and modify each one independently. Also, you can switch each one on/off by double-clicking on it or delete each one by using the Delete Rep button (b). Turn off the second and last representations. At the end of this section, the Graphical Representations window should look like Fig. 8.

Page 83: BBI-03005(Garima Kushwaha) Version1

1 Choose the Extensions ->Analysis -> Sequence Viewer menu item. A window Fig. 9(a) with a list of the amino acids (e) and their properties (b)&(c) will appear in your screen.

2 With the mouse, click over different residues (e) in the list and see how they are highlighted. In addition, the highlighted residue will appear in your OpenGL Display window in yellow and bond drawing method, so you can visualize it easily. Use the right button of the mouse to unselect residues.

3 Using the Zoom controls (f) you can display the entire list of residues in the window. This is especially useful for larger proteins

4 Using the shift key while pressing the mouse button allows you to pick multiple residues at the same time. Look at residues 48, 63, 11 and 29 (e).

Sequence Viewer Extension

Page 84: BBI-03005(Garima Kushwaha) Version1

In the VMD Main window, choose the File ->SaveState menu item. Write an appropriate name (e.g.,myfirststate.vmd) and save it.

The File -> Load State menu item will allow you toload a previously saved VMD state, just like the fileyou saved.

Saving your Work

Page 85: BBI-03005(Garima Kushwaha) Version1
Page 86: BBI-03005(Garima Kushwaha) Version1

CAVERCAVER

CAVER provides rapid, accurate and fully automated calculation of pathways leading from buried cavities to outside solvent in static and dynamic protein structures.

Calculated pathways can be visualized by graphic program PyMol dissecting anatomy and dynamics of entrance tunnels. CAVER allows analysis of any molecular structure including proteins, nucleic acids, inorganic

materials, etc.

Page 87: BBI-03005(Garima Kushwaha) Version1

PyMol plug-in suitable for calculation of pathways in discrete protein structures

Stand alone version

enabling analysis of trajectories from molecular dynamics simulations.

CAVER Versions

Page 88: BBI-03005(Garima Kushwaha) Version1

1. Load a protein structure into PyMOL(File->open >Browse).

2. Select Residues or Atoms forming active place.Click Display and select Sequence. This will display the single letter sequence of the protein in the pdb file as well as ligands, waters, and ions. Click on FeS4, FeS, HC1 from the sequence. The selection is made by the default name as (sele).

3. Start this plugin(Plugin->Caver Tools).4. Check the name of your selection "(sele)" 5. Make sure that the path to the CAVER binary

is correct on the "CAVER Location" entry field.

Plugin integrates PyMOL with CAVER

Page 89: BBI-03005(Garima Kushwaha) Version1

6. Specify starting point by geometric centre of selected atoms/residues by click on the radio button "Use average point from centres of given selections" and write "(sele)" to Selection list

7. Click "Show starting point" button. You should see small crisscross object in the PyMOL Viewer window.

8. Now you can switch to the choice "Use X, Y, Z coordinates to specify starting point" and you can state starting point more precisely by clicking arrows. You should move crisscross object in an empty space near active place in the molecule.

9. Click the "Run CAVER" button.10. Wait some time and see results.

Page 90: BBI-03005(Garima Kushwaha) Version1

Stand-alone version

# Molecule preparation

1. Molecular dynamics run 2. Atom radius preparation 3. Atom types preparation

Page 91: BBI-03005(Garima Kushwaha) Version1

Dynamical behaviour of macromolecules can be studied by many different mathematical-physical

models. One of them is a classical molecular dynamics (MD). Program package

AMBER,CHARM,GROMOS and many others can be used for MD simulation.

Molecular dynamics run

Page 92: BBI-03005(Garima Kushwaha) Version1

Atom radius preparation CAVER considers molecule as a group of ball-

shaped atoms. Therefore, radius of every atom type has to be specified. The van der Waals radii (in A) are used for this purpose. An example of atom radius file named radius.dat can be seen below.H 0.6000HP 1.1000HA 1.4590CE2 1.9080CE3 1.9080CG2 1.9080CH2 1.9080.:

Page 93: BBI-03005(Garima Kushwaha) Version1

Atom types preparation

Atom type is necessary for appropriate assignement of atom radius. Types are specified in the file types.dat. The first line specifies a number of atoms in a molecule. This file can be easily generated from PDB using gawk. An example of atom types file named types.dat can be seen

below.4696

N

H1

H2

H3

CA

HG3

.

:

Page 94: BBI-03005(Garima Kushwaha) Version1

1. Grid cell resolution

2. Name of file with radii 3. Name of file with atomic types 4. Name of file with trajectory 5. Trajectory traversing 6. Specification of starting point 7. Cropping number

Configuration of CAVER

Page 95: BBI-03005(Garima Kushwaha) Version1

CAVER options are specified in a file named CAVER options are specified in a file named config.dat. An example of config.dat file is shown config.dat. An example of config.dat file is shown below.below.

0.80.8

radius.datradius.dat

types.dattypes.dat

simulation.trjsimulation.trj

20 1 0 220 1 0 2

3 3 1652 1667 22853 3 1652 1667 2285

44

Configuration of Caver

Page 96: BBI-03005(Garima Kushwaha) Version1

Running CAVER

caver [-h | --help] [-o <filename>] [-e | --enable-output-trajectory

options-h --help -- Display this help screen.-i file --input file -e --enable-output-trajectory Outputs PDB trajectory of shortest way for PYMOL output, and for GNUPLOT output.--enable-output-vmd --Outputs for VMD visualization.-o file --Output file Writes resulting table of radiuses.-d dir --Output-dir dir Set directory for output files.-t num --tun num --Tries to find num tunnels from active site. (for 0.99.2 version and higher)

CAVER configuration is stored in config.dat and must be present in the working directory.

Page 97: BBI-03005(Garima Kushwaha) Version1

What is a profile of a protein tunnel?

A tunnel connects a protein cavity with bulk solvent. The shape of protein tunnel can be

approximated as a pipeline with a varying width of cross section. This approximation is useful for estimation of the biggest probe accessing the

deepest place in the pocket. The size of a probe able to access internal cavity is limited by radius of tunnel gorge, i.e. the most narrow place in the

tunnel. The tunnel profile is a graph of cross section radius (radius of maximally inscribed ball) versus tunnel length measured from its deepest

place to the surface opening.

Page 98: BBI-03005(Garima Kushwaha) Version1

Sample Profile

Page 99: BBI-03005(Garima Kushwaha) Version1

NAMD: OUTPUT FOR SIMULATION

Following files were created : 1feh_wb_eq.coor.BAK 1feh_wb_eq.dcd 1feh_wb_eq.dcd.BAK 1feh_wb_eq.log 1feh_wb_eq.restart.coor 1feh_wb_eq.restart.coor.old 1feh_wb_eq.restart.vel 1feh_wb_eq.restart.vel.old 1feh_wb_eq.restart.xsc 1feh_wb_eq.restart.xsc.old 1feh_wb_eq.rocks.log 1feh_wb_eq.vel 1feh_wb_eq.vel.BAK 1feh_wb_eq.xsc 1feh_wb_eq.xsc.BAK 1feh_wb_eq.xst 1feh_wb_eq.xst.BAK

Page 100: BBI-03005(Garima Kushwaha) Version1

It is defined as atomic positions compared over theno. of time steps in the simulation.

RMSD for Individual Residues

Page 101: BBI-03005(Garima Kushwaha) Version1
Page 102: BBI-03005(Garima Kushwaha) Version1

Maxwell-Boltzmann Energy Distribution

Page 103: BBI-03005(Garima Kushwaha) Version1

Temperature distribution

Page 104: BBI-03005(Garima Kushwaha) Version1

Energies

Page 105: BBI-03005(Garima Kushwaha) Version1

GROMACS :Radius of Gyration

Page 106: BBI-03005(Garima Kushwaha) Version1

Root Mean Square Deviation

Page 107: BBI-03005(Garima Kushwaha) Version1

CAVER: Via Pymol plug-in

Page 108: BBI-03005(Garima Kushwaha) Version1

Surface on the protein

Page 109: BBI-03005(Garima Kushwaha) Version1

Cartoon structure of the protein with the surface on

tunnels

Page 110: BBI-03005(Garima Kushwaha) Version1

Surface on the protein with transparency

Page 111: BBI-03005(Garima Kushwaha) Version1

Results from Standalone version: Tunnel profile

Page 112: BBI-03005(Garima Kushwaha) Version1

Thus, for the case of CpI, despite the lack ofpermanently open gas channels, the

pathways taken by gas molecules such as O2

and H2 are predefined to lie in areas of theprotein which have a natural disposition

towards greater density fluctuations and theno. of fluctuating channels are much more

than the permanent ones.

Page 113: BBI-03005(Garima Kushwaha) Version1

THANK YOU!!