25
1 AMBER 8.0 Introductory Tutorial John E. Kerrigan, Ph.D. Robert Wood Johnson Medical School The University of Medicine and Dentistry of New Jersey 675 Hoes Lane Piscataway, NJ 08854 USA (732) 235-4473 phone (732) 235-5252 fax [email protected]

AMBER8 Intro

Embed Size (px)

Citation preview

1

AMBER 8.0 Introductory Tutorial

John E. Kerrigan, Ph.D. Robert Wood Johnson Medical School The University of Medicine and Dentistry of New Jersey 675 Hoes Lane Piscataway, NJ 08854 USA (732) 235-4473 phone (732) 235-5252 fax [email protected]

2

Synopsis. Today you will learn how to use one of the most versatile molecular mechanics/dynamics modeling packages, AMBER 8. [1] The AMBER suite of programs was developed by the late Peter A Kollman and colleagues at the University of California San Francisco (UCSF). The Amber package was designed with the ability to address a wide variety of biomolecules including proteins and nucleic acids as well as small molecule drugs. See http://amber.scripps.edu for more information. The following programs are central to the AMBER package: Leap – For preparation of the parameter and coordinate files. xLeap – X-window version with a GUI for structure editing. tLeap – Terminal version (no GUI) Antechamber – special program for preparation of parameter sets (“prep” files) for small molecules (drugs/ligands, lipids). Sander – The number cruncher (aka the “brains” of the Amber MD package) that performs the computations. Ptraj – Trajectory analysis tool. Research Problem. We will study the x-ray crystal structure of the oxytocin-neurophysin complex (1NPO.PDB). [2] We will focus on oxytocin itself. Remember, the x-ray crystal structure has no hydrogen atoms or lone pairs. The AMBER Leap program will take care of adding hydrogens, etc. When working with a crystal structure for the first time, you must carefully review the data in the beginning of the PDB file (REMARKS provide important information!). For example, the structure may be missing side chains in some of its residues or it may even be missing entire residues! A very important information item in a PDB file of a protein is the SSBOND records, which designate all of the disulfide bonds found in the structure. You will need this information for processing your file in Leap.

Note the key disulfide bond in the oxytocin structure to the left. We will use dynamics to study the impact of mutation on this structure. What do you think the impact of removing this disulfide bond will be to the structure? (Image created using VMD [3] and Raster3D [4])

Make a project directory and use it for this exercise. mkdir myproj

3

cd myproj Download 1NPO.pdb from the protein data bank (http://www.rcsb.org) if you are extracting the oxytocin coordinates from the original pdb file or just download oxyt.pdb from the course webpage. Review the PDB file in a text editor (nedit, gedit, kedit, etc.). nedit 1NPO.pdb You will notice that the structure is a dimer of two complexes. You should also notice that in the REMARK section we find that REMARK 6 informs us that the first several residues and the last several residues (86-95) in neurophysin were not found or resolved in the structure. We will ignore these for now as they are not important. However, if you know that you will need missing residues, you may rebuild them using the SEQRES information in your PDB file and the jackal suite of programs (another tutorial, another day!). Using your text editor, remove chains A, C, and D along with any water molecules and ions from the PDB file. We only need one complex! Save your new file as oxyt.pdb. We will use Swiss PDB viewer (DeepView) to analyze the structure for missing side chains. spdbv oxyt_np.pdb Swiss PDB viewer picks up one residue with a missing side chain (59). It is always a good idea to review your PDB file using DeepView! DeepView automatically rebuilds the missing side chain for you! Save your new file. File > Layer… > File Save: oxyt_np.pdb (Click on Save and write over the old file; CAUTION: make certain that you are saving the file to your project directory or DeepView will save the file to your home directory by default!) File > Quit xLeap and tLeap – These programs perform the same function with the difference being that xLeap opens in an X-window interface and tLeap operates from a terminal prompt (no GUI). The principal function of these programs is to prepare the AMBER coordinate (inpcrd or crd for short) and topology (prmtop or just top) files. xleap –s –f leaprc.ff03 The leaprc.ff03 file loads the parameters for the AMBER03 force field. [5] In the xleap X-window prompt or tleap prompt (both are “>”) type the following commands one at a time: oxy = loadPdb oxyt.pdb The bond command creates bonds between atoms. The syntax is: unit_name.residue_number.atom_name. Create the disulfide bond using … bond oxy.1.SG oxy.6.SG

4

check oxy

An illustration of the xLeap Window

After using the check command you will notice several warnings about close contacts and possibly a warning about the overall charge of the unit if the unit is not neutral in overall formal charge. The overall charge is important, as we must neutralize this charge for particle-mesh ewald electrostatics later on. To look at the unit in a graphical display use the “edit” command. edit oxy

5

The following mouse buttons perform as indicated: LEFT - Selection MIDDLE - Rotate RIGHT - Translate MIDDLE + RIGHT - Zoom In and Out The structure should be fine at this point. You should notice that Leap has added hydrogen atoms to the peptide structure. To exit the unit editor, use Unit > Close from the menu. Save a topology and coordinate file for in vacuo runs.

6

saveAmberParm oxy oxy_vac.top oxy_vac.crd We will use the solvateOct command to solvate the structure in TIP3P water [6] using a truncated octahedron periodic box. solvateOct oxy TIP3PBOX 9.0 You have told Leap to solvate the unit in a truncated octahedral box using a spacing distance of 9.0 angstroms around the molecule. Ideally, you should set the spacing at no less than 8.5 Å (~ 3 water layers) to avoid periodicity artifacts. [7] For particle-mesh ewald electrostatics, [8, 9] our box side length must be > 2 X NB cutoff. We will use a 10.0 Å cutoff in our solvated system; therefore, our box side must be > 20 Å. Our box side length will be (2 X 9) + peptide dimension, which should easily be greater than 20 Å. The system must be neutral in terms of overall formal charge. Fortunately, our system is neutral as is. If this had not been the case then we would have used the addIons command to neutralize the charge (use Na+ to counter a negative charge or Cl- to counter a positive charge). The saveAmberParm command saves the parameter file (prmtop or top file) and the initial coordinate file (inpcrd or crd file). We did this before solvating the system so we could perform an in vacuo dynamics simulation for comparision to the solvated system. saveAmberParm oxy oxy.top oxy.crd Quit Terminal Leap can be used to automate this process. The script below can be used to perform the same file preparation you did in xLeap. The oxy.scr script for tleap. source leaprc.ff03 oxy = loadPdb oxyt.pdb bond oxy.1.SG oxy.6.SG check oxy saveAmberParm oxy oxy_vac.top oxy_vac.crd solvateOct oxy TIP3PBOX 9.0 saveAmberParm oxy oxy.top oxy.crd Quit Tleap is very useful in situations where you might need to process a very large structure that requires a large solvent box and ions. Use the following command format for running tLeap. tleap –s –f oxy.scr You’re done with file preparation (hooray!). You will notice a file called “leap.log” in your working directory. Leap records every command and result (as well as commands issued by Leap under the hood) in a log file. Some of the information in this log file is required for publication (e.g. periodic box size; # of water molecules; # of ions if any, etc).

7

Make a separate directory for your in vacuo dynamics runs and move your oxy_vac.top and oxy_vac.crd files there. mkdir in_vacuo mv oxy_vac.* in_vacuo Molecular Dynamics The in vacuo Model. This computation will be performed in two steps using an NVT ensemble. An energy minimization will be done followed by a production dynamics run. Step 1. Energy Minimization of the System. We perform a steepest descents energy minimization to relieve bad steric interactions that would otherwise cause problems with or dynamics runs followed up with conjugate gradient minimization to get closer to an energy minimum. Using the Sander program. The SANDER program is the number crunching juggernaut of the AMBER software package. SANDER will perform energy minimization, dynamics and NMR refinement calculations. You must specify an input file to tell SANDER what computations you want to perform and how you would like to perform those computations. Study the input file for energy minimization below. min_vac.in oxytocin: initial minimization prior to MD &cntrl imin = 1, maxcyc = 500, ncyc = 250, ntb = 0, igb = 0, cut = 12 /

General format of the sander command and its flags (The following is just an example, do not try to run it as it will crash!): sander –O –i in –o out –p prmtop –c inpcrd –r restrt [-ref refc –x mdcrd –v mdvel –e mden –inf mdinfo] Issue the following command to run your in vacuo energy minimization. nohup sander –O –i min_vac.in –o min_vac.out –p oxy_vac.top –c oxy_vac.crd –r oxy_vacmin.rst &

8

We use “nohup” to prevent hang-ups. The ampersand at the end of the command runs the job in background. Monitor the progress using “tail –20 min_vac.out” command. The in vacuo minimization should proceed very rapidly (It only took 1 second on our 2.2 GHz linux workstation!). Step 2. Molecular dynamics. md_vac.in oxytocin MD in-vacuo, 12 angstrom cut off, 250 ps &cntrl imin = 0, ntb = 0, igb = 0, ntpr = 100, ntwx = 500, ntt = 3, gamma_ln = 1.0, tempi = 300.0, temp0 = 300.0, nstlim = 125000, dt = 0.002, cut = 12.0 /

nohup sander –O –i md_vac.in –o md_vac.out –p oxy_vac.top –c oxy_vacmin.rst –r oxy_vacmd.rst –x oxy_vacmd.mdcrd –ref oxy_vacmin.rst –inf mdvac.info & This computation took about 3.8 minutes on a 2.2 GHz linux workstation. The stuff in the input files in a nutshell. There are many more settings/flags above than we can possibly explain here. So, for more in-depth info please consult the AMBER User’s Manual. The settings important for minimization are highlighted and explained below. &cntrl and / – Most if not all of your instructions must appear in the “control” block (hence &cntrl). cut = nonbonded cutoff in angstroms. ntr = Flag used to perform position restraints (1 = on, 0 = off) imin = Flag to run energy minimization (if = 1 then perform minimization; if = 0 then perform molecular dynamics). macyc = maximum # of cycles ncyc = After ncyc cycles the minimization method will switch from steepest descents to conjugate gradient. ntmin = Flag for minimization method. (if = 0 then perform full conjugate gradient min with the first 10 cycles being steepest descent and every nonbonded pairlist update; if = 1 for ncyc cycles steepest descent is used then conjugate gradient is switched on [default]; if = 2 then only use steepest descent) dx0 = The initial step length dxm = The maximum step allowed drms = gradient convergence criterion 1.0E-4 kcal/mol•Å is the default Molecular Dynamics in a Water Box

9

This job will be accomplished in 4 steps: Step 1. Restrained Minimization – relieve bad vander Waals contacts in the surrounding solvent while keeping the solute (protein) restrained. Step 2. Unrestrained Minimization – Relieve bad contacts in the entire system. Step 3. Restrained Dynamics – Relax the solvent layers around the solute while gradually bringing the system temperature from 0 K to 300 K. Step 4. Production Run – Run the production dynamics at 300 K and 1 bar pressure.

****************** Step 1. - Restrained Minimization oxytocin: initial minimisation solvent + ions &cntrl imin = 1, maxcyc = 1000, ncyc = 500, ntb = 1, ntr = 1, cut = 10 / Hold the protein fixed 500.0 RES 1 9 END END

Hold the peptide fixed 500.0 (This is the force in kcal/mol used to restrain the atom positions.) RES 1 9 (Tells AMBER to apply this force to residue #’s 1 to 9). nohup sander –O –i min1.in –o min1.out –p oxy.top –c oxy.crd –r oxy_min1.rst –ref oxy.crd & Step 2. Unrestrained Minimization – oxytocin: initial minimisation whole system &cntrl imin = 1, maxcyc = 2500, ncyc = 1000, ntb = 1, ntr = 0, cut = 10 /

nohup sander –O –i min2.in –o min2.out –p oxy.top –c oxy_min1.rst –r oxy_min2.rst &

10

Step 3. Position-Restrained Dynamics. This initial dynamics run is performed to relax the positions of the solvent molecules. In this dynamics run, we will keep the macromolecule atom positions restrained (not fixed, however). In a position-restrained run, we apply a force to the specified atoms to minimize their movement during the dynamics. The solvent we are using in our system, water, has a relaxation time of 10 ps; therefore we need to perform at least > 10 ps of position restrained dynamics to relax the water in our periodic box. Contents of md1.in oxytocin: 20ps MD with res on protein &cntrl imin = 0, irest = 0, ntx = 1, ntb = 1, cut = 10, ntr = 1, ntc = 2, ntf = 2, tempi = 0.0, temp0 = 300.0, ntt = 3, gamma_ln = 1.0, nstlim = 10000, dt = 0.002, ntpr = 100, ntwx = 500, ntwr = 1000 / Keep protein fixed with weak restraints 10.0 RES 1 9 END END ntb = 1 Constant volume dynamics. imin = 0 Switch to indicate that we are running a dynamics. nstlim = # of steps limit. dt = 0.002 time step in ps (2 fs) temp0 = 300 reference temp (in degrees K) at which system is to be kept. tempi = 100 initial temperature (in degrees K) gamma_ln = 1 collision frequency in ps-1 when ntt = 3 (see Amber 8 manual). ntt = 3 temperature scaling switch (3 = use langevin dynamics) tautp = 0.1 Time constant for the heat bath (default = 1.0) smaller constant gives tighter coupling. vlimit = 20.0 used to avoid occasional instability in dynamics runs (velocity limit; 20.0 is the default). If any velocity component is > vlimit, then the component will be reduced to vlimit. comp = 44.6 unit of compressibility for the solvent (H2O) ntc = 2 Flag for the Shake algorithm (1 – No Shake is performed; 2 – bonds to hydrogen are constrained; 3 – all bonds are constrained). tol = #.##### relative geometric tolerance for coordinate resetting in shake.

11

You will note that we used a smaller restraint force (10 kcal/mol). For dynamics, one only needs to use 5 to 10 kcal/mol restraint force when ntr = 1 (uses a harmonic potential to restrain coordinates to a reference frame; hence, the need to include reference coordinates with the –ref flag.). Larger restraint forces lead to instability in the shake algorithm with a 2 fs time step. Larger restraint force constants lead to higher frequency vibrations, which in turn lead to the instability. Excess motion away from the reference coordinates is not possible due to the steepness of the harmonic potential. Therefore, large restraint force constants are not necessary. nohup sander –O –i md1.in –o md1.out –p oxy.top –c oxy_min2.rst –r oxy_md1.rst –x oxy_md1.mdcrd –ref oxy_min2.rst –inf md1.info & Monitor the progress using “tail –40 md1.out”. Our run took approximately 32 minutes with no other processes running in background on a 2.2 GHz linux workstation. Step 4. The Production Run. This is where we do the actual molecular dynamics run. You will do a 250 ps run. Contents of md2.in oxytocin: 250ps MD &cntrl imin = 0, irest = 1, ntx = 7, ntb = 2, pres0 = 1.0, ntp = 1, taup = 2.0, cut = 10, ntr = 0, ntc = 2, ntf = 2, tempi = 300.0, temp0 = 300.0, ntt = 3, gamma_ln = 1.0, nstlim = 125000, dt = 0.002, ntpr = 100, ntwx = 500, ntwr = 1000 /

ntb = 2 Constant pressure dynamics. ntp = 1 md with isotropic position scaling. taup = 2.0 pressure relaxation time in ps pres0 = 1 reference pressure in bar. Issue the following command to run your dynamics simulation. nohup sander –O –i md2.in –o md2.out –p oxy.top –c oxy_md1.rst –r oxy_md2.rst –x oxy_md2.mdcrd –ref oxy_md1.rst –inf md2.info & Monitor the progress using “tail –40 md2.out”. Our run took approximately 7.5 hours to run on a 2.2 GHz linux workstation. Analysis

12

Copy process_mdout.perl to your working project directory. Use the “which” command to determine the location of your perl interpreter (type “which perl”). On the SGI, the location (after #!) should be /usr/sbin/perl and on linux this location should be /usr/bin/perl. Check to make sure that the path in the perl script (1st line in the script) matches whatever path was given by the result of your which command. Make sure that the program is executable (use chmod 755 filename) where filename = process_mdout.perl. ./process_mdout.perl md1.out md2.out The resulting files are readable by the Grace program (http://plasma-gate.weizmann.ac.il/Grace/ ) or as space delimited files in Excel. The Grace program is available on UMDNJ SGI and linux machines in the computer labs. The Grace program is available for free download from the URL. We will take a look at summary.EPTOT (potential energy plot). Here is our result… xmgrace summary.EPTOT

The potential energy fluctuates throughout the simulation. There is a jump in energy early on during the water equilibration (restrained MD in the first 20 ps) phase followed by a general trend toward lower energy, which is a good sign that the dynamics is leading toward lower energy conformations. Plot other files. Use Microsoft Excel to read in the file as a space delimited file or use Grace on the unix machines.

13

summary.TEMP gives the temperature fluctuation with time. summary.PRES gives the pressure fluctuation with time. The RMS plot. We will use the ptraj program. Contents of rms.in trajin oxy_md1.mdcrd trajin oxy_md2.mdcrd rms first out oxy.rms @N,C,CA time 1.0

trajin – specifies trajectory file to process rms – computed RMS fit to the first structure of the first trajectory read in. out – specifies name of output file @N, C, CA – Atom mask specifier (backbone atoms) (Note: The @ symbol is the atom specifier;

alternately or in combination, you may use the colon : to specify residue ID # as well) For example if you only desired to examine the backbone atoms of residue #23, use :23@N,C,CA

time – tells ptraj that each frame represents 1 ps Issue the following command to run ptraj. ptraj oxy.top < rms.in xmgrace oxy.rms

14

Computing an average structure. Our RMSD plot indicates a region of stability from 50 to 200 ps. We will use these time points to compute an average. Contents of avg.in trajin oxy_md2.mdcrd 50 200 center :1-9 image center familiar rms first mass out av_rms.dat :1-9 average oxy_avg.pdb pdb

ptraj oxy.top < avg.in The first steps involve centering the box followed by mass weighted RMS fit analysis, which is required for computation of the final average structure. The water molecule positions are highly variable in the average structure and will be essentially worthless. Remove the water molecules … grep –v ‘WAT’ oxy_avg.pdb > oxy_avg_nw.pdb Use rasmol to view the structure.

15

rasmol oxy_avg_nw.pdb You will notice that the structure is rather crude. The hydrogen atom positions are distorted. You will need to refine the structure average using energy minimization. A NOTE ABOUT AVERAGE STRUCTURES: Average structures are good when you need a structure that is representative of a specific time range in the trajectory (e.g. a region of stability or equilibrium). Average structures are not good for monitoring the positions of hydrogen atoms. If you are interested in hydrogen atom positions, review a range of snapshots from the trajectory. Refinement of molecules with AMBER can be difficult. There are two problems that need to be addressed: 1. The residue names. 2. The hydrogen atom names. A few of the peptide residues will be labeled differently than IUPAC format (e.g. Histidine and Cysteine). These names must be corrected. There is no standardized format for hydrogen atom names in PDB files. This is not a problem because we will delete the hydrogen atoms and replace them before minimization. The heavy atom positions are most important for time average structures. You may generate amber coordinate file from an average structure computation as follows: trajin oxy_md2.mdcrd 50 200 center * image center familiar rms first mass out av_rms.dat * average oxy_avg.rst restrt

The asterisk * is a wildcard that specifies all atoms. Now convert the amber coordinate file to a PDB file that can be used with Leap as follows … ambpdb –aatm –p oxy.top < oxy_avg.rst > oxy_avg2.pdb grep –v ‘WAT’ oxy_avg2.pdb > oxy_avg2_nw.pdb This should give a PDB file with the proper set of hydrogen atom names for use with Leap. Create a new in vacuo crd and top files with Leap and use the in vacuo minimization input to setup the refinement of your average structure. Analysis of hydrogen bonds over the course of the trajectory. Use the following input file (hbond.in) to ptraj.

16

Illustration of hydrogen bonds to be analyzed. (Image created using VMD [3] and Raster3D [4]) trajin oxy_md2.mdcrd donor TYR O acceptor ASN N H acceptor CYX N H hbond distance 3.5 angle 120.0 includeself donor acceptor neighbor 2 series \ hbond

DONOR/ACCEPTOR – use to specify the donor acceptor atoms. DISTANCE – use to specify the cutoff distance in angstroms between the heavy atoms participating in the interaction. (3.0 is the default) INCLUDESELF – include intramolecular H-bond interactions if any. ANGLE – The H-bond angle cutoff (H-donor-acceptor) in degrees. (0 is the default) SERIES – Directs output of H-bond data summary to STDOUT. ptraj oxy.top < hbond.in > hbond.dat The statistical analysis in the hbond.dat file will be most important. Look for those H-bonds with a high % occupancy (> 60%). These are the more stable H-bonds. Throw out results with angles less than 120 degrees. The higher the % occupancy; the better. When analyzing H-bond data, it is best to establish reasonable guidelines for the distance and angle cutoffs. A recent paper by Chapman et al. provides a nice discussion of hydrogen bond criteria. [10] Torsion angle measurements of the peptide backbone. (torsions.in). trajin oxy_md2.mdcrd dihedral phi_1 :1@C :2@N :2@CA :2@C out phi_1 dihedral psi_1 :1@N :1@CA :1@C :2@N out psi_1 dihedral omega_1 :1@CA :1@C :2@N :2@CA out omega_1 dihedral phi_2 :2@C :3@N :3@CA :3@C out phi_2

17

dihedral psi_2 :2@N :2@CA :2@C :3@N out psi_2 dihedral omega_2 :2@CA :2@C :3@N :3@CA out omega_2 dihedral phi_3 :3@C :4@N :4@CA :4@C out phi_3 dihedral psi_3 :3@N :3@CA :3@C :4@N out psi_3 dihedral omega_3 :3@CA :3@C :4@N :4@CA out omega_3 dihedral phi_4 :4@C :5@N :5@CA :5@C out phi_4 dihedral psi_4 :4@N :4@CA :4@C :5@N out psi_4 dihedral omega_4 :4@CA :4@C :5@N :5@CA out omega_4 dihedral phi_5 :5@C :6@N :6@CA :6@C out phi_5 dihedral psi_5 :5@N :5@CA :5@C :6@N out psi_5 dihedral omega_5 :5@CA :5@C :6@N :6@CA out omega_5

ptraj oxy.top < torsions.in The individual files can be plotted with xmgrace or Microsoft Excel to view the dihedral angle fluctuation with time in the simulation. How to view a trajectory movie with VMD. A viewer utility that you may use on various platforms to view Amber coordinate files and trajectory files is Visual Molecular Dynamics (VMD; see http://www.ks.uiuc.edu/Research/vmd/ ). [3] In your unix shell, type “vmd” to start the VMD program (IMPORTANT: it is best to start the program from within your working project directory!). VMD Main: File > New Molecule > Molecule File Browser > Filename: Click on Browse… > Choose a molecule file: oxy.top > Click OK Molecule File Browser > Determine file type: parm7 > Click on Load Keep the Molecule File Browser window open! Molecule File Browser > Filename: Click on Browse… > Choose a molecule file: oxy_md2.mdcrd > Click OK Molecule File Browser > Determine file type: crdbox > Click on Load You will notice lots of water molecules. Use Graphics Representations to view just the protein. VMD Main: Graphics > Representations > Graphical Representations > Selected Atoms: protein Click Apply Use the left mouse button for x-y rotation and the middle mouse button for z rotation. Hit the s key on your keyboard and the mouse goes into scale (zoom in and out) mode. Hit the t button and the mouse will be in translation mode. Hit the r button to return to rotation mode.

18

The AMBPDB Conversion Program How to use the ambpdb program to convert an amber restart coordinate file to a PDB file. For example: ambpdb –aatm –bres –p oxy.top < oxy_md2.rst > oxy_md2.pdb The –aatm flag insures that the hydrogen atom names conform to PDB standard and the –bres flag insures that the residue names conform to PDB standard. Do not use the –bres flag if you will be using this pdb file as input to xLeap for another computation! We will use this pdb file for LSQ fit to the in vacuo structure. We will use a utility from the Gromacs software package [11] known as g_confrms to perform the LSQ fit. First, remove the water molecules from the PDB file you created. grep –v ‘WAT’ oxy_md2.pdb > oxy_md2_nw.pdb Next, convert your in vacuo structure to a pdb file. ambpdb –aatm –bres –p oxy_vac.top < oxy_vacmd.rst > oxy_vacmd.pdb g_confrms –f1 oxy_md2_nw.pdb –f2 oxy_vacmd.pdb –o oxy_vac_wet_fit.pdb Select Group 1 (Protein) for both structures. The program will process and report a RMSD for the fit in nm (gromacs uses nm not Angstroms!). Our RMSD = 0.34226 nm = 3.4226 Å Use “rasmol –nmrpdb oxy_vac_wet_fit.pdb” to view the fit structure. Use Colours > Model and Display > Cartoon to view the two structure backbones coloured independently.

Legend: Blue = water box dynamics structure; Red = in vacuo dynamics result Computing B-Factors

19

B-Factors are commonly reported in crystal structures and NMR structures (pseudo B-factors). These values give the approximate thermal (kinetic) energy of the particle or atom. The analysis below will give us information about the motion of the backbone by residue. bfactor.in trajin oxy_md2.mdcrd rms first out oxy_ca.b :1-9@CA atomicfluct out oxy.bfactor @CA byatom bfactor go ptraj oxy.top < bfactor.in Plot the oxy.bfactor file using either xmgrace (linux) or Excel (windows). The plot will give the bfactor on the y-axis and the residue number on the x-axis. Using Procheck to check the stereochemical integrity of your structure. Procheck is a protein structure analysis program [12] available from (http://www.biochem.ucl.ac.uk/~roman/procheck/procheck.html ) that you should use to evaluate the results of your dynamics study. Take your structure, read it into SPDBV[13] or Sybyl (Tripos, Inc.) and remove the hydrogen atoms to prepare it for analysis. Make a separate directory for the analysis and move the PDB file into this directory. If you are using the DOS/Windows version of Procheck, copy the procheck.prm and procheck.dat files to this directory. Issue the command below to initiate the analysis. The “2.0” is the resolution in angstroms (if you were analyzing a crystal structure). Two angstroms is a nominal value used for the analysis of structures derived from NMR or modeling (or, for the case of models use the resolution of the starting crystal structure). Here is how to run procheck on two different systems. Linux/Unix Procheck: procheck filename.pdb 2.0 DOS/Windows Procheck: pro filename 2.0 Do not use the .pdb suffix for DOS procheck! Procheck writes a number of files as output. A number of postscript files (plots) are generated [use ghostscript (http://www.cs.wisc.edu/~ghost/) or imagemagick (http://freealter.org/doc_distrib/ImageMagick-5.1.1/ ) to convert to PDF format or some other graphics format.]. We have ImageMagick on our Linux machines. Use imagemagick’s “convert” program (e.g. “convert file.ps file.png” converts file.ps to a PNG file). The following is a summary of our results. +----------<<< P R O C H E C K S U M M A R Y >>>----------+

20

| | | oxy_md2_nw.pdb 2.0 9 residues | | | +| Ramachandran plot: 83.3% core 16.7% allow 0.0% gener 0.0% disall | | | *| All Ramachandrans: 1 labelled residues (out of 7) | | Chi1-chi2 plots: 0 labelled residues (out of 0) | +| Main-chain params: 4 better 0 inside 2 worse | | Side-chain params: 5 better 0 inside 0 worse | | | *| Residue properties: Max.deviation: 3.6 Bad contacts: 0 | *| Bond len/angle: 6.0 Morris et al class: 1 1 2

Based on an analysis of 118 structures of resolution of at least 2.0 Angstroms and R-factor no greater than 20.0 a good quality model would be expected to have over 90% in the most favoured regions [A,B,L].

*| G-factors Dihedrals: -1.11 Covalent: -2.14 Overall: -1.71 | | | | M/c bond lengths: 85.7% within limits 14.3% highlighted | | M/c bond angles: 61.9% within limits 38.1% highlighted |

G-factors provide a measure of how unusual, or out-of-the-ordinary, a property is.

Values below -0.5* - unusual Values below -1.0** - highly unusual Important note: The main-chain bond-lengths and bond angles are compared with the Engh & Huber (1991) ideal values derived from small-molecule data. Therefore, structures refined using different restraints may show apparently large deviations from normality. The main chain bond lengths and angles are highly unusual in our case because this is an unrefined structure (i.e. not energy minimized). Perform an energy minimization on your structure and compare! file_01.ps is the Ramachandran plot

21

Use the convert utility (ImageMagick) to convert the postscript file to an acceptable image file. To convert to jpeg use the following command: convert file_01.ps file_rama.jpeg Load the image into imageview. imgview file_rama.jpeg or use ImageMagick’s display program display file_rama.jpeg

22

Note that at least four of our residues fall in the helix region (A) of the Ramachandran map. [14] The triangles are the glycine residues, which are special (they may appear in disallowed regions of the plot!). Some points may overlap and not be visible.

23

H

H

H

H

H

NH

O

HO

HN

H

H

O

H

H

ψ

φ

χ

ω

Dihedral angles in a polypeptide chain. Adapted from Schulz and Schirmer.[14] Questions to consider: 1. Do you notice any clustering of hydrophobic side chains in this model? 2. In your dihedral angle plots (φ, ψ, and ω), which was the most variable? What region of the backbone was most flexible (analyze the φ and ψ angles as well as the bfactor results)? What region was more rigid? What interactions contributed to this observed rigidity? 3. Comment on the differences observed between the in vacuo vs the water box results. Why does the in vacuo structure appear to be more compact? 4. Mutate one of the cysteine residues to an alanine. You will need to change the residue name of the remaining cysteine residue from CYX to CYS (Amber uses CYX to designate cysteine residues that are disulfide bonded and the normal CYS residue name to designate cysteine residues not involved in disulfide bonding.) in your PDB file (start with oxyt.pdb). One approach is to use the mutate function in DeepView (spdbv). Before loading the structure into

24

DeepView, change all of the CYX residue names to CYS (otherwise DeepView will be confused by CYX and change these to HETATM records!) using the Search/Replace function of any text editor. Mutate one of the CYS residues to ALA using the mutate button in DeepView, then use File > Save > Layer to save the new pdb file. After you save the new PDB file from DeepView, edit out the SPDBV lines in the file using any text editor. Perform a 250 ps water box simulation just as you did in the tutorial. Compare your mutated structure final snapshot with the structure snapshot (oxy_md2.rst) of the wild type structure. What differences do you notice in comparing the backbones? Are there new stabilizing interactions (H-Bonds or Hydrophobic?)? What role does the solvent (water) play in the outcome?

25

Bibliography: 1. Case, D.A., et al., Amber 8. 2004, University of California: San Francisco, CA. 2. Rose, J., et al., Crystal structure of the neurophysin-oxytocin complex. Nature Struct.

Biol., 1996. 3(2): p. 163-169. 3. Humphrey, W., A. Dalke, and K. Schulten, VMD - Visual Molecular Dynamics. J. Molec.

Graphics, 1996. 14.1: p. 33-38. 4. Merritt, E.A. and D.J. Bacon, Raster3D: Photorealistic Molecular Graphics. Methods

Enzymol., 1997. 277: p. 505-524. 5. Duan, Y., et al., A point-charge force field for molecular mechanics simulations of

proteins based on condensed-phase quantum mechanical calculations. J Comput Chem, 2003. 24(16): p. 1999-2012.

6. Jorgensen, W., et al., Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys., 1983. 79: p. 926-935.

7. Weber, W., P.H. Hünenberger, and J.A. McCammon, Molecular Dynamics Simulations of a Polyalanine Octapeptide under Ewald Boundary Conditions: Influence of Artificial Periodicity on Peptide Conformation. J. Phys. Chem. B, 2000. 104(15): p. 3668-3575.

8. Darden, T., D. York, and L. Pedersen, Particle Mesh Ewald: An N-log(N) method for Ewald sums in large systems. J. Chem. Phys., 1993. 98: p. 10089-10092.

9. Essmann, U., et al., A smooth particle mesh ewald potential. J. Chem. Phys., 1995. 103: p. 8577-8592.

10. Fabiola, F., et al., An Improved Hydrogen Bond Potential: Impact on Medium Resolution Protein Structures. Protein Sci., 2002. 11: p. 1415-1423.

11. Lindahl, E., B. Hess, and D. van der Spoel, GROMACS 3.0: a package for molecular simulation and trajectory analysis. J. Mol. Model, 2001. 7: p. 306-317.

12. Laskowski, R., et al., PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Cryst., 1993. 26: p. 283-291.

13. Guex, N. and M.C. Peitsch, SWISS-MODEL and the Swiss-Pdb Viewer: An Environment for Comparative Protein Modeling. Electrophoresis, 1997. 18: p. 2714-2723.

14. Schulz, G.E. and R.H. Schirmer, Principles of Protein Structure. Springer Advanced Texts in Chemistry, ed. C.R. Cantor. 1979, New York: Springer-Verlag. 314.