X-RAY CRYSTALLOGRAPHY OF YEAST CYTOSKELETAL PROTEINS · 2018-01-09 · x-ray crystallography of fission yeast rng2 wang chern hoe b.sc. (hons.), nus a thesis submitted for the degree

X-RAY CRYSTALLOGRAPHY OF FISSION YEAST RNG2

X-R

AY

CR

YSTA

LLOG

RA

PHY

OF FISSIO

N Y

EAST R

NG

2

WA

NG

CH

ERN

HO

E 2004

WANG CHERN HOE

INSTITUTE OF MOLECULAR AND CELL BIOLOGY

2004

X-RAY CRYSTALLOGRAPHY OF FISSION YEAST RNG2

WANG CHERN HOE

B.Sc. (Hons.), NUS

A THESIS SUBMITTED

FOR THE DEGREE OF DOCTOR OF PHILOSOPHY

INSTITUTE OF MOLECULAR AND CELL BIOLOGY

AN A*STAR BIOMEDICAL SCIENCES INSTITUTE AFFILIATED TO THE NATIONAL

UNIVERSITY OF SINGAPORE

2004

Acknowledgements

This thesis is dedicated in memory of my mother and late father. Dad,

although you left me long ago, I have not forgotten about you and your hardwork as a

chemical engineer. Mom, I know you have suffered many years in his absence and

from the sudden lost of his financial support. Working from dawn to dusk all this while,

I hope this thesis brings you some pride. 情爱的刘小姣，若没你的支持，我是不会

有今天的！

I thank Stephen Wong Yeow Whye and Wong Siew Chin from the

Bioinformatics Institute (www.bii.a-star.edu.sg) for allowing me to use their Linux

cluster for the 5 nanosecond molecular dynamics simulation.

Lastly, I would like to thank my supervisor Dr Terje Dokland for his patience

and support throughout the entire period of my candidature.

i

Table of contents

Acknowledgements.......................................................................................................... i

Table of contents............................................................................................................. ii

List of figures................................................................................................................. iv

List of tables.................................................................................................................viii

List of abbreviations ...................................................................................................... ix

List of publications ......................................................................................................... x

Abstract .......................................................................................................................... xi

Summary .......................................................................................................................xii

1. Biological background ............................................................................................... 1

1.1. Fission yeast cytokinesis.................................................................................. 1

1.2. Fission yeast Rng2 ........................................................................................... 7

1.3. The calponin-homology domain...................................................................... 8

2. Crystallographic background ................................................................................... 12

2.1. Crystal symmetry........................................................................................... 12

2.2. Reciprocal lattice ........................................................................................... 17

2.3. Bragg's law..................................................................................................... 18

2.4. Scattering by an electron................................................................................ 20

2.5. Data reduction................................................................................................ 22

2.6. Scattering by an atom .................................................................................... 27

2.7. Anomalous scattering by an atom.................................................................. 28

2.8. Location of heavy atoms and the determination of protein phase angles...... 30

2.9. Phase improvement........................................................................................ 33

2.10. Refinement..................................................................................................... 34

ii

3. Materials and methods ............................................................................................. 37

3.1. Plasmid construction...................................................................................... 37

3.2. Protein expression.......................................................................................... 38

3.3. Protein purification ........................................................................................ 39

3.4. Protein crystallization .................................................................................... 40

3.5. Data collection ............................................................................................... 41

3.6. Structure determination.................................................................................. 42

3.7. Molecular dynamics simulation in an aqueous system.................................. 48

3.8. Phylogenetic analysis..................................................................................... 49

3.9. Generation of figures and tables .................................................................... 50

4. Results and discussion.............................................................................................. 51

4.1. Overall structure ............................................................................................ 51

4.2. Comparison of CH domain structures ........................................................... 58

4.3. Intermolecular contacts.................................................................................. 62

4.4. Comparison of Br- and Hg-derivatives of Rng232-189 .................................... 72

4.5. Calponin-homology domain phylogeny ........................................................ 78

5. Concluding remarks: Rng2 structure and function .................................................. 82

6. Appendix .................................................................................................................. 85

7. References ................................................................................................................ 96

iii

List of figures

Figure 1: Domain organization within Rng2. Domain boundaries are defined by performing a global Pfam search (Eddy, 1998) across a Pfam HMM library (Bateman et al., 2002), and shown in red numerals. ....................................................................... 8 Figure 2.1: The right-handed axial system is shown on a triclinic P lattice. ................ 15 Figure 2.2: One P unit cell of a monoclinic structure, with scattering entities at the lattice points. X-ray beam are "reflected" by the (110) planes of the monoclinic structure......................................................................................................................... 20 Figure 3: Thrombin cleavage of GST-Rng21-189. Lane M, molecular weight standard; Lane 1, before cleavage; Lane 2, eluate containing Rng21-189 after 3 hr cleavage at 4 °C with 20 U of thrombin; Lane 3, eluate using glutathione buffer; Lane 4, purified Rng21-

189................................................................................................................................... 40 Figure 4: Crystals of Rng21-189. They typically measure 200 µm in the longest dimension. ..................................................................................................................... 41 Figure 5: Harker section y = ½ corresponding to the Br positions in a Br-anomalous difference Patterson map............................................................................................... 43 Figure 6: Harker section y = ½ corresponding to the Hg positions in a Hg-anomalous difference Patterson map............................................................................................... 44 Figure 7(a): Multiple sequence alignment of calponin-homology domains from proteins known to bind F-actin. The blue square scale at the top shows the relative accessibility of each residue extracted from DSSP (Kabsch and Sander, 1983) - blue with red borders indicate exposed residues with an accessibility value A > 1; blue indicates accessible residues with A > 0.4, cyan indicates intermediate accessibility with 0.1 < A < 0.4, and white indicates buried residues with A < 0.1. The secondary structure of Rng2 and its residue numbering follows below and are shown in black. The predicted secondary structure of Rng2 based on PHD is shown at the bottom in red. The residues corresponding to Rng2 core α-helices (Figure 8c) are highlighted in yellow............................................................................................................................ 53 Figure 7(b): Structural alignment of all known calponin-homology domains. In the sequence name column, the number following the four digit PDB coordinate entry refers to either a CH1 or CH2 domain; it is absent for single CH domains. The last alphabet is a chain identifier. The cyan square scale at the bottom shows the hydropathic character of each residue according to the algorithm of Kyte and Doolittle (1982) with a window size of 3 - pink indicates hydrophobic residues with a value H > 1.5; grey indicates intermediate hydrophobicity with -1.5 < H < 1.5 and cyan indicates hydrophilic residues with H < -1.5. .............................................................................. 54 Figure 7(c): Multiple sequence alignment of calponin-homology domains from the IQGAP family of proteins............................................................................................. 54

iv

Figure 8: Structure of Rng232-189. (a) Ribbon representation of the crystal structure of Rng232-189 shown in two orientations with relative rotation of 180°. Bromine atoms are represented by spheres coloured according to B factors (blue to red denoting lower to higher values). The amino- and carboxyl-terminal residues are labelled together with α-helices a1-a7 and 310-helices h1-h3. For comparison purposes, the region corresponding to actin-binding sites ABS1 (a1) and ABS2 (h2-a6) of dystrophin CH1 domain (Norwood et al., 2000) are highlighted in green and blue, respectively. The 39 residues carboxyl-terminal to the CH domain (150-189) are highlighted in red. Orientations in (a)-(c) are the same. The molecular surfaces below the ribbon diagram are in the same orientation. Surface residue conservation scores ranging from 0 to 1 are coloured based on a white (not conserved) to red (conserved) scale. Kyte-Doolittle (Kyte and Doolittle, 1982) surface hydrophobicities ranging from -4.5 to 4.5 are coloured based on a cyan (hydrophilic) to wheat (hydrophobic) scale. Electrostatic surface potentials ranging from -155.214 kT/e to 318.183 kT/e are coloured based on a red (acidic) to blue (basic) scale, with a scale factor of 1............................................. 55 Figure 8(b): Cα backbone superposition of the CH domains from the Br-derivative of Rng2 CH, Hg-derivative of Rng2 CH, dystrophin CH1, dystrophin CH2, calponin CH, EB1 CH, fimbrin CH1-1, fimbrin CH1-2, utrophin CH1, utrophin CH2, plectin CH1, plectin CH2 and beta-spectrin CH2. For clarity, only the Br-derivative of Rng232-189 CH (pink), dystrophin CH1 (red), calponin CH (orange) and plectin CH2 (blue) domains are shown here................................................................................................ 55 Figure 8(c): Cα backbone superposition of the four core α-helices belonging to the Br-derivative of Rng232-189 CH (pink), dystrophin CH1 (red), calponin CH (orange) and plectin CH2 (blue) domains. Orientations in (a), (b) and (c) are the same................... 56 Figure 9: A 310-helical structure between residues 62-64 (helix h1). Water molecules are represented by cyan spheres and dotted lines indicate hydrogen bonds. The 2mFo - DFc map is contoured at 1σ and superimposed on the coordinates. ............................. 61 Figure 10: Intermolecular contacts in a P21 crystal of the Rng2 CH domain. (a1) Molecular packing projected from the c axis (upper) and b axis (lower) directions for the Br-derivative. The unit cell (P21) and its related symmetry elements are coloured red. The macrobonds, labelled A-E are represented by dashed lines and circles for bonds parallel and perpendicular to the projection plane respectively. The colouring scheme of the ribbon model of Rng2 follows figure 8(a). (a2) Same as (a1) for the Hg-derivative. The MAD and SAD data sets for the Br- and Hg-derivatives respectively were independently solved, accounting for the different origins which are related by half a unit translation along the a axis. Blue arrows point at the loop between residues 92-99 of Rng2. .............................................................................................................. 65 Figure 10(b)-(d): The 2mFo - DFc map is contoured at 1σ and superimposed on the coordinates. Grey and green carbons indicate separate molecules. Amino-acid codes are shown in lower case. The chain number is in upper case next to the residue number. (b1) Chemical environment at the interface of two molecules along macrobond A of the Br-derivative. There is a potential salt bridge and a potential water-mediated hydrogen bond between the ζ-amino group of Arg70 and the γ-carboxyl group of Asp151 The ζ-amino group of Arg70 is linked via a potential salt bridge to the δ-carbonyl group of Gln67. .............................................................................................. 66

v

Figure 10(b2): Macrobond A of the Br-derivative. A bromide anion with a temperature factor of 24.54 Å2 forms a potential hydrogen bond to the ζ-amino group of Arg175, which is doubly hydrogen bonded to the carbonyl group of Leu152. .......................... 67 Figure 10(b3): Macrobond A of the Br-derivative. There is a nonpolar cluster (>5 Å) of residues Ile172, Leu152, Phe82, Phe110 and Ile114 in a background of specific intermolecular hydrogen bonds and salt bridges. ......................................................... 68 Figure 10(c): Macrobond C of the Br-derivative. There is a potential direct hydrogen bond between the ζ-amino group of Arg99 and the δ-carbonyl group of Glu161. It also has another potential direct hydrogen bond between the α-imino of Tyr92 and γ-carbonyl group of Asn162. ........................................................................................... 69 Figure 10(d): Macrobond D of the Br-derivative. It has a potential water-mediated hydrogen bond from the γ-carbonyl group of Asn162 to the ε-amino group of Lys131........................................................................................................................................ 70 Figure 10(e): Macrobond E of the Br-derivative. It has a potential bromide-mediated hydrogen bond between the α-imino group of Pro117 and the ζ-amino group of Arg178. ......................................................................................................................... 71 Figure 11: Comparison between Br- and Hg-derivatives of Rng232-189. (a) Ribbon representation of the Br-derivative (red helices and blue coils) superimposed on the Hg-derivative (pink helices and cyan coils). Hg atoms are shown as spheres coloured according to their B factors. .......................................................................................... 74 Figure 11(b): Environment around the Hg atom near the variable loop from residues Tyr92-Arg99 in the Hg-derivative. The 2mFo - DFc map is shown contoured at 1σ. The Hg anomalous map (red) is superimposed onto the electron density map. .................. 75 Figure 11(c): Mass weighted Cα r.m.s.d. of Rng232-189 as a function of time over the entire trajectory of a 5 ns molecular dynamics simulation for Rng232-189, in the Br-derivative (red) and Hg-derivative (black). .................................................................. 76 Figure 11(d): Main chain (N, Cα, C and O) r.m.s.d. for each residue as a function of time during the same MD. α-Helical sections of the structure are highlighted in green........................................................................................................................................ 77 Figure 12: Consensus of phylogenies of 117 individual calponin-homology (CH) sequences from 73 proteins. The species represented include Arabidopsis thaliana (at), Dictyostelium discoideum (dd), Gallus gallus (gg), Homo sapiens (hs), Meleagris gallopavo (mg), Mus musculus (mm), Rattus norvegicus (rn), Saccharomyces cerevisiae (sc) and Schizosaccharomyces pombe (sp). The number appended to the species symbol represents the order of the CH domain with 1, 3 being amino-terminal and 2, 4 being carboxyl-terminal. The CH groups were designated following Korenbaum and Rivero (2002). Rng2 (Rng2_sp) is highlighted in dark grey at the top of the figure. The protein codes for the 9 species, together with their accession numbers, are as follows: Rng2_sp (NP_593860); Iqg1_sc (NP_015082); IQGAP1_hs (NP_003861); IQGAP2 (NP_006624); CNN_sp (NP_593863); CNN1_hs (NP_001290); 1h67_gg (P26932); CNN_mg (P37803); CNN3_hs (NP_001830); CNN2_hs (NP_004359); ARHGEF7b_hs (NP_663788); ARHGEF6_hs (NP_004831); LRRN4_hs (NP_002310); CHDC1_hs (XP_166260); LMO7_hs

vi

(NP_005349); VAV1_hs (NP_005419); VAV3_hs (NP_006104); VAV2_hs (NP_003362); TAGLN_hs (NP_003177); NP25_hs (NP_037391); TAGLN2_hs (NP_003555); Scp1_sc (NP_015012); GAS2L1_hs (NP_006469); GAR17_hs (NP_644814); GAS2_hs (NP_005247); ASPM_hs (NP_060606); SMTNb_hs (NP_599031); SMTNa_hs (NP_599032); SMTNc_hs (NP_008863); SMTN_mm (NP_038898); MICAL2_hs (NP_055447); MICAL3_hs (XP_032997); TANGN_hs (XP_170658); MIRAB13_hs (NP_203744); NICAL_hs (NP_073602); NUMA1_hs (NP_006176); 1pa7_hs (NP_036457); MAPRE3_hs (NP_036458); MAPRE2_hs (NP_055083); Sac6_sc1 (NP_010414); Fim1_sp1 (NP_596289); PLS1_hs1 (NP_002661); 1aoa1_hs1 (NP_005023); LCP1_hs1 (NP_002289); AtFim1_at1 (NP_200351); ACTN3_hs1 (NP_001095); ACTN4_hs1 (NP_004915); ACTN1_hs1 (NP_001093); ACTN2_hs1 (NP_001094); FLNB_hs1 (NP_001448); FLNA_hs1 (NP_001447); FLNC_hs1 (NP_001449); FLN_gg1 (BAB63943); ctxA_dd1 (AAB62275); abpC_dd1 (P13466); SYNE1_hs1 (NP_149062); CLMN_hs1 (NP_079010); SYNE2_hs1 (NP_055995); 1dxx1_hs1 (NP_003997); 1qag1_hs1 (NP_009055); MACF1a_hs1 (NP_036222); PLEC_rn1 (NP_071796); 1mb81_hs1 (NP_000436); SPTBN1_hs1 (NP_003119); SPTB_hs1 (NP_000338); SPTBN2_hs1 (NP_008877); SPTBN4_hs1 (NP_079489); SPTBN5_hs1 (NP_057726); NAV2_hs (NP_660093); NAV3_hs (NP_055718); PARVG_hs1 (NP_071424); PARVA_hs1 (NP_060692); PARVB_hs1 (NP_037459)..................................................................................................................... 80 Figure 13: Residues shown in stick representation on the ribbon model of Rng232-189 are conserved within the IQGAP family of proteins (Figure 7c). The transparent surface electrostatic potential highlight the presence of a basic patch where Rng232-189 possibly interacts with F-actin. ..................................................................................... 83

vii

List of tables

Table 1: S. pombe genes involved in various aspects of cytokinesis.............................. 3 Table 2: The seven crystal systems, their corresponding Bravais lattices and symmetries encountered in biological crystals. ............................................................ 14 Table 3: Rng2 plasmids. Restriction sites of the primers are shown in brown, whereas insert sequences are in blue........................................................................................... 36 Table 4: Data collection, processing, phasing and refinement statistics. Numbers in parentheses refer to the highest resolution shell. .......................................................... 46 Table 5: R.m.s.d. values of the pairwise comparison of all the CH domains using the Cα belonging to the four core helices of each structure. The core helices of Rng232-189 correspond to residues 43-54, 73-81, 99-112 and 133-144 (Figure 8c). Abbreviations: DMD, dystrophin; UTRN, utrophin; PLEC, plectin; CNN, calponin; SPTB, beta-spectrin; PLS3, fimbrin ABD1; EB1, end-binding protein 1........................................ 59 Table 6: Biochemical characteristics of several CH domains. Structures deposited in RCSB are in square brackets......................................................................................... 85

viii

List of abbreviations

ABD actin-binding domain ABP-120 actin-binding protein 120 ABS actin-binding site ACTN1 actinin, alpha 1 ARHGEF6 alpha rho guanine nucleotide exchange factor 6 ASPM asp (abnormal spindle)-like, microcephaly associated ATP adenosine 5'-triphosphate BPAG1 bullous pemphigoid antigen 1 CH calponin-homology CHASM calponin homology-associated smooth muscle protein CLMN calponin like transmembrane domain protein CNN calponin Da Dalton, atomic mass unit, 1.67377 × 10-27 kg DMD dystrophin DNase deoxyribonuclease EB1 end-protein 1 EHBP1 EH domain binding protein 1 ERK extracellular regulated kinase Esu estimated standard uncertainties FLNA filamin A GAP GTPase-activating protein GAR17 GAS2-related protein GAS2 growth arrest-specific 2 GDP guanosine 5'-diphosphate GEF guanine nucleotide exchange factor GST glutathione-S-transferase GTP guanosine 5'-triphosphate hGAR22 human Gas2-related gene on chromosome 22 IQGAP IQ motif containing GTPase-activating protein K Kelvin LCP1 lymphocyte cytosolic protein-1 LMO7 LIM domain only 7 LRCH4 leucine-rich repeats and calponin-homology (CH) domain containing 4 MACF1a microtubule-actin crosslinking factor, isoform a MAPRE1 microtubule-associated protein, RP/EB family, member 1 MICAL2 molecule interacting with CasL MIRAB13 molecule interacting with Rab13 NAV2 neuron navigator 2 NICAL NEDD9 interacting protein with calponin-homology and LIM domains NP25 neuronal protein NUMA1 nuclear/mitotic apparatus protein 1 PAGE polyacrylamide gel electrophoresis PARVA parvin, alpha PBC periodic bond chain PBS phosphate buffered saline pI isoelectric point PIP2 phosphatidyl inositol-3,4-bisphosphate PLEC1 plectin 1 PLS1 plastin 1 PMSF phenylmethylsufonyl fluoride r.m.s.d. root-mean-square deviation SDS sodium dodecyl sulphate SM22 smooth muscle protein 22-alpha SMTNa smoothelin isoform a SPB spindle pole body SPTB spectrin beta erythrocytic SPTBN1 spectrin beta non-erythrocytic 1

ix

SYNE1 synaptic nuclei expressed gene 1 TAGLN transgelin TAGN tangerin UTRN utrophin

List of publications

Wang, C.H., Walsh, M., Balasubramanian, M.K. and Dokland, T. (2003). Expression, purification, crystallization and preliminary crystallographic analysis of the calponin-homology domain of Rng2. Acta Crystallogr. D59, 1809-1812. Wang, C.H., Balasubramanian, M.K. and Dokland, T. (2004). Structure, crystal packing and molecular dynamics of the calponin-homology domain of Schizosaccharomyces pombe Rng2. Acta Crystallogr. D60, 1396-1403.

x

Name: Wang Chern Hoe

Degree: Doctor of Philosophy

Department: Institute of Molecular and Cell Biology

Thesis Title: X-ray crystallography of yeast cytoskeletal proteins

Abstract

Schizosaccharomyces pombe Rng2 is an IQGAP protein essential for the assembly of

an actomyosin ring during cytokinesis. Rng2 contains an amino-terminal calponin-

homology (CH) domain, eleven IQ repeats and a RasGAP homology domain. CH

domains are known mainly for their ability to bind F-actin, although they have other

ligands in vivo and there are only few examples of actin-binding single CH domains.

The structures of several CH domains have already been reported, but this is the third

report from actin-binding proteins that contain a single CH domain (the structures of

calponin and EB1 have been reported previously). The 2.21 Å resolution crystal

structure of the amino-terminal 189 residues of Rng2 determined using Br- and Hg-

derivatives includes 39 residues (150-189) carboxyl-terminal to the CH domain that

resembles neither the extended conformation seen in utrophin nor the compact

conformation seen in fimbrin, although residues 154-160 form an unstructured coil

which adopts a substructure similar to dystrophin residues 240-246 in the carboxyl-

terminal portion of the CH2 domain. It wraps around the stretch of residues that would

be equivalent to the proposed actin-binding site ABS1 and ABS2 from dystrophin.

Another feature revealed by comparing the two derivatives is the presence of two loop

conformations between Tyr92-Arg99.

Keywords: calponin-homology, CH, Rng2, yeast cell-division proteins.

xi

Summary

Cytoskeletal proteins form the framework for all biological processes occuring

in a cell. In particular, it is involved in separating a mother cell into two daughter cells

via the actomyosin ring. This process is known as cytokinesis, and is the most directly

visible biological process in a cell, responsible for the the ability of living organisms to

reproduce. The aim of this study was to solve by means of X-ray crystallography, the

structure of cell division proteins in the fission yeast Schizosaccharomyces pombe,

which happens to be an ideal model to analyse cell division; for the purpose of making

further insights into the molecular process of cytokinesis.

Rng2 is an IQGAP protein that is essential for the assembly of an actomyosin

ring during cytokinesis. It contains a single calponin-homology (CH) domain at the

amino-terminus, followed by eleven IQ repeats and a RasGAP homology domain. CH

domains are known mainly for their ability to bind F-actin, although they have other

ligands in vivo and there are only few examples of actin-binding single CH domains.

In this study, the structure of the amino-terminal 189 residues of Rng2 (Rng232-189) to

2.21 Å resolution was solved using the anomalous dispersion technique - two

wavelength Br-MAD and Hg-SAD. Its crystal structure revealed a CH domain that is

similar to the CH domains deposited in RCSB, although a molecular replacement

solution was impossible partly because unlike other CH domains, the 39 residues (150-

189) at its carboxyl-terminus bears no sequence similarity to known proteins. The 31

residues at its amino-terminus are also disordered and invisible in the electron density

map. When the amino-terminal 189 residues were expressed in S. pombe as a GFP-

Rng21-189 fusion protein (Wachtler et al., 2003), it localized to the same site as cortical

actin patches, suggesting that residues 1-189 of Rng2 bind F-actin. Two features new

xii

to CH domains were revealed. First of all, it has a carboxyl-terminal tail that wraps

around the stretch of residues that would be equivalent to the putative actin-binding

site ABS1 and ABS2 of dystrophin. The structure of this carboxyl-terminal tail of

Rng21-189, comprising residues 150-189, resembles neither the extended conformation

seen in utrophin nor the compact conformation seen in fimbrin. However, Rng2

residues 154-160 form an unstructured coil which adopts a substructure similar to

dystrophin residues 240-246 in the carboxyl-terminal portion of the CH2 domain.

The other new feature is the presence of two possible loop conformations

between Tyr92 and Arg99. A molecular dynamics (MD) simulation indicated that

Asn94 of the Br-derivative had a higher r.m.s.d. relative to the Hg-derivative,

corresponding to higher temperature factors in the crystal structure of the Br-derivative

compared to the Hg-derivative. This shows that the MD simulations were accurate and

lend weight to the view that the loop switches between two possible conformations.

These distinctive features were not highlighted in previously published CH domains

and may have useful implications in determining the F-actin binding orientation of

single CH domains.

xiii

Biological background

Fission yeast cytokinesis

Fission yeast cells maintain their specific sizes by coordinating their growth

and division via a cell-size checkpoint (Fantes and Nurse, 1978), which is exerted

primarily at the G2/M transition (Rupes, 2002). Wee1 delays entry into mitosis by

inhibiting the activity of Cdc2/Cdk1, the cyclin-dependent kinase that promotes entry

into mitosis (Russell and Nurse, 1987). The phosphatase Cdc25 promotes entry into

mitosis by removing the inhibitory phosphorylation of a highly conserved tyrosine

residue at the amino-terminus of Cdk1 (Dunphy and Kumagai, 1991). When Cdk1

activity reaches a critical level, it triggers early mitotic events such as chromosome

condensation, nuclear envelope breakdown and spindle assembly.

In the absence of defects at the end of mitosis, the three pairs of properly

segregated chromosomes are compartmentalized into two daughter cells during

cytokinesis, the final event of the cell cycle. It is perhaps the first major cell cycle

event that was observed directly. Schizosaccharomyces pombe cells are cylindrical

with a uniform diameter of about 3.5 µm and grow by polarized membrane and cell

wall addition at both ends to a length ranging from 7 to 16 µm. Fission yeast cells

divide by medial fission, with the haploid state being more stable. After cell division,

both daughter cells will initially grow from the end that existed in the mother cell prior

to division. Typical of all eukaryotic cells, the cell cycle has discrete G1, S, G2 and M

phases. Early in G2 phase, the new end created by the previous cell division will

initiate growth, and the cells will grow in a symmetrical way throughout the remaining

interphase. The M phase is the most dramatic, where replicated DNA become precisely

1

separated into two daughter nuclei during mitosis, and cytokinesis then divides the cell

equally into two identical daughter cells (Pringle et al., 1997).

Morphological transitions are well defined during the fission yeast cell cycle,

making it a simple and attractive model for the analysis of cytokinesis. It has the

smallest fully sequenced eukaryotic genome compartmentalized into three observable

chromosomes that clearly condense during mitosis, allowing individual chromosome

behaviour to be followed by cytological methods. S. pombe carries out mitosis and

cytokinesis much like animal and protozoan cells. The mitotic spindle is present for a

short time during mitosis and the spindle pole body (SPB) remains undivided until

mitosis, typical of higher eukaryotes (McCully and Robinow, 1971). Some differences

with animal cells are present, such as the actin ring emerging at an earlier stage of M

phase in fission yeast, and the synthesis of a division septum behind the constricting

ring. These processes are dependent on the proper organization of the actin

cytoskeleton. There are three types of filamentous actin structures in fission yeast:

cortical patches, cables and ring (Marks and Hyams, 1985). During interphase, F-actin

patches localize to both growing ends of the cell, whereas F-actin cables run

longitudinally and are linked to F-actin patches (Arai and Mabuchi, 2002). During

mitosis, the patches disappear and F-actin reappears in an actomyosin ring in the

medial region of the cell attached to F-actin cables (Arai and Mabuchi, 2002).

It is crucial that cytokinesis occurs only after mitosis to ensure proper

segregation of genetic material to each daughter cell. Since Nurse's pioneering work on

the cell cycle (Nurse et al., 1976), genetic screens have identified mutations in more

than 50 genes affecting distinct steps in cytokinesis (Rajagopalan et al., 2003; Guertin

et al., 2002). Phenotypic characterization of S. pombe cytokinesis defective mutants

2

has placed them in three categories: ring positioning, ring assembly and septation

initiation network (SIN; Table 1).

Table 1: S. pombe genes involved in various aspects of cytokinesis Ring positioning Septation initiation network

(SIN) Cell separation

mid1 dmf1

PH domain byr4 No known domain imp2 FCH and SH3 domains/PCH family protein

plo1 serine/threonine protein kinase

cdc7 serine/threonine protein kinase

bgs1 cps1

(1-3) beta-D-glucan synthase

pom1 serine/threonine protein kinase

cdc11 No known domain mok1 (1-3) alpha-D-glucan synthase

Ring assembly cdc14 clp1 flp1

Phosphatase Mitosis to interphase transition

cdc3 Profilin cdc16 Rab GTPase activating protein

sbp1 Ran GTPase activating protein

cdc4 Essential light chain mob1 No known domain mog1 Ran GTPase cdc8 Tropomyosin plo1 serine/threonine

protein kinase spi1 Ran GTPase

cdc12 FH domain spg1 Ras GTPase pim1 GEF cdc15 FCH and SH3

domains/PCH family protein

sid1 serine/threonine protein kinase

rna1 Ran GTPase activating protein

rng2 IQGAP sid2 serine/threonine protein kinase

rng3 UCS domain sid4 No known domain act1 Actin myo2 Myosin II heavy chain rlc1 Regulatory light chain myp2 Myosin II heavy chain fim1 Fimbrin ain1 Actinin crn1 Coronin

The Dyrk kinase Pom1 localizes to cell ends with the help of microtubules and

its activity is required for both symmetrical cell growth and symmetrical division

(Bahler and Nurse, 2001). The anillin-like protein Mid1 resides in the nucleus in

interphase cells.

Upon entry into mitosis, a broad equatorial band of proteins that will later form

the contractile ring is set up. Plo1, the single member of the polo family of kinases,

phosphorylates Mid1 during prophase and triggers its relocation to the medial region

of the cortex as a broad ring (Bahler et al., 1998). At the G2/M transition, this broad

3

band of Mid1 is joined by conventional myosin II (heavy chain Myo2, essential light

chain Cdc4 and regulatory light chain Rlc1) and IQGAP Rng2, followed by PCH

Cdc15 and formin Cdc12. Equatorial accumulation of these proteins depends on Mid1

but not actin. Myosin II localizes to a spot-like structure in interphase cells (Kitayama

et al., 1997) distinct from the formin Cdc12 spot (Chang, 1999). The myosin II-

containing spot seems to originate from the constricted ring of the previous mitotic

cycle. Since Myo2 is insoluble at physiological salt concentrations, the myosin II spots

may consist of myosin filaments (Bezanilla and Pollard, 2000). Rng2 and Myo51

physically interact with the essential light chain Cdc4 of myosin II (D’souza et al.,

2001). Thus, it is likely that Rng2 binds myosin II through its interactions with Cdc4.

During early anaphase, actin filaments and tropomyosin Cdc8 join the broad

band which then coalesces into a compact ring that includes alpha-actinin Ain1.

Formin Cdc12 and profilin Cdc3 cooperate to assemble the contractile ring (Kovar et

al., 2003). Profilins have been shown to bind monomeric actin, an actin-related protein

complex, phospholipid PIP2, and proline-rich peptides such as poly-L-proline and

VASP, and may regulate actin dynamics in the formation of F-actin structures (Chang

et al., 1997). Tropomyosin and alpha-actinin should bind to the actin ring as it

polymerizes. The interaction of myosin II with the actin ring provides a possible

mechanism for the contractile force. Alpha-actinin forms dynamic cross-links between

actin filaments, providing greater stiffness than actin alone at higher rates of

deformation (Xu et al., 1998).

During late anaphase, the compact contractile ring matures with the addition of

unconventional type II myosin Myp2 and septins. Myp2 is necessary under conditions

of stress and increases the efficiency of cytokinesis (Bezanilla et al., 2000). Septins are

GTPases which are not essential for fission yeast viability (Berlin et al., 2003).

4

In all eukaryotic cells examined, exit from mitosis requires inactivation of the

cyclin-dependent kinase Cdc2/Cdk1. Cdc2 is essential for both mitotic and meiotic cell

cycle progression in fission yeast. Cdc2 inactivation occurs coincidently with

chromosome segregation, ensuring that cell division does not initiate before

chromosomes have been separated (Trautmann et al., 2001). Loss of Cdc2 activity is

mediated through destruction of mitotic cyclins by ubiquitin-mediated proteolysis and

Cdc2 inhibitors. Ubiquitination of cyclin is driven by an E3 ubiquitin ligase known as

the anaphase-promoting complex/cyclosome (APC/C). Following cyclin destruction in

late mitosis, several events occur in rapid succession, such as the breaking down of the

mitotic spindle, the constriction and disassembly of the actomyosin ring, and the

formation of a septum. This pathway is carefully regulated by the septation initiation

network (SIN) so that cytokinesis and mitotic exit are prevented until the proper

segregation of chromosomes has occurred. At least eleven genes are involved in the

septation initiation network that is responsible for triggering the onset of septum

formation and cytokinesis (Table 1).

The localization of SIN components to the SPB is required for cytokinesis and

is dependent on Sid4-Cdc11, both constitutive residents of SPBs (Tomlin et al., 2002).

Cdc11 provides the physical link between Sid4 and the Ras GTPase Spg1, which

controls the onset of septation (Tomlin et al., 2002). In interphase cells, Spg1 is in the

GDP-bound form, but upon entry into mitosis it converts to the GTP-bound form

(Sohrmann et al., 1998). It localizes asymmetrically to the SPB in interphase and to

both spindle poles during mitosis (Sohrmann et al., 1998). A guanine nucleotide

exchange factor (GEF) for Spg1 has not been identified in S. pombe so far and Spg1

might prove to be negatively regulated solely by the recruitment of the GTPase-

activating protein (GAP) Cdc16-Byr4 to the SPB during interphase, where it acts to

5

restrain septum formation (Cerutti and Simanis, 1999). Cdc7 encodes a

serine/threonine protein kinase that shows no discrete localization during interphase

but is recruited to the SPB by GTP-Spg1 during mitosis (Sohrmann et al., 1998).

When the spindle separates during late anaphase, the protein kinase Sid1 functions in a

complex with phosphatase Cdc14/Clp1/Flp1 and is recruited to the SPB containing

elevated Spg1-Cdc7 (Guertin et al., 2000). Cdc2 inactivation, triggered by cyclin

proteolysis, regulates Sid1-Cdc14 localization to the SPB (Guertin et al., 2000). Mob1

localizes to the SPB and the medial ring during mitosis. It recruits the protein kinase

Sid2 to form the Sid2-Mob1 complex, which functions downstream of Sid1-Cdc14

(Hou et al., 2000). One candidate binding partner for Sid2-Mob1 is Cdc15, which is

required for Sid2 localization to the medial region (Sparks et al., 1999).

Imp2 colocalizes with the medial ring and destabilizes it after the initiation of

septum formation (Demeter and Sazer, 1998). It provides a link between SIN and the

division septum. Bgs1/Cps1 is an integral membrane protein that localizes to the cell

division site during late anaphase. Assembly of Cps1 into the actomyosin ring depends

on F-actin and SIN proteins (Liu et al., 2002). In addition, Cps1 localizes to areas of

polarized cell wall growth and might be involved in synthesizing the lineal (1-3) beta-

D-glucan of the primary septum during mating, sporulation and vegetative growth

(Cortes et al., 2002). Mok1 is a (1-3) alpha-D-glucan synthase localized on the cell

membrane and moves from the growing tips during interphase to the medial ring upon

mitosis (Katayama et al., 1999). Sterol-rich membrane domains are localized to

distinct regions of the plasma membrane in a cell cycle and secretory pathway

dependent manner. They may be involved in anchoring the actomyosin ring

components to the plasma membrane (Wachtler et al., 2003).

6

Ran GTPases Mog1 (Tatebayashi et al., 2001) and Spi1 (Matynia et al., 1996)

localize to the nucleus and are essential for the mitosis-to-interphase transition. Their

distribution between the GTP- and GDP-bound forms are regulated by GAP Rna1 and

GEF Pim1 (Matynia et al., 1996). In this class of mutants, the cells are typically

arrested in the cell cycle as septated, binucleated cells with highly condensed

chromatin, fragmented nuclear envelopes, and abnormally wide septa.

Fission yeast Rng2

Rng2 is a orthologue of mammalian IQGAPs (Eng et al., 1998; Wang et al.,

2003). It is associated with the spindle pole body during interphase and mitosis. It is

also detected in the medial actomyosin ring during mitosis and cytokinesis (Eng et al.,

1998) where its localization requires F-actin. Fission yeast has three simple F-actin

structures: cortical patches, cables and rings (Marks and Hyams, 1985). In contrast, its

Saccharomyces cerevisiae orthologue Iqg1, does not depend upon F-actin for its

localization to the ring (Epp and Chant, 1997). Rng2 contains several protein domains

and motifs. It has a calponin-homology (CH) domain in the amino-terminus, eleven IQ

repeats, two coiled-coil domains, a rasGAP domain and a rasGAP carboxyl-terminal

domain (Figure 1). IQGAP1, the human orthologue of Rng2 is rather well studied.

Residues 216-683 of IQGAP1 contain coiled-coil regions and have been shown to be

responsible for the oligomerization of IQGAP1 (Fukata et al., 1997). IQGAP1 is able

to bind Cdc42 (Ho et al., 1999) and inhibit its GTPase activity via residues 918-1657

(McCallum et al., 1996), which contains the RasGAP domain. IQGAP1 binds F-actin

and calmodulin via residues 1-216, which contains a CH domain (Fukata et al., 1997)

(Table 6). It binds calmodulin via residues 740-869 (Ho et al., 1999; Table 6), which

7

contains several IQ domains. Likewise, Cam1 and Cdc4 have been shown to bind

Rng2 (Eng et al., 1998; D'souza et al., 2001).

Figure 1: Domain organization within Rng2. Domain boundaries are defined by performing a global Pfam search (Eddy, 1998) across a Pfam HMM library (Bateman et al., 2002), and shown in red numerals.

The calponin-homology domain

The calponin-homology (CH) domain consists of about 100 residues first

identified at the amino-terminus of calponin, a protein implicated in the regulation of

smooth muscle contraction through its interaction with F-actin and inhibition of the

actin-activated MgATPase activity of phosphorylated myosin (Castresana and Saraste,

1995; Winder and Walsh, 1990; Tang et al., 1996). Three groups of CH domain

containing proteins are recognizable from sequence analysis. Human proteins

containing a single CH domain include IQGAP1, IQGAP2 and their fission yeast

orthologue Rng2; calponins CNN1, CNN2, CNN3; oncogenes VAV1, VAV2, VAV3;

ARHGEF6, ARHGEF7b; GAS2, GAS2L2, GAR17; LMO7; MAPRE1/EB1,

MAPRE2, MAPRE3; ASPM; NUMA1; NP25; SM22/TAGLN, TAGLN2; NAV2,

NAV3; LRCH1/CHDC1; LRCH2; LRCH3; LRCH4/LRRN4; SMTNa, SMTNb,

SMTNc; MICAL-L1/MIRAB13, MICAL-L2, MICAL2, MICAL3; NICAL; CHASM

and EHBP1. Human proteins containing tandem CH domains include alpha-actinins

ACTN1, ACTN2, ACTN3, ACTN4; beta-spectrins SPTB, SPTBN1, SPTBN2,

SPTBN4, SPTBN5; dystrophin DMD; utrophin UTRN; filamins FLNA, FLNB, FLNC;

plectin PLEC1; dystonin/BPAG1, MACF1a; SYNE1, SYNE2; calmin CLMN; parvins

8

PARVA, PARVB and PARVG. In addition, Dictyostelium discoideum encodes ABP-

120 (Bresnick et al., 1991) and cortexillin ctxA (Stock et al., 1999), which are

orthologous to human alpha-actinins, beta-spectrins, dystrophins and utrophins. Finally,

human proteins of the fimbrin/plastin family PLS1, LCP1, PLS3 contain tandem actin-

binding domains (ABDs).

There is abundant evidence that CH domains are able to bind F-actin in

isolation, in the form of tandem CH repeats known as actin-binding domains (ABDs).

For instance, recombinant alpha-actinin ABD (Kuhlman et al., 1992), dystrophin ABD

(Way et al., 1992, Fabbrizio et al., 1993) and utrophin ABD (Winder et al., 1995) have

been shown to bind F-actin, but not G-actin; and ABDs from filamin (Lebart et al.,

1994), fimbrin/plastin (Hanein et al., 1997,1998), MACF (Leung et al., 1999), plectin

(García-Alvarez et al., 2003) and beta-spectrin (Karinch et al., 1990) have been shown

to bind F-actin in isolation. ABDs from alpha-actinin (Leinweber et al., 1999), beta-

spectrin (Karinch et al., 1990; Raae et al., 2003), dystrophin (Renley et al., 1998),

utrophin (Winder et al., 1995), plectin (Garcia-Alvarez et al., 2003) and SYNE2 (Zhen

et al., 2002) have average dissociation constants (Kd) for ABD binding to F-actin

ranging from 2.5 to 22.3 µM (Table 6). Fimbrins form a distinct class since these

monomeric actin-bundling proteins contain two tandem ABDs. Plectin ABD is unique

in its ability to bind both F-actin and beta(4) integrin (Geerts et al., 1999; Table 6)

providing a mechanism to switch its localization from actin filaments to

hemidesmosomes when beta(4) integrin is expressed. Similarly, actopaxin/alpha-

parvin consists of a pair of CH domains that is able to bind both F-actin and the

LD1/LD4 motifs of paxillin (Nikolopoulos et al., 2000; Table 6). Since the LD1 motif

of paxillin is able to bind both actopaxin and the serine/threonine integrin-linked

kinase (ILK), this points to the role of actopaxin in integrin-dependent remodelling of

9

the actin cytoskeleton during cell adhesion. Aside from biochemical data, cryo-EM

reconstructions of F-actin decorated with fimbrin ABD (Hanein et al., 1997,1998),

utrophin ABD (Moores et al., 2000; Galkin et al., 2002) and calponin (Bramham et al.,

2002) have yielded atomic models of their respective complexes, although that of F-

actin decorated with alpha-actinin ABD (McGough et al., 1994) did not yield an

atomic model since the structure of the CH domain was unknown. From a larger

perspective, the CH domain plays an important role in developmental genetics, since

gain-of-function mutations in the ABD and rod domain repeats of filamin A are

implicated in many congenital malformations (Robertson et al., 2003).

Unlike the ABD, the single CH domain binds a wide variety of ligands, and

some single CH domain proteins are again implicated in single-gene genetic disorders.

In the case of ARHGEF6, both the amino-terminal CH and carboxyl-terminal coiled-

coil domains are necessary for binding β-parvin (Rosenberger et al., 2003; Table 6);

and a partial deletion of the single CH domain of ARHGEF6 is implicated in X-linked

mental retardation (Kutsche et al., 2000). The nuclear-mitotic apparatus protein,

NUMA, which contains a single CH domain and is localized in centrosomes, has been

correlated with abnormal mitotic spindle assembly after nuclear transfer in nonhuman

primates (Simerly et al., 2003). The CH domain acts in a regulatory fashion in the

oncogene VAV1 where its basal activity is inhibited in cis through interactions

between the CH and zinc finger region (Zugaza et al., 2002; Table 6). There is

opposing evidence regarding the F-actin binding ability of isolated CH domains. For

instance, the CH domains of calponin and SM22 are unable to bind F-actin (Gimona

and Winder, 1998; Gimona and Mital, 1998; Gimona et al., 2002; Mino et al., 1998;

Stradal et al., 1998; Table 6). Instead the CH domain of calponin was found to bind

caltropin with a slightly higher affinity than that of calmodulin (Wills et al., 1994).

10

This interaction was proposed to modulate the inhibitory activity of calponin towards

the actin activation of myosin ATPase. Calponin may also play a role in extracellular

regulated kinase (ERK) mediated signal transduction since its CH domain has been

shown to interact with ERK (Leinweber et al., 1999; Table 6).

In contrast to its regulatory role in ARHGEF6, VAV1, calponin and possibly

SM22, the single CH domain has also been shown to bind to the cytoskeleton. For

example, both the CH domain of GAR22 (Goriounov et al., 2003) and residues 1-216

of IQGAP1 which contains a CH domain (Fukata et al., 1997; Ho et al., 1999), are

able to bind F-actin in cosedimentation assays (Table 6). It is interesting to note that

residues 1-232 of IQGAP1, which contains a CH domain, is able to bind both F-actin

and calmodulin (Ho et al., 1999), whereas the CH domain of calponin is able to bind

calmodulin, caltropin and ERK, but not F-actin (Wills et al., 1994; Table 6). Residues

1-133 of EB1 contain a CH domain that has been shown to bind microtubules (Hayashi

and Ikura, 2003; Table 6). Similarly, residues 7-182 of smooth muscle calponin

encodes a CH domain and was shown to interact with desmin (Wang and Gusev, 1996;

Table 6). All these facts highlight the versatile nature of the CH fold that allows it to

bind different ligands depending on the specific residues evolved to suit the function of

the protein in question.

X-ray structures are known for several CH domains, including CH2 of beta-

spectrin (Carugo et al., 1997); the ABDs from dystrophin (Norwood et al., 2000),

utrophin (Keep, Winder et al., 1999; Keep, Norwood et al., 1999) and plectin (García-

Alvarez et al., 2003); the first ABD (ABD1) from fimbrin (Goldsmith et al., 1997);

and the single CH domain from the EB1 (Hayashi and Ikura, 2003). Well diffracting

crystals of alpha-actinin ABD (Ekström et al., 2003) have also been reported. The

NMR structure of a CH domain from calponin shows a similar fold in spite of their

11

sequence divergence (Bramham et al., 2002). However, it does not bind F-actin.

Instead, its three carboxyl-terminal calponin family repeats are essential for binding F-

actin (Gimona and Winder, 1998; Gimona and Mital, 1998; Mino et al., 1998; Table 6).

In a similar fashion, the carboxyl-terminus of SM22 which contains a calponin family

repeat is required for full F-actin affinity (Fu et al., 2000; Table 6).

Crystallographic background

Crystal symmetry

Crystals are distinguished by the presence of a regular repetition or symmetry

in the three dimensional space of an object, although in reality they often have defects

at a non zero temperature and may have impurities trapped in the lattice. Crystal

symmetry can be described in terms of symmetry operations which leave the object

unchanged. Symmetry operations encountered in biological crystals include rotations

and screw rotations. There is a corresponding symmetry element such as a point, axis

or plane with respect to which each symmetry operation is performed, with the

restriction that point group symmetry operations are compatible with infinite

translational repeats in a lattice; and cannot induce a higher symmetry than the space

group of the crystal. The permutation of all symmetry elements to three-dimensional

spatial patterns results in 230 space groups. These are in turn distributed among 14

Bravais lattices, 11 Laue groups, 32 point groups and 7 crystal systems (Table 2). The

first step in the derivation of the 230 space groups was made by Leonhard Sohncke. He

combined the 14 Bravais lattices with the 32 point groups, which involved n-fold

rotation axes, inversion centres and mirror planes; and added translational motion: n-

fold screw axes, glide lines and glide planes; where n = 1, 2, 3, 4, 6. This resulted in

12

the 65 space groups which he published in 1879. The second and final step accounted

for additional inversion centres which led to a further 165 space groups. They were

first worked out in 1885 by Evgraph Stepanovich Fedorov and independently by

Arthur Moritz Schönflies in 1891 and William Barlow in 1894. The 230 space groups

are arbitrarily numbered from 1 to 230, beginning with triclinic crystals and ending

with cubic crystals. They are systematically drawn and described in the "International

Tables for Crystallography Volume A". There are two space group symbols, one due

to Schönflies and the generally adopted one due to Carl Heinrich Hermann and Charles

Mauguin. The commonly used short Hermann-Mauguin symbol consist of a letter

indicating the centring type of the Bravais lattice (P, C, A, B, I, F, R, H); followed by

one to three characters designating the symmetry elements (m, a, b, c, e, n, d, g, 1, 2, 3,

4, 6, 1,

Bravais lattices are regular stacks of an infinite array of points in which every

point has the same environment as any other point. There are fourteen possible ways to

do this and the Bravais lattices are distributed unequally among the seven crystal

systems. Any point in the lattice may be chosen as an origin.

Proteins are built from enantiomers of optically active L-amino acids, whereas

nucleic acids are built from D-ribose or D-deoxyribose. As a result, their crystal forms

can be either right- or left-handed; and belong to one of the eleven enantiomorphous,

non-centrosymmetric classes of point group symmetries that contain neither mirror

planes nor inversion centres (Table 2).

3 , 4 , 6 , 21, 31, 32, 41, 42, 43, 61, 62, 63, 64, 65) for the corresponding primary

(monoclinic), secondary (rhombohedral) and tertiary (orthorhombic, tetragonal,

hexagonal and cubic) directions.

13

Crystal systems Bravais lattices Constraints imposed on unit cell geometry

Characteristic symmetry

Laue groups Enantiomorphous non-centrosymmetric point groups

Non-enantiomorphous non-centrosymmetric point groups

Non-enantiomorphous centrosymmetric point groups

Cubic cP cI cF

a = b = c α = β = γ = 90°

Four 3-fold rotation axis at 109.47°

3m mm3

23 432

m34

3m mm3

Hexagonal hP a = b α = β = 90°, γ = 120°

One 6-fold rotation axis

m/6 mmm/6

6P

622 6 6mmP

26m

m/6 mmm/6

hP a = b α = β = 90°, γ = 120° (hexagonal axes)

3P

32 3mP

3 m3

Trigonal

hR a b c = = α = β = γ (rhombohedral axes)


3 m3

Tetragonal tP tI

a b = α = β = γ = 90°


m/4 mmm/4

4P

422 4 4mmP

m24

m/4 mmm/4

Orthorhombic oP oI oC oF

α = β = γ = 90° Three 2-fold rotation axis at 90°

mmm 222 mm2P mmm

Monoclinic mP mC

α = γ = 90° (unique b-axis)


m/2 2P mP m/2

Triclinic aP none None 1 1P 1 P

Table 2: The seven crystal systems, their corresponding Bravais lattices and symmetries encountered in biological crystals.

There are ten polar, non-centrosymmetric point groups which have a unique axis not relat d by symmetry. e

14

Bravais lattices of various shapes and sizes are measured in terms of their unit

cell lengths a, b, c and the angles α, β, γ between them; where α refers to the angle

between lattice vectors b and c, β the angle between a and c, and γ the angle between a

and b. The right-handed axial system is conventionally used; here the unit cell lengths

and angles are defined with respect to a common origin at the back left-hand corner of

the cell (Figure 2.1). The unit cell is chosen based on the convention that it should

have the smallest volume for which its delineating vectors are parallel or coincide with

important symmetry directions in the lattice. The monoclinic system is characterized

by a two-fold rotation or inversion, with the b axis parallel to it.

Figure 2.1: The right-handed axial system is shown on a triclinic P lattice.

The eleven Laue groups are determined from Laue images captured using the

Laue X-ray method. The Laue group assigned to a crystal describes the symmetry of

the complete X-ray diffraction pattern obtainable from the crystal. This is because a

single X-ray image is two-dimensional in nature and cannot record the complete three-

dimensional diffraction symmetry.

The Neumann principle is of basic importance to understand how the symmetry

of a crystal is related to its point group. It states that the symmetry elements of any

physical property must include the symmetry elements of the crystal point group. In

15

agreement with Neumann’s principle, physical experiments such as X-ray diffraction

may not reveal the true symmetry of the crystal. For instance, a center of symmetry is

introduced into all X-ray diffraction patterns regardless of whether the crystal is

centrosymmetric or not.

Symmetry elements may occur singly or in certain combinations in a body. A

point group is defined as a set of symmetry elements all of which pass through a single

fixed point known as the origin. The symmetry operations of a point group must leave

at least one point unmoved. The point group symmetry is independent of whether the

unit cell is primitive (P), body centred (I), C-face centred (C), all-face centred (F) or

rhombohedrally centred (R). The result of repeating a point group pattern by the

translations of a Bravais lattice is a space group. A space group consists of a infinite

set of symmetry elements, the operation with respect to any of which brings them into

a state indistinguishable from that before the operation. It can also be described in

terms of two parts, a pattern motif and a repeat mechanism. The pattern motif consists

of the asymmetric unit. It does not have to occupy the entire unit cell although it may

be placed anywhere within the unit cell. It is impossible to deduce the point group and

space group solely from the observed symmetry, although it is possible to infer the

crystal system from either the point group symmetry or unit cell dimensions.

The trigonal system can either have a rhombohedral or hexagonal Bravais

lattice. Some crystals adopt the hexagonal lattice. The rest adopt a rhombohedral

lattice and are distinguished by systematically absent X-ray reflections. In practice, the

hexagonal lattice is used because it is easier to visualize. All crystal systems with the

exception of cubic forms are anisotropic in terms of their physical properties.

If a Bravais lattice has a higher symmetry than triclinic, each particle in the

unit cell will be repeated several times as a consequence of space group symmetry.

16

Each space group has only one general Wyckoff position. It describes a set of

symmetrically equivalent independent particles, each occupying a point that is left

invariant only by the identity operation. The number of molecules in a unit cell

depends on the number of such independent particles in each asymmetric unit

multiplied by the number of such asymmetric units present in the unit cell. An

asymmetric unit of a space group is the smallest space from which an entire lattice can

be filled exactly by application of all symmetry operations of the space group. In

addition, each space group may have several special Wyckoff positions. It describes a

set of symmetrically equivalent independent particles, each occupying a point that is

mapped onto itself by at least one further symmetry operation of the space group. If a

molecule occupies a special position, which is common in small molecule crystals; it

implies the presence of a symmetry axis running through it, and that unit cell will have

less molecules than anticipated from the number of asymmetric units multiplied by the

number of general positions per asymmetric unit.

Reciprocal lattice

For each direct (Bravais) lattice, there is a corresponding reciprocal lattice. It

has the same symmetry as the direct lattice from which it was deduced. d*(100), d*(010)

and d*(001) are referred to as a*, b* and c*, respectively and define a unit cell in the

reciprocal lattice. In general,

)()(*

hkldKhkld =

where d*(hkl) is the distance of the reciprocal lattice from the origin, and K is a

constant which is usually the wavelength λ of the X-ray used. The reciprocal lattice

units are thus dimensionless.

17

The reciprocal lattice points form a true lattice with a representative unit cell

outlined by three sides a*, b* and c*, and three angles α*, β* and γ*. The size of the

reciprocal lattice is governed by the choice of the X-ray wavelength λ used. A

reciprocal lattice row h,k,l; 2h,2k,2l;… are derived from the families of planes (nh, nk,

nl) with n = 1, 2, … in the Bravais lattice.

Bragg's law

When X-rays encounter any form of matter, they are partly transmitted and

partly absorbed. Matter absorbs X-rays in two distinct ways, by scattering and by true

absorption. The scattering of X-rays by atoms is similar to the scattering of visible

light by dust particles in the air, and occurs in all directions. In some directions, the

scattered beams satisfy Bragg's law and are completely in phase, thus reinforcing each

other (constructive interference) to form a diffracted beam with an increased amplitude.

In most directions, they do not satisfy Bragg's law and annul one another (destructive

interference). With the exception of very light elements, scattering is responsible for

only a small fraction of the total absorption of X-rays with wavelengths in the usual

range used for diffraction. Thus the diffracted beam is rather strong compared to the

sum of all the rays scattered in the same direction, but is extremely weak compared to

the incident beam since the atoms in a crystal scatter only a small fraction of the

incident energy. True absorption is caused by electronic transitions within the atom

and results in fluorescent radiation.

William Lawrence Bragg's treatment of X-ray diffraction is an

oversimplification of the complete process. He described it in 1912 as a consequence

of the partial reflection of X-rays by various lattice planes of the same family, like

sheets of atomic mirrors. Although the term 'reflection' is commonly used, it differs

18

fundamentally from diffraction. Reflection of visible light takes place in a thin surface

layer, occurs at all angles of incidence with no regard for Bragg's law, and is highly

efficient. However, the intensity of a diffracted beam scattered by all atoms of the

crystal that lie in the path of the incident beam is very weak, and it occurs only at

angles of incidence that satisfies Bragg's law. Diffraction is essentially reinforced

coherent scattering. A diffraction vector or structure factor F(hkl) has the Miller

indices h, k, l; an amplitude |F(hkl)| and a phase angle α(hkl). The structure factor for a

protein is called FP and that for a heavy atom derivative is called FPH.

Bragg's approach was the result of early experiments showing that if a crystal

was turned from one diffracting position to another through an angle θ, then the

diffracted ray was rotated through an angle of 2θ. The Bragg reflection of three parallel

rays is illustrated in figure 2.2, which shows a c-axis projection of a monoclinic unit

cell. His treatment of diffraction leave implicit the requirement for the physical

superposition of interacting X-rays (i.e. waves 1 and 3 in figure 2.2). The difference in

'path' between the waves scattered at D and B is equal to AB + BC = 2d sin θ. If it is

multiple of λ then the two waves combine themselves with maximum positive

interference, resulting in Bragg's law

λθ nd =sin2

There is a special relationship between the interplanar spacing d(hkl), Bragg's angle θ,

X-ray wavelength λ, and order of diffraction n relative to the same family of lattice

planes, with all planes in the (hkl) family cooperating in the scattering process. X-rays

penetrate deeply into the crystal and are reflected by a series of lattice planes which

satisfy Bragg's law. All X-rays reflected from the same plane remain in phase after

reflection, since no path difference is introduced.

19

Figure 2.2: One P unit cell of a monoclinic structure, with scattering entities at the lattice points. X-ray beam are "reflected" by the (110) planes of the monoclinic structure.

Since sin θ cannot exceed unity,

1sin2

pθλ=

dn

Therefore nλ must be less than 2d. Since the smallest value of n is 1,

d2pλ

For most sets of crystal planes, d is of the order of 3 Ǻ or less, meaning that

electromagnetic radiation exceeding 6 Ǻ will not be diffracted.

Scattering by an electron

A tightly bound electron set into oscillation by the oscillating electric field of

X-rays is continuously accelerating and decelerating during its motion and emits an

electromagnetic wave (photon). Thus, the oscillating electron scatters X-rays; or in

other words, the scattered X-ray beam is radiated from the electron under the influence

20

of the incident X-ray beam. Unlike the electron, the nucleus has an extremely large

mass and does not oscillate significantly; as a result, X-ray scattering is proportional to

the electron density of the atoms in the crystal. Electrons can scatter X-rays coherently

and incoherently. Coherent or Raleigh scattering is the only kind capable of being

diffracted. It comes from incident X-ray photons that lack sufficient energy to eject the

scattering electrons from the atom. Since no energy is transferred, the scattered X-rays

have the same wavelength and frequency as the incident beam; although there is a

phase change of π/2. The intensity of unmodified radiation decreases with increasing

Bragg angle, i.e. higher resolution and shorter wavelengths.

Incoherent or Compton modified radiation was discovered by Arthur Holly

Compton in 1923, and occurs whenever X-rays encounter loosely bound or free

electrons. The incident X-ray beam is an electromagnetic wave of photons, each with

the energy quantum hν. When an incident photon strikes a loosely bound electron in an

elastic collision, the electron gets knocked aside and the photon deflects through an

angle 2θ. Some of the energy of the incident photon is transferred to the electron as

kinetic energy. Compton modified radiation has a slightly increased wavelength and a

random phase relative to the incident beam. In contrast to unmodified radiation, its

intensity increases with increasing Bragg angle and with shorter wavelengths. It does

not take part in diffraction due to its random phase and is responsible for increasing the

background in diffraction images. Since it is due to collisions of quanta with loosely

bound electrons, its intensity relative to coherent radiation increases as the proportion

of loosely bound electrons increases. For this reason, diffraction images of organic and

protein crystals, which consist of elements with low atomic numbers, have increased

background.

21

Data reduction

The analysis and reduction of single crystal diffraction data consists of several

steps. Initially, several diffraction spots or Bragg intensities from a single frame in the

series of oscillation images collected are chosen. It is important that the oscillation

range is small enough such that the diffraction pattern will consist of spots limited

between two ellipses that form a lune that do not overlap. If the lunes overlap, the

spots will not have a unique Miller index. An oscillation range of 1.0 and 0.3° is

usually chosen for protein (small unit cell) and virus (large unit cell) crystals

respectively, since this provides enough spots to establish the periodicity of the

diffraction pattern. All reflections within the same lune originate from the same family

of parallel planes in the reciprocal lattice layer. The lunes are most obvious with large

unit cell crystals such as those from viruses, which have their reciprocal lattice planes

perpendicular to the incident X-ray beam or parallel to the detector. Within each lune,

diffraction spots are arranged along lines that reflect the regularity of the reciprocal

lattice. As a consequence of mapping the curved Ewald sphere onto the flat detector

surface, the straight lines of reflections become hyperbolas at high diffraction angles,

i.e. high resolution, but can be approximated by a plane at low angles.

The diffraction spots chosen from an oscillation image are then mapped onto

reciprocal space, based on the the center of the oscillation range used. The real-space

indexing procedure as implemented in Denzo (Otwinowski and Minor, 1997) uses Fast

Fourier Transform (FFT) to completely search all possible indices of all reflections

that are consistent with the chosen spots. To decrease computational time, the range of

indices is restricted to those found in protein crystals. Lower resolution reflections are

used first, before extending to higher resolution after fitting more parameters. When

integer values of an index h is found for all the chosen spots, one real space vector of

22

the crystal axis is located. The remaining two indices, k and l are then located, and

three linearly independent vectors defining a minimal unit cell volume that would

index all of the observed spots is determined. The three vectors forming a primitive

unit cell is then transformed into one of the fourteen possible Bravais lattices (Table 2).

The lattice with the lowest distortion index is chosen, together with its refined unit cell,

crystal and detector orientation parameters. The crystal orientation parameters describe

the orientation of the reciprocal lattice with respect to the spindle, beam and vertical

axes of the detector. The detector parameters include the X-ray wavelength, crystal to

detector distance, precise coordinates of the direct beam, detector missetting angles

and internal scanner alignment parameters, in the case of image plates.

The process of indexing the diffraction spots provides an estimation of the

mosaicity of the crystal. It is usually of the order of more than 0.5° for protein crystals,

with higher values suggesting lower quality. Real crystals are not ideal because the

regular repetition of unit cells is interrupted by lattice defects. Its diffraction pattern is

the cumulation originating from tiny blocks with slightly different orientations. In

addition, the X-ray is monochromated to a narrow wavelength window with a

bandpass δλ/λ. This increases divergence to the total rocking curve, which is the sum

of beam divergence and crystal mosaicity. In practical terms, mosaicity is defined as

the oscillation angle of a crystal where all the observed reflections are matched with

the predicted reflections.

Bragg intensities are calculated by subtracting an estimate of the detector

background from the reflection profile. Its precise integration requires its Miller

indices and an accurate prediction of its position on the oscillation image. The position

of weak reflections are predicted from those of strong reflections. An analytical

derivation for the integrated intensity I(hkl) from the structure factor F(hkl)

23

20

2

2

2

2

3

)()( hklFELPTIVmce

VhklI rcr ×⎟⎟

⎠

⎞⎜⎜⎝

⎛=ωλ

has the terms wavelength λ, angular speed of rotation of the reciprocal lattice ω, unit

cell volume V, electron charge e, electron mass m, speed of light c, crystal volume Vcr,

intensity of the incident beam I0, Lorentz factor L, polarization factor P, transmission

factor Tr and the extinction coefficient E. The Lorentz factor depends on the geometry

of the diffraction system. It arises from the different length of time required for each

reciprocal lattice node to cross the Ewald sphere. For instance, peaks nearer to the

rotation axis spend more time passing through the Ewald sphere than those further

away. The polarization factor originates from the fact that electrons do not scatter

along its direction of oscillation, but in other directions. When unpolarized incident X-

ray beams from laboratory sources hits a plane, the scattered beams will become

polarized; its intensity is reduced at high angles of scattering. Incident beams with

electric vectors parallel to the diffraction place have little attenuation, whereas those

with electric vectors perpendicular to the diffraction plane will be significantly

weakened. The transmission factor is related to the absorption factor. The use of longer

X-ray wavelengths to diffract crystals containing more heavy atoms will result in

higher absorption of X-rays and correspondingly reduce the scattered X-ray intensity.

The extinction coefficient depends on the inherent mosaicity in crystals and has two

components. Secondary extinction plays a major role in relatively large, perfect

crystals. In these cases, the lattice planes first encountered by the incident beam will

reflect a significant fraction, and as a consequence, the deeper planes will receive less

primary radiation. It can be recognized in the final stages of crystal structure

refinement where |Fo| < |Fc| for some high intensity reflections.

24

For mosaic crystals, primary extinction plays a more significant role. For an

incident beam at a Bragg angle, there are multiple internal reflections due to coherent

or Raleigh scattering within each mosaic block of the crystal. Each reflection shifts the

phase angle by π/2. After two reflections, the beam travels in the same direction as the

incident beam but with reduced intensity and a phase difference of π.

The analytical derivation assumes that the integrated intensities measured do

not change with time. Unfortunately, exposure to X-rays triggers crystal

decomposition, particularly for protein crystals at room temperatures, where the

amount of radiation damage is significant. An efficient way of minimizing radiation

damage is by flash cooling the crystals to 100 K. This process also decreases

background scattering and reduces atomic B factors. However, it broadens the crystal

mosaicity from below 0.02° to above 0.2°, and also degrades diffraction resolution to

varying degrees. Flash cooling creates far more disorder in protein crystals due to their

high solvent content (20-90%), compared to small molecule or inorganic crystals.

After all the Bragg reflections are integrated, a single isotropic scale and B

factor for each oscillation image in a data set is calculated. One or more frames can be

chosen as the reference, and its scale and B factor will not be refined. Merging of

redundant or equivalent reflections, such as those related by space group symmetry,

occurs throughout this refinement process. It provides an opportunity to identify and

reject outliers, which are reflections with intensities that are significantly different

from their equivalents. It should not account for more than 1% of the entire data set

and comes from errors in the classification of partially and fully recorded reflections.

These are usually spots that lie close to the blind region, which refer to the reciprocal

lattice lying on either side of the spindle/Φ axis that do not cross the Ewald sphere

even after a 360° rotation. Alternatively, the reflections may be wrongly measured

25

because the CCD pixels are either shadowed, inactive or affected by 'zingers', i.e.

sparks from trace radioactivity of the fiber optic taper which transports visible light

transformed from X-rays by the phosphor.

The last step is the scaling and merging of different data sets, such as the three

data sets from a single crystal in a MAD experiment; and the global or post-refinement

of crystal parameters using the entire data set, which is more precise than the

processing of a single oscillation image.

The quality of X-ray data is usually assessed by the global Rsym factor, based

on intensities or rather F2

Rsym = ∑

∑ −

hklobs

hklobs

hklI

hklIhklI

)(

)()(

It is a ratio of the spread of intensities of multiple measurements of symmetry

equivalent reflections to an estimate of the reflection intensity. Rsym is highly

influenced by data multiplicity and is always higher for data sets with a high symmetry

space group (Diederichs and Karplus, 1997). This is a disadvantage since its value can

be manipulated. Rsym was first introduced as a reliability indicator for data collected by

precession photography, where it was summed over symmetry-related intensities on

the same film, with Rsca reporting the agreement over different films. Over time, Rsym

and Rsca were combined into the modern day Rsym which is summed over all observed

equivalent reflections. Rmerge, although used interchangeably with Rsym, is obtained

after merging and scaling symmetry equivalent reflections from several crystals and

data sets.

Complementary information about the data quality is given by the ratio of

intensities to their uncertainties,

26

∑∑

)( hkl

hkl

II

σ

Correct estimation of intensity standard uncertainties is important in subsequent

applications based on statistical treatments, such as maximum-likelihood or direct

methods of phasing and refinement.

Scattering by an atom

When an X-ray beam encounters an atom, each electron scatters part of the

radiation coherently according to the Thomson equation. However the nucleus has an

extremely large mass relative to the electron and does not oscillate. X-rays scattered by

an atom of atomic number Z containing Z electrons has an amplitude of Z times the

wave from a single electron when the scattering is in the forward direction (2θ = 0). In

other directions of scattering, the electrons are situated at different points in space and

this results in phase differences. Since the electron cloud has a radius comparable to

the X-ray wavelength, the contribution falls off at higher diffraction angles, i.e. higher

resolution. This is represented by the atomic scattering factor ),( λθf , which describes

the efficiency of scattering X-rays in a given direction by an atom compared to a single

electron. An alternative definition describes it as the ratio of the amplitude of a wave

scattered by an atom to the amplitude scattered by a single electron. The signal is

isotropic and is treated as a real number,

)(),( θλθ °= ff

Along the incident beam where sin θ = 0, ),( λθf has its maximum value because

there is no interference, and is equal to Z. Interference increases with increasing sin θ

and decreasing λ since the path differences within the atom become larger. The normal

27

scattering factor, )(θ°f applies when the scattered radiation has a wavelength much

shorter than the absorption edge of the scattering atom. However when these two

wavelengths are nearly the same, resonance occurs and the X-rays will have sufficient

energy to excite electrons from lower to higher energy shells, resulting in an

anomalous signal which is expressed as a complex number,

)(")(' λλ iff +

Anomalous scattering by an atom

Under normal circumstances when all atomic scattering factors are real with

zero )(" λf contribution, the X-ray diffraction pattern from a crystal is

centrosymmetric and obey Friedel's law, such that )()( hklFhklF = . The X-ray

diffraction pattern, which is the equivalent of a 3D weighted reciprocal lattice, will

exhibit the symmetry of the crystal's point group, with an additional center of

symmetry if not already present i.e. Laue group. However Friedel's law breaks down

when heavy atoms are present in a protein crystal. The tightly bound electrons close to

the atomic nucleus scatter X-rays anomalously, causing reflections of the Bijvoet pair

and )(hklF )(hklF to have unequal intensities, and with phases that are no longer

complementary. This is because electrons are not completely free but are bound to the

nucleus by forces which depend on the exchange of spin-1 gauge bosons (photons)

between the electron and the atomic nucleus; and on the quantum state of the electron,

a spin-½ lepton. Usually )(" λf is proportional to the atomic absorption of the X-rays

and to their fluorescence and )(' λf follows the derivative of this function, according

to the Kramer-Kronig transformation (James, 1958). It is usually expressed as a

complex quantity,

28

)(")(')(),( λλθλθ iffff ++°=

In contrast to the normal atomic scattering factor )(θ°f , the real (dispersive)

correction )(' λf and imaginary (anomalous) correction )(" λf depend only on the

wavelength of the X-rays used and do not diminish with increasing diffraction angle

(or resolution). The soft X-rays from in-house radiation sources such as CrKα (λ =

2.291 Å) and CuKα (λ = 1.542 Å) generate rather small dispersion effects for most

organic compounds and are normally insufficient for obtaining their phases. However,

anomalous dispersion induces substantial variation of the scattered intensities

depending on the chosen wavelength and is an important tool for solving crystal

structures. Thus anomalous scattering from S atoms collected from an in-house CrKα

radiation source can be exploited to phase the X-ray diffraction data from thaumatin

and trypsin crystals (Yang et al., 2003). The contribution to the imaginary component

of anomalous scattering )(" λf for sulfur of 1.14 e- is twice that from CuKα radiation

( )(" λf = 0.56 e-), resulting in reduced data redundancy required for the sulfur single

wavelength anomalous dispersion (SAD) phasing procedure compared with a highly

redundant data set required if CuKα radiation was used. Most of the atoms in

macromolecules have negligible )(' λf and )(" λf . The few anomalous scatterers that

remain result in small intensity differences which must be accurately measured for

SAD, also known as single wavelength anomalous scattering (SAS); and multiple

wavelength anomalous dispersion (MAD) phasing. In the MAD technique, changes to

the heavy atom structure factor FH are induced by tuning the X-ray wavelength to

different points in its absorption spectrum, thus changing )(' λf and )(" λf . As with

the multiple isomorphous replacement (MIR) and single isomorphous replacement

with anomalous scattering (SIRAS) phasing methods, three data sets are usually

required. In the case of MAD, the three data sets are measured at different wavelengths

29

- edge or inflection wavelength λ1 where )(' λf has its minimum, white line or peak

wavelength λ2 where )(" λf has its maximum and where the Bijvoet differences

−+ − PHPH FF are the largest, and a low or high energy remote wavelength λ3 where

)(' λf and )(" λf are small.

Location of heavy atoms and the determination of protein

phase angles

The purpose of crystallographic analysis is to obtain the atomic positions

starting from the diffraction data. However, from the experimental data, only

integrated intensities I(hkl) can be calculated after applying the correction factors L, P,

Tr and E. Since I(hkl) = |F(hkl)|2, the amplitudes |F(hkl)| can be found, but not the

phase angles α(hkl). There are four principle techniques to solve the phase problem -

isomorphous replacement, anomalous dispersion, molecular replacement and direct

methods. It is pointless to implement techniques that will allow us to recover the phase

of individual reflections since this quantity changes continually during the experiment.

Thus, the values of some phases, such as those from anomalous scatterers need to be

solved before the origin of the crystal can be established as a reference for evaluating

the remaining phases. This leads to the fact that the recovered phases of the protein

structure are actually relative phases, since they are calculated relative to the phase of

other scattered amplitudes.

The anomalous dispersion technique will be the focus here since it was used

to solve the structure. The main problem with this technique is noise because it is

based on small differences between observed structure factors, thus necessitating the

collection of highly redundant data (Dauter and Adamiak, 2001). MAD is quickly

30

becoming the most popular method to solve protein crystal structures. In the beginning,

the positions of anomalous substructures were calculated using Patterson and

reciprocal space direct methods based on the tangent formula. The drawback was that

as the number of atom increases, the phase probability distributions become more

easily invalidated by the few aberrant reflections, putting an upper limit of about 20

anomalous scatterers that can be solved because conventional direct methods tend to

become weaker as the number of atom increases. A significant advance was made with

Shake and Bake (Weeks et al., 1993), an algorithm for solving crystal structures by ab

initio, dual space direct methods. It increased the size of the substructure that can be

solved to as many as 160 Se sites, by incorporating information from both strong and

weak reflections. For substructure applications, the resolution of the FA data is not

critical (FA refers to the anomalous contribution to the structure factor F, where F = FP

+ FA). This is because the distances between heavy atoms are usually greater than the

resolution of the data. In fact, data used to a higher resolution without significant

dispersive and anomalous information will only add noise. However, it is essential that

the data is collected with a high redundancy to optimize the anomalous signal to noise

ratio and to eliminate the outliers properly. An effective way to decide the resolution

where data is truncated is to calculate the correlation coefficient (CC) between the

signed anomalous differences −+ − PHPH FF at different wavelengths as a function of

resolution, and truncate the data where the CC falls below 25% (Schneider and

Sheldrick, 2002). A 3.5 Å anomalous wavelength data set is usually sufficient,

although this will result in a poor quality electron density map if the crystal only

diffracted to this resolution (Brodersen et al., 2000). This procedure was first

implemented in SnB (Miller et al., 1994) and then in SHELXD (Sheldrick, 1998). They

31

use the strongest normalized structure factors E, which are usually the most significant

15-20% of the F in each resolution shell. o

After the heavy atom substructure has been derived using Patterson or direct

methods, the heavy atom positions can be refined by allowing |FPH(obs)| to approach

|FPH(calc)| using the least squares or maximum likelihood method. The interatomic

vectors of these sites should be checked for correspondence with peaks in the

difference Patterson map, made using differences between the acentric reflections,

such as −+ − PHPH FF for anomalous difference Patterson (or Bijvoet Patterson) maps.

The Harker peaks on the Harker section should be more than 5σ.

With the refined heavy atom positions, the protein phase angles can be

determined. However, it is important to note that it is not formally possible to

determine protein phases exactly from two measurements, such as when the data is

restricted to a single wavelength in the case of SAD or with only one native and heavy

atom derivative in the case of SIR. There is a twofold ambiguity in the estimation of

the protein phase, which presents itself as either the correct protein structure or its

enantiomorph. One method of distinguishing between them requires a high resolution

electron density map to check for the presence of L-amino acids and right handed α-

helices. An alternative method uses the fact that since anomalous scatterers are part of

the structure, there is a higher probability that the protein phases will be similar to

those from the anomalous substructure. Modern crystallographic software such as

SHARP (de La Fortelle and Bricogne, 1997), SOLVE (Terwilliger and Berendzen,

1999) and MLPHARE (Otwinowski, 1991) make use of this approach to provide

realistic estimates of the initial phases and figures of merit. An advantage of using

SAD is that there is no problem of non-isomorphism between different crystals or data

sets. Since radiation damage is a problem in modern synchrotrons during a MAD

32

experiment, it is beneficial to first try a SAD on the peak wavelength, and then add a

second wavelength if the increased non-isomorphism does not negate the benefits of

using multiple wavelengths.

Phase improvement

The first electron density map inspected by crystallographers is usually the

unmodified Fo map obtained directly from phasing the positions of the heavy atoms.

However, for purposes of interpreting and building the structure, several steps of phase

improvement are required. A powerful approach to improve phase distributions

depends on density modification procedures such as those implemented in SOLOMON

(Abrahams and Leslie, 1996), DM (Cowtan, 1999) and RESOLVE (Terwilliger, 2000).

These methods repeatedly modify the initial density and use it to generate a new set of

phases which are combined with the experimental ones. In spite of this, they still

require a complete and accurately measured anomalous data set, especially for

improving the phases of SAD data sets.

There are several density modification tools. The most common calculation

involves solvent flattening with various solvent contents to determine its optimal value.

It can be applied to resolutions down to 5 Å. The principle is based on the localization

of static solvent molecules as a monolayer or double layer adjacent to the protein

surface. Solvent molecules positioned further away are dynamic and their electron

density can be reset to a low constant value. The solvent flattened phases can be used

to detect minor heavy atom sites with low occupancies through difference Fourier

techniques.

The next method requires an estimate of Z, the number of molecules per unit

cell, from the formula proposed by Matthews (1968)

33

MWZVVm ×

=

For most protein crystals, Vm, which is the ratio of unit cell volume V to the molecular

weight MW in Daltons, is about 2.3 Å3 Da-1. The number of molecules in each

asymmetric unit can then be derived from Z and the crystal space group. It is quite

common for protein molecules in each asymmetric unit to be related by non-

crystallographic symmetry (NCS). Once the NCS averaging operators are located, a

NCS mask can be applied and NCS averaging is then performed between symmetry-

related areas of density in the unit cell.

Histogram modification is a technique borrowed from 2D image processing

where the histogram of an electron density map of the protein region is matched to the

expected histogram computed from the map of a well refined protein. The method

works down to 4.5 Å and is weaker than solvent flattening although it performs better

at extending phases to higher resolutions for structures that do not have a large

proportion of heavy metal atoms. The topology of the protein is not important but the

resolution range and solvent content needs to be matched to the computed histogram.

Lastly, multi-resolution modification is an extension of solvent flattening and

histogram matching that allows them to be applied at two resolutions simultaneously.

Refinement

Refinement is the process of improving the fit between the calculated protein

model and its observed structure factors. The R factor is typically 50% for a starting

model obtained from tracing the electron density, rather close to the value of 59% for a

random acentric structure (Wilson, 1950). The R factor is used to estimate errors in a

34

data set. It is usually expressed as the percentage agreement between the observed

F(hkl)calc and calculated structure factors F(hkl)obs.

R = ∑

∑ −

hklobs

hklcalcobs

hklF

hklFhklF

)(

)()(

Refinement lowers the R factor to 10-20% for protein crystals. It consists of

changing the three positional parameters (x, y, z) and temperature factor B, for all

atoms in the structure, except hydrogen atoms. This is because hydrogen atoms have

only one electron and consequently have little influence on X-ray scattering. Small

molecule crystal structure analysis is always overdetermined since there are more

observations than unknowns. However, this ratio is very low in protein crystal

structure refinement, and stereochemical parameters obtained from amino acid and

small peptide structures have to be incorporated in the refinement process as

'observations'. The data from small molecule structures have precise bond lengths,

bond angles, torsion angles and van der Waals contacts; and are assumed to be valid in

proteins. The stereochemical information can be applied as rigid constraints, allowing

only the dihedral angles to vary. These constraints make it difficult to move small parts

of the structure to an ideal position because many angular motions are involved. On the

other hand, the stereochemical parameters can be restrained around a standard value,

controlled by an energy term. Restraints make it easy to move small parts of the

structure, but not entire molecules or domains.

Low resolution Bragg reflections (d ≥8-10 Å), are usually omitted from the

refinement process and bulk solvent correction is applied, since their intensities are

seriously affected by the disordered solvent. Besides, the configuration for MAD and

MIR data collection is not ideal for measuring such data, because the beamstop is

positioned close to the crystal to block the low order reflections, and the shortest

35

possible crystal to detector distance is used to avoid using a helium path. MAD and

MIR data collection techniques are optimized for collecting medium to high resolution

data (d ≤8 Å) where their effects are dominant. Ordered solvent molecules contribute

to high resolution diffraction and are introduced during the refinement process if the

protein crystal diffracts to better than 2.5 Å resolution.

36

37

Materials and methods

Plasmid construction

The relevant DNA segments coding for CHDK, CHDT, CHDv2, CHDv3,

CHDv4, IQ, GRD1, GRD2, GRD2v3, GRDv4 and GRDv5 of rng2 (gene

SPAC4F8.13c, GenBank accession number NP_593860.1) (Table 3) was amplified

from the pREP4X-GST-rng2 (pCDL222.1) construct containing the full length rng2

sequence using the corresponding 5' forward primers and the 3' reverse complement

primers. Residues 1-189 of Rng2 contain the calponin-homology domain (Rng21-189).

The PCR product of CHDK (Table 3) was cloned into the pGEX-KG bacterial

expression vector (Guan and Dixon, 1991) for producing proteins with an amino-

terminal fusion to a Glutathione S-Transferase (GST) tag; whereas the PCR products

of CHDT, CHDv2, CHDv3, CHDv4, IQ, GRD1, GRD2, GRD2v3, GRDv4 and

GRDv5 (Table 3) were cloned into the pRSETC bacterial expression vector

(Invitrogen) for producing amino-terminal polyhistidine (6×His) fusion proteins. In the

case of the GST-fusion protein, there is a 26 amino acid residue linker

(SDLVPR^GSPGISGGGGGINSRLHGST) separating the GST domain from the

protein. It has a glycine rich sequence PGISGGGGG immediately after the thrombin

cleavage site that increases the thrombin cleavage efficiency (Guan and Dixon, 1991).

The linker is cleaved at the indicated site between arginine and glycine by thrombin. In

the His6-CHDT fusion protein, there is an amino-terminal fusion of

MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDRWIP preceding the insert;

whereas the His6-CHDv2, His6-CHDv3, His6-CHDv4, His6-IQ, His6-GRD1, His6-

GRD2, His6-GRD2v2, His6-GRD2v3 and His6-GRD2v4 fusion proteins have an

amino-terminal fusion of MRGSHHHHHHGMAS due to the 5' NheI restriction site.

Construct 5' forward primer 3' reverse complement primer

Rng2 residue number pI/MW (after cleavage for GST fusion proteins)

Expression

pGEX-KG-CHDK (SalI) 5' GACGAC GTCGAC AATGGACGTAAATGTGGG 3' (HindIII) 5' GCGCCG AAGCTT TGA AGTTAGGAAGGATTAC 3'

1-189 6.66/23951.3 +

pRSETC-CHDT (EcoRI) 5' GACGAC GAATTC ATGGACGTAAATGTGGGA 3' (SacI) 5' GCGCCG GAGCTC TCA TAAAGCTTTGAAGTTAGG 3'

1-190 6.19/25788.39 low

pRSETC-CHDv2 (NheI) 5' GACGAC GCTAGC AGAGAAACCCTTCAAGCA 3' (EcoRI) 5' GCGCCG GAATTC TCA CATAGAAAGGAAATAAGA 3'

32-149 6.34/15407.68 -

pRSETC-CHDv3 (NheI) 5' GACGAC GCTAGC GACGTAAATGTGGGATTA 3' (EcoRI) 5' GCTCCG GAATTC TCA TAAAGCTTTGAAGTTAGG 3'

2-190 6.79/23062.39 low

pRSETC-CHDv4 (NheI) 5' GACGAC GCTAGC GACGTAAATGTGGGATTA 3' (EcoRI) 5' GCGCCG GAATTC TCA TAAAGTAGGAGAAGTATA 3'

2-238 8.38/28293.19 low

pRSETC-IQ (NheI)5' GACGAC GCTAGC GAACAAAGTTCTTCTGTC 3' (EcoRI) 5' TCGACC GAATTC TCA CTTATGAAAATTTTGAAG 3'

358-684 10.29/39910.3 -

pRSETC-GRD1 (NheI) 5' GATGAT GCTAGC GAAAATCCCGTTGCTCAG 3' (EcoRI) 5' TCATCA GAATTC TCA GATTGAAAAGATCTCAGA 3'

867-1078 9.22/25804.99 -

pRSETC-GRD2 (NheI) 5' GATGAT GCTAGC GAAAATCCCGTTGCTCAG 3' (NcoI) 5' TCATCA CCATGG TCA AAATTTGCCAACTTCATC 3'

1202-1431 9.69/28594.80 low

pRSETC-GRD2v2 (NheI) 5' GACGAC GCTAGC AAGCTTATTCCTTTTTCT 3' (EcoRI) 5' GCTCCG GAATTC TCA TGAATAAAATTTGGAGAA 3'

1368-1489 9.81/15712.34 -

pRSETC-GRD2v3 (NheI) 5' GACGAC GCTAGC CGGAATAATTACCAAGAT 3' (EcoRI) 5' GCTCCG GAATTC TCA TGAATAAAATTTGGAGAA 3'

1297-1489 10.0/24236.01 low

pRSETC-GRD2v4 (NheI) 5' GACGAC GCTAGC GTTACAGATTCAGACGAA 3' (EcoRI) 5' GCTCCG GAATTC TCA TGAATAAAATTTGGAGAA 3'

1239-1489 9.45/30857.29 low

Table 3: Rng2 plasmids. Restriction sites of the primers are shown in brown, whereas insert sequences are in blue.

37

Protein expression

The plasmids were transformed into Escherichia coli BL21-CodonPlus-RIL (F-

ompT hsdS(rB- mB

-) dcm+ Tetr gal endA Hte [argU ileY leuW Camr]) from Stratagene

and sequenced. A 0.5 ml aliquot of freshly saturated cells was mixed with an equal

portion of a cryoprotectant consisting of 65% glycerol, 0.1 M MgSO4, 25 mM Tris pH

8 and frozen away at -80 °C.

To check the solubiblity of the various recombinant proteins, freshly thawed

cells were inoculated into 2 ml of Luria-Bertani (LB) media supplemented with 100

µg/ml ampicillin and grown in a shaker overnight at 37 °C. The overnight culture was

then inoculated into 50 ml of 2×YT media in a 250 ml conical flask and induced for

three hours with 0.5 mM IPTG at A600 ≈ 0.6. A 25 ml aliquot of the bacterial culture

was harvested by centrifugation (4000 g for 15 mins at 4°C), resuspended in 1 ml of

cold PBS pH 7.4, 2 mM DTT and homogenized in a cold pressure cell with a

FRENCH press (Thermo Spectronic) operated at a gauge pressure of 1500 psi. A 100

µl aliquot of the cell lysate was set aside, and the remainder was centrifuged (10000 g

for 20 mins at 4 °C) to separate the supernatant containing the soluble fraction from

the pellet containing the insoluble fraction. An analyis using SDS-PAGE indicated that

only the pGEX-KG-CHDK plasmid expressing the GST-CHDK (GST-Rng21-189)

fusion protein was both soluble and had sufficient yield (Table 3).

For the purpose of GST-Rng21-189 production, freshly thawed cells were

inoculated into 50 ml of Luria-Bertani (LB) media supplemented with 100 µg/ml

ampicillin and grown in a shaker for seven hours at 37 °C. A 10 ml aliquot was then

used to inoculate 1 liter of 2×YT media in a 5 liter Erlenmeyer flask and induced with

0.5 mM IPTG at A600 = 0.6. The cells were harvested 3 hours later by centrifugation

(4500 g for 10 mins), resuspended in 5 ml of cold PBS pH 7.4, 2 mM DTT and

38

homogenized in a cold pressure cell with a FRENCH press (Thermo Spectronic)

operated at a gauge pressure of 1500 psi. 10 µg ml-1 of DNase and RNase was added to

the lysate and incubated for 30 mins at 4°C on a rocking platform. Centrifugation at

10000 g for 20 mins was used to separate the supernatant from the pellet, which was

discarded. The supernatant was clarified using a 0.45 µm filter.

Protein purification

The GST-Rng21-189 fusion protein was bound to a 5 ml GSTrap FF column

(Pharmacia), washed with 20 mM Tris pH 8, 0.5 M NaCl and eluted with 50 mM Tris,

10 mM reduced glutathione pH 8. Twenty units of thrombin were added to the eluate

and incubated overnight at 4 °C. The fusion protein was cleaved efficiently (Figure 3).

The eluate was dialyzed into 20 mM Tris, 0.5 M NaCl using a 3500 MW cutoff

dialysis tubing (Pierce). Thrombin and the cleaved GST were removed by passing

through a 5 ml HiTrap Benzamidine FF (Pharmacia) column and a 5 ml GSTrap FF

column that was connected in series. Rng21-189 was concentrated to 15 mg ml-1 in 50

mM Tris pH 8, 0.15 M NaCl, 2.5 mM CaCl2 with centrifugal filter devices from

Amicon.

39

Figure 3: Thrombin cleavage of GST-Rng21-189. Lane M, molecular weight standard; Lane 1, before cleavage; Lane 2, eluate containing Rng21-189 after 3 hr cleavage at 4 °C with 20 U of thrombin; Lane 3, eluate using glutathione buffer; Lane 4, purified Rng21-

189.

Protein crystallization

Crystallization screens of Rng21-189 were initially performed with commercially

available solutions (Wizard I and II, deCODE genetics) using the sitting drop vapour

diffusion method. Crystallization setups contained 2 µl of Rng21-189 protein solution

(15 mg ml-1 in 50 mM Tris pH 8, 0.15 M NaCl, 2.5 mM CaCl2) and 2 ul of Wizard I/II

solution. The wells were sealed with clear sealing tape (Hampton Research). A rod

shape crystal was observed in one well from the Wizard II screen. This condition was

slightly optimized by exploring different precipitant concentrations (18-21% PEG),

precipitants (PEG 3000 and 4000), salt concentrations (0.2-0.3 M calcium acetate) and

buffer pHs (pH 7-7.5). Up to five monoclinic crystals of Rng21-189 appeared in each

40

sitting drop well with 21% PEG 3000, 0.3 M Calcium Acetate and 0.1 M Tris pH 7

after incubation at room temperature (22 °C) for eight days. They grew to maximal

dimensions (0.3 × 0.2 × 0.05 mm) within two weeks (Figure 4).

Figure 4: Crystals of Rng21-189. They typically measure 200 µm in the longest dimension.

Data collection

The crystals were first transferred into the reservoir buffer solution to remove

the precipitate. This was followed by 15 mins fixation in 21% PEG 3000, 3%

glutaraldehyde at 15 ºC in a sitting drop tray. This brief fixation prevents crystals with

dimensions greater than 0.2 mm from cracking across the unique axis while soaking in

cryobuffer or in derivatization solutions (Fitzpatrick et al., 1994). My fixation protocol

was modified by doubling the glutaraldehyde concentration and shortening the fixation

times since my crystals did not diffract after overnight fixation. In contrast, extended

cross-linking times of a day or more with lysozyme crystals did not make the crystals

disordered (Wang et al., 1998).

The native crystals were soaked in a cryobuffer containing 21% PEG 3000,

40% glycerol. Bromide derivatives were soaked for 30 mins in 21% PEG 3000, 25%

glycerol, 1 M NaBr. Mercury derivatives were soaked for 2 h in saturated HgCl2, 40%

glycerol. The crystals were mounted on nylon loops, frozen in liquid nitrogen and

41

transported to the synchrotron in a dry shipping dewar (Taylor-Wharton CP-100) for

data collection by the European Synchrotron Radiation Facility (ESRF).

X-ray diffraction data from Rng21-189 were measured under cryogenic

conditions at the UK/EMBL MAD beamline BM14 at the ESRF using a MarCCD 165

mm detector (Mar USA). The data set was collected as a continuous series of 2°

oscillation images covering a rotation range of 200°. Indexing and integration of the

images was performed in DENZO and scaling of the intensity data was performed in

SCALEPACK from the HKL package (Otwinowski and Minor, 1997; Table 4).

Structure determination

A three wavelength MAD data set for the Br-derivative was collected.

However, the inflection and remote wavelengths had sufficient anomalous signal to do

a two wavelength MAD (Gonzalez, 2003). This served as input data to SHELXD

(Sheldrick, 1998) after extracting Bijvoet differences by XPREP (Bruker). SHELXD

was run with 388 Es greater than 1.5 looking for 4 anomalous sites. The six highest

peaks with a correlation coefficient (CC) of 38.41 and a figure-of-merit (PATFOM) of

46.95 were accepted after the Harker peaks (2x, 2z) at the Harker section y = ½

corresponding to the Br positions in a Br-anomalous difference Patterson (or Bijvoet

Patterson) map were examined for verification (Figure 5).

42

Figure 5: Harker section y = ½ corresponding to the Br positions in a Br-anomalous difference Patterson map.

SHARP (de La Fortelle and Bricogne, 1997) was used to refine the six Br sites,

from which the protein phases were derived and used to trace an initial model with

wARP (Perrakis et al., 1999). The map obtained after density modification was readily

interpretable; 147 out of 158 residues could be built automatically using wARP

(Perrakis et al., 1999). The incomplete polyalanine model was visualized with O

(Jones et al., 1991) to fill in missing portions of the main chain and to delete

symmetry-related main chains. At this point side chains were not added owing to

ambiguity while matching the sequence to the Fo map due to the poor side chain

densities along the random coils interspersed between the α-helices.

43

A SAD data set was also collected for the Hg-derivative at a wavelength of

1.653 Å. It had a strong anomalous signal even though the L-III wavelength is around

1 Å. SHELXD was run with 391 Es greater than 1.5 looking for 3 anomalous sites. The

four highest peaks with a CC of 46.67 and a PATFOM of 100.97 were accepted after

the Harker peaks at Harker section y = ½ of the Hg Bijvoet Patterson map were

checked to verify the four Hg sites (Figure 6).

Figure 6: Harker section y = ½ corresponding to the Hg positions in a Hg-anomalous difference Patterson map.

148 out of 158 residues could be built with wARP. The three Hg sites with the

highest occupancies were used to determine the positions of the three cysteine residues

present in the recombinant protein. The fourth residue had a lower occupancy and was

44

not covalently bonded to any residue (Figure 11b). This information allowed me to

match the sequence to the Br and Hg Fo maps with confidence. A Br anomalous

difference Fourier map was used to locate five additional Br sites.

The two structures containing either eleven Br or four Hg atoms but devoid of

water were refined initially with CNS (Brünger et al., 1998). 5% of randomly selected

reflections from the data set were used for evaluating Rfree. The refinement was

continued with SHELXL (Sheldrick and Schneider, 1997) using the same set of

reflections for calculating Rfree and with default restraints applied to geometrical and

atomic displacement parameters. Each cycle of SHELXL refinement consists of ten

steps of conjugate-gradient least-squares minimization followed by inspection of the

model and electron density maps with O. The water molecules added during SHELXL

refinement were processed with unit occupancies and inspected for the presence of

spherical electron density (1σ) in the 2mFo - DFc map, at <3.5 Å (Ippolito et al., 1990)

from suitable hydrogen-bond acceptors or donors. The process was repeated to add,

delete and refine the water positions until no further interpretation of the difference (Fo

– Fc) density contoured at 3σ and -3σ was possible. At the end of refinement, when the

process reached convergence, three cycles of tightly restrained (weight matrix 0.1),

isotropic refinement with REFMAC (Murshudov et al., 1997) was performed using all

reflections, including those previously used for Rfree, to obtain as accurate a model as

possible (Dauter et al., 2001). The refined structure was analyzed with PROCHECK

(Laskowski et al., 1993), and the coordinates of the Br- and Hg-derivatives of Rng232-

189 are deposited in RCSB Protein Data Bank with the respective accession codes,

1P2X and 1P5S. Its crystallographic data are listed in Table 4.

45

Table 4: Data collection, processing, phasing and refinement statistics. Numbers in parentheses refer to the highest resolution shell. Data collection statistics Crystal Rng21-189 + 1 M NaBr Rng21-189 +

saturated HgCl2Temperature (K) 100 100 Wavelength (Å) peak, 0.919087; inflexion, 0.919338; remote,

0.855057 1.653

X-ray source ESRF BM14 ESRF BM14 Data processing statistics Resolution range (Å) processed

Peak, 23-2.21 (2.29-2.21); inflexion, 23-2.21 (2.29-2.21); remote, 23-2.06 (2.13-2.06)

28-2.21 (2.29-2.21)

Spacegroup P21 P21Unit-cell parameters peak wavelength a (Å) 31.266 30.884 b (Å) 68.836 68.666 c (Å) 39.811 35.309 α = γ (°) 90 90 β (°) 105.544 102.514 VM (Å3·Da-1) 2.3 2.0 Solvent content (%) 46.3 39.3 Observed reflections 25342 130550 Unique reflections 7347 (202) 7034 (570) Completeness (%) 89.6 (24.8) 97.4 (81.5) Rmerge (%) a 5.3 (7.7) 8.4 (14.5) I/σ(I) 20.43 (9.39) 32.70 (9.93) MAD/SAD phasing statistics Resolution range (Å) processed

22.667-2.062 27.606-2.216

Accepted reflections 9130 6895 Isomorphous RCullis (centric) b 0.533 - Anomalous RCullis (acentric) c 0.772 0.625 Isomorphous phasing power (acentric) d 1.43 - Anomalous phasing power (acentric) e 1.248 1.915 Figure of merit (acentric) 0.51757 0.50549 Refinement statistics Resolution (Å) range processed

23-2.21 21-2.21

Reflections in working set 7173 6571 Reflections in test set 369 319 Rcryst (%) f 20.7 (27.5) 18.6 (22.0) Rfree (%)

g 26.9 (29.3) 24.4 (30.1) Protein atoms 1303 1303 Derivative atoms 11 4 Water atoms 157 145 R.m.s.d. bond lengths (Å) 0.008 0.007 R.m.s.d. bond angles (°) 1.0 1.2 R.m.s.d. dihedral angles (°) 19.9 23.0 R.m.s.d. improper angles (°) 1.01 1.78 Mean B factor (Å2) 17.6 22.2

46

a Rmerge = ∑

∑ −

hklobs

hklobs

hklI

hklIhklI

)(

)()(

b Isomorphous RCullis (centric) = ∑

∑±

−±

hklPPH

hklcalcHPPH

FF

FFF )(

c Anomalous RCullis (acentric) = ∑

∑±

±±

∆

∆−∆

hklobsPH

hklcalcPHobsPH

F

FF

)(

)()(

d Isomorphous phasing power = ∑

∑−

hklcalcPHobsPH

hklcalcH

FF

F

)()(

)(

e Anomalous phasing power (acentric) = ∑

∑±± ∆−∆

hklcalcPHobsPH

hklifH

FF

F

)()(

")(

f Rcryst = ∑

∑ −

hklobs

hklcalcobs

hklF

hklFhklF

)(

)()(

g Rfree = Rfactor for 5% of reflections not used in refinement (test set).

47

Molecular dynamics simulation in an aqueous system

Production-phase molecular dynamics was performed with the AMBER 7.0

suite of programs (Case et al., 2002) and PMEMD (Duke and Pedersen, 2003), with a

2.0 fs time step under the isothermal-isobaric ensemble (300 K and 1 atm) with the

ff99 force field (Wang et al., 2000), the TIP3P (Jorgensen et al., 1983) model for water,

periodic boundary conditions, the particle mesh Ewald method (PME) (Darden et al.,

1993) for electrostatics, a 10 Å cutoff for Lennard-Jones interactions, and the use of

SHAKE (Ryckaert et al., 1977) for restricting motion of all covalent bonds involving

hydrogen. In an iterative manner, five sodium cations were placed around Rng232-189

near residues with the highest electrostatic potential to compensate the overall negative

charge of the molecule. A total of 8763 TIP3P water molecules were then added

around residues 32-189 of Rng2 in order to end up with a buffer of about 10 Å from

the edge of the periodic box, which resulted in a box size of 72.535 × 70.410 × 72.229

Å. Temperature was maintained by the Berendsen coupling algorithm (Berendsen et al.,

1984) using separate τ coupling constants of 1.0 for the protein and solvent, and

pressure was maintained with isotropic molecule-based scaling (Berendsen et al.,

1984), also with a τ coupling constant of 1.0. The PME grid spacing was about 1.0 Å

and was interpolated on a cubic B-spline, with the direct sum tolerance set to 10-5. The

net center of velocity was removed every 100 ps to correct for the small energy drains

that resulted from the use of SHAKE, discontinuity in the potential energy near the

Lennard-Jones cutoff value and constant pressure conditions.

For equilibration, the solute (Rng232-189) was minimized using the steepest

descent method for the first 200 steps, followed by the conjugate-gradient method until

the root-mean-square (r.m.s.) of the Cartesian elements of the gradient was <0.4 kcal

mol-1 Å-1. Water molecules and sodium ions were then minimized in the same way

48

until the RMS was <0.1 kcal mol-1 Å-1 and then slowly heated, while allowing them to

move unrestrained for 25 ps (with a 1.0 fs time step) in order to fill in any vacuum

pockets. The solute atoms alone were then minimized in the presence of ever

decreasing positional restraints, thereby allowing them to slowly feel the forces of the

equilibrated solvent, until the positional restraints reached zero. Finally, a temperature

ramp was used to gradually raise the temperature of the whole system to 300 K over 20

ps. Coordinates from the trajectories were saved every 5 ps and analyzed using

AMBER 7.0.

Phylogenetic analysis

The boundaries of CH domains within each protein were located using profile

hidden Markov models (profile HMM) (Eddy, 1998) and a multiple sequence

alignment was obtained by running the list of CH domains through CLUSTALX

(Thompson et al., 1997). For the phlygenetic analysis, sites where gaps were

introduced during the multiple sequence alignment were excluded from all pairwise

comparisons. This is because there is no consensus to the problem of judging the order

and number of insertions, deletions or rearrangements of genetic material. To

reconstruct phylogenetic trees from the 117 individual CH domains, the consensus

from five common tree-building methods was used. Neighbour joining (NJ) (Saitou

and Nei, 1987), maximum parsimony (MP) (Eck and Dayhoff, 1966) and maximum

likelihood (ML) (Felsenstein 1981) trees were produced using PHYLIP (Felsenstein,

1989). The minimum evolution (ME) (Rzhetsky and Nei, 1992) tree was constructed

using MEGA2 (Kumar et al., 2001), whereas the quartet maximum likelihood (QML)

(Strimmer and von Haeseler, 1996) tree was constructed using TreePuzzle-5.1

(Schmidt et al., 2002). NJ and ME trees do not assume an evolutionary clock and are

49

unrooted. Default amino acid substitution matrices were used for trees constructed

using PHYLIP and MEGA2. QML trees were based on the VT model of amino acid

sequence evolution (Müller and Vingron, 2000) with a uniform rate heterogeneity.

The significance of internal branches in ME trees was tested by the standard

error test, and the standard errors of branch length were computed by the bootstrap

method implemented in the MEGA2 program. The reliability of each internal branch in

NJ, ME, MP and ML trees was assessed by bootstrapping: 1000 bootstrap samples

were used with the exception of ML where 100 bootstrap samples were used. The

reliability of each internal branch in QML trees was assessed by the proportion of

10000 puzzling steps supporting the branch (Strimmer and von Haeseler 1996).

Generation of figures and tables

The r.m.s.d. values listed in Table 5 were made with the help of Stralign

(http://www.hgc.ims.u-tokyo.ac.jp/service/tooldoc/stralign/intro.html). Other r.m.s.d.

values were calculated with the help of the MMTSB Tool Set (Feig et al., 2001).

Figures 7(a)-7(c) were prepared using ESPript (Gouet et al., 1999). The structural

alignment shown in Figure 7(b) was prepared with STAMP (Russell and Barton, 1992).

Figures 8(a)-8(c), 9, 10(a)-10(e), 11(a)-11(b) and 13 were prepared using the programs

POVScript+ (Fenn et al., 2003) and Raster3D (Merritt and Bacon, 1997). The multiple

structure superposition of CH domains shown in Figures 8(b) and 8(c) were prepared

using Indonesia (http://xray.bmc.uu.se/dennis/), using the brute-force method with a

cutoff of 10 Å and a fragment length of 100 residues. The molecular surface

representation of Rng232-189 sequence conservation as shown in Figure 8(a) was

prepared using both PyMOL (http://pymol.sourceforge.net/) and ProtSkin

(http://www.mcgnmr.ca/ProtSkin/). The surface hydrophobicities and surface

50

electrostatic potential as shown in Figure 8(a) were prepared using PyMOL and DelPhi

(Rocchia et al., 2001). DelPhi uses kT at 298 K as the energy unit and kT/e as the

electrostatic potential unit, where 1 eV = 38.94 kT (298 K). The crystal contacts shown

in Figures 10(a1) and 10(a2) were visualized with the help of Swiss-PdbViewer (Guex

and Peitsch, 1997). Figure 12 was prepared with Phylodendron

(http://iubio.bio.indiana.edu/treeapp/treeprint-form.html), using the phenogram mode.

Results and discussion

Overall structure

Residues 1-189 of S. pombe Rng2, which contain a single calponin-homology

domain, has been crystallized and the structures of a Br- and a Hg-derivative were

determined using multi-wavelength anomalous diffraction (MAD) methods (Wang et

al., 2003; Table 4). There is a single molecule per asymmetric unit in space group P21,

and interpretable electron density was observed from residues 32-189. Residues 1-31

are disordered and no electron density can be seen. The overall structure is compact

and globular. 100% of the residues in both derivatives were either in the 'most

favoured' or 'additional allowed' regions of Ramachandran space (Laskowski et al.,

1993). The overall mean B values are 17.51 and 19.89 Å2 for the Br- and Hg-derivative,

respectively, in reasonable agreement with the Wilson B values of 21.4 and 18.3 Å2.

The respective estimated standard uncertainties (esu) for the B values based on

maximum likelihood are 5.58 and 5.51 Å2.

A multiple sequence alignment of all CH domains known to bind F-actin is

shown in Figure 7(a). There are clearly no conserved residues although hydrophobic

residues are seen to alternate with polar/acidic/basic residues between helices a1-a3

51

and h2-a6. The CH domain resides between residues 40-150, and consists of four core

α-helices (a1, a3, a4, a6) connected by long loops (Figure 8a). There are two short α-

helices, a2 and a5. There are also three short stretches of 310-helices, h1, h2 and h3. α-

Helices a3 and a6 are almost parallel to one another, while a1 and a4 are perpendicular

to each other. The CH domain fold is held together by extensive hydrophobic contacts

among the α-helices in the core helical structure (Figure 8c).

Residues 150-189 do not have a defined secondary structure, except for the α-

helical stretch (a7) between Asp167 to Gln179. It wraps around helix a1, forming a

substructure that resembles neither the extended conformation seen in utrophin, nor the

compact conformation seen in fimbrin, although residues 154-160 forms an

unstructured coil which adopts a substructure similar to dystrophin residues 240-246 in

the carboxyl-terminal portion of the CH2 domain. The primary sequence within

residues 150-189 is unique to Rng2 and cannot be found elsewhere.

Data from fission yeast showed that Rng2 localizes to the F-actin ring which

forms during mitosis (Eng et al., 1998). A F-actin binding function probably exists

within the first 189 residues of Rng2, since an amino-terminal fusion of GFP to Rng21-

189 localizes to cortical actin patches within the growing cell tips (Wachtler et al.,

2003). F-actin cosedimentation assays indicate a direct interaction with residues 1-216

of human IQGAP1 (Fukata et al., 1997), which contain a single CH domain.

Interestingly, residues 150-189 of Rng2 wraps around the proposed actin-binding site

ABS1 and ABS2 (Corrado et al., 1994; Norwood et al., 2000; Figure 2a) via mainly

hydrophobic interactions, suggesting the putative ABS1 and ABS2 in Rng2 are not

involved in binding F-actin. However, this does not rule out the possibility of a direct

interaction between Rng21-189 and F-actin.

52

Figure 7(a): Multiple sequence alignment of calponin-homology domains from proteins known to bind F-actin. The blue square scale at the top shows the relative accessibility of each residue extracted from DSSP (Kabsch and Sander, 1983) - blue with red borders indicate exposed residues with an accessibility value A > 1; blue indicates accessible residues with A > 0.4, cyan indicates intermediate accessibility with 0.1 < A < 0.4, and white indicates buried residues with A < 0.1. The secondary structure of Rng2 and its residue numbering follows below and are shown in black. The predicted secondary structure of Rng2 based on PHD is shown at the bottom in red. The residues corresponding to Rng2 core α-helices (Figure 8c) are highlighted in yellow.

53

Figure 7(b): Structural alignment of all known calponin-homology domains. In the sequence name column, the number following the four digit PDB coordinate entry refers to either a CH1 or CH2 domain; it is absent for single CH domains. The last alphabet is a chain identifier. The cyan square scale at the bottom shows the hydropathic character of each residue according to the algorithm of Kyte and Doolittle (1982) with a window size of 3 - pink indicates hydrophobic residues with a value H > 1.5; grey indicates intermediate hydrophobicity with -1.5 < H < 1.5 and cyan indicates hydrophilic residues with H < -1.5.

Figure 7(c): Multiple sequence alignment of calponin-homology domains from the IQGAP family of proteins.

54

Figure 8: Structure of Rng232-189. (a) Ribbon representation of the crystal structure of Rng232-189 shown in two orientations with relative rotation of 180°. Bromine atoms are represented by spheres coloured according to B factors (blue to red denoting lower to higher values). The amino- and carboxyl-terminal residues are labelled together with α-helices a1-a7 and 310-helices h1-h3. For comparison purposes, the region corresponding to actin-binding sites ABS1 (a1) and ABS2 (h2-a6) of dystrophin CH1 domain (Norwood et al., 2000) are highlighted in green and blue, respectively. The 39 residues carboxyl-terminal to the CH domain (150-189) are highlighted in red. Orientations in (a)-(c) are the same. The molecular surfaces below the ribbon diagram are in the same orientation. Surface residue conservation scores ranging from 0 to 1 are coloured based on a white (not conserved) to red (conserved) scale. Kyte-Doolittle (Kyte and Doolittle, 1982) surface hydrophobicities ranging from -4.5 to 4.5 are coloured based on a cyan (hydrophilic) to wheat (hydrophobic) scale. Electrostatic surface potentials ranging from -155.214 kT/e to 318.183 kT/e are coloured based on a red (acidic) to blue (basic) scale, with a scale factor of 1.

55

Figure 8(b): Cα backbone superposition of the CH domains from the Br-derivative of Rng2 CH, Hg-derivative of Rng2 CH, dystrophin CH1, dystrophin CH2, calponin CH, EB1 CH, fimbrin CH1-1, fimbrin CH1-2, utrophin CH1, utrophin CH2, plectin CH1, plectin CH2 and beta-spectrin CH2. For clarity, only the Br-derivative of Rng232-189 CH (pink), dystrophin CH1 (red), calponin CH (orange) and plectin CH2 (blue) domains are shown here.

56

Figure 8(c): Cα backbone superposition of the four core α-helices belonging to the Br-derivative of Rng232-189 CH (pink), dystrophin CH1 (red), calponin CH (orange) and plectin CH2 (blue) domains. Orientations in (a), (b) and (c) are the same.

57

58

Comparison of CH domain structures

A superposition of the CH domains from Rng2, calponin, EB1, fimbrin,

dystrophin, utrophin, beta-spectrin and plectin (Figure 8b) shows that most of the

structural variations are concentrated at the amino- and carboxyl-terminus; i.e. before

residue 43 in helix a1 of Rng2 and beyond residue 148 in helix a6 of Rng2. The loops

between helices a1-a2, a3-a4 and h2-a6 also had considerable variation, consistent with

the alternative loop conformation found in Rng2 residues 92-99 (Figure 4a).

The Cα backbone for the four core helices containing 47 equivalent atoms of

other solved structures can be superimposed on those of Rng2 with up to 2.33 Å root-

mean-square deviation (r.m.s.d.) (Table 5). The closest match of Rng2 CH domain is

with the CH1 of dystrophin with a r.m.s.d. of 1.19 Å. In contrast, the second CH (CH2)

domain of plastin 3/fimbrin has a r.m.s.d. of 1.75 Å when compared with Rng2. The

CH1 domains of dystrophin, utrophin and plectin are all similar with a r.m.s.d. of

about 0.5 Å. This is consistent with their known actin-binding properties.

Rng2 CH DMD CH1 UTRN CH1 PLEC CH1 CNN CH PLEC CH2 SPTB CH2 DMD CH2 PLS3 CH1 UTRN CH2 PLS3 CH2 EB1 CH

DMD CH1 1.19 -

UTRN CH1 1.28 0.47 -

PLEC CH1 1.30 0.52 0.48 -

CNN CH 1.45 1.54 1.54 1.54 -

PLEC CH2 1.53 1.25 1.18 1.11 1.33 -

SPTB CH2 1.56 1.23 1.16 1.09 1.40 0.43 -

DMD CH2 1.63 1.19 1.14 1.01 1.47 0.52 0.60 -

PLS3 CH1 1.72 1.45 1.29 1.27 1.63 1.41 1.31 1.30 -

UTRN CH2 1.74 1.44 1.35 1.27 1.46 0.74 0.67 0.82 1.33 -

PLS3 CH2 1.75 1.33 1.38 1.20 1.59 1.08 1.02 0.92 1.44 1.25 -

EB1 CH 2.28 2.25 2.36 2.33 2.15 2.19 2.12 2.32 1.84 2.11 2.33 -

Table 5: R.m.s.d. values of the pairwise comparison of all the CH domains using the Cα belonging to the four core helices of each structure. The core helices of Rng232-189 correspond to residues 43-54, 73-81, 99-112 and 133-144 (Figure 8c). Abbreviations: DMD, dystrophin; UTRN, utrophin; PLEC, plectin; CNN, calponin; SPTB, beta-spectrin; PLS3, fimbrin ABD1; EB1, end-binding protein 1.

59

Superposition of the Rng2 CH domain with the calponin CH domain reveals a

relatively large r.m.s.d. of 1.45 Å. The largest deviation is found between the CH

domains of Rng2 and end-binding protein 1 (EB1) (Hayashi and Ikura, 2003), with a

r.m.s.d. of 2.33 Å. It is apparent from Table 5 that the CH domains from Rng2,

calponin and EB1 are quite dissimilar to each other and do not cluster together like

CH1 and CH2 domains. The Rng2 CH domain is more similar to the F-actin binding

CH1 domains, whereas the calponin CH domain is more similar to CH2 domains. In

contrast, the EB1 CH domain is dissimilar to the rest of the CH domains. This agrees

with the different functional roles they play. For instance, Rng2 CH domain

localization correlates with cortical actin patches in fission yeast (Wachtler et al.,

2003), and the CH domain from its mammalian orthologue IQGAP binds F-actin and

calmodulin (Ho et al., 1999); calponin CH domain binds calmodulin, caltropin and

ERK (Wills et al., 1994; Leinweber et al., 1999); and EB1 associates with microtubule

filaments (Hayashi and Ikura, 2003).

The Rng2 CH domain has three 310 helices (h1-h3) that are uniformly three

residues long and thus only one turn in length (Figure 8B). The dipoles are less well

aligned here compared to regular α-helices (Figure 9). The C=O groups are hydrogen

bonded to crystallographic waters and point away from the helix axis. The 310-helices

have less favourable side chain packing and backbone conformation and are usually

found in solvent accessible areas. In the case of Rng2, all the 310-helices are solvent

accessible. They link random coils or helices together at abrupt angles. This is most

clearly seen in helix h2 (Figure 8a).

It has been reported that the crystal structure of proteins subjected to light

cross-linking with glutaraldehyde were identical to those without such treatment

(Fitzpatrick et al., 1994). Thus, no data were collected from a native crystal. No

60

glutaraldehyde cross-links were observed in the electron density map. They form

randomly and should not occur in high densities if the crystals were cross-linked

briefly (Fitzpatrick et al., 1993).

Figure 9: A 310-helical structure between residues 62-64 (helix h1). Water molecules are represented by cyan spheres and dotted lines indicate hydrogen bonds. The 2mFo - DFc map is contoured at 1σ and superimposed on the coordinates.

61

Intermolecular contacts

In the monoclinic crystal of Rng21-189, each molecule is surrounded by ten

symmetry-related molecules within an intermolecular distance of 4 Å (Figure 10a1 and

10a2). The direct intermolecular contacts and those mediated via crystallographic

waters are named macrobonds (Oki et al., 1999), with symmetrically equivalent

macrobonds forming the same interatomic interactions. There are five groups of

macrobonds, labeled A-E. The contacting molecules between macrobond A are related

by a two-fold screw axis along the b axis. Those between macrobond B are related by a

unit translation of the repeating lattice unit along the a axis. Macrobonds A and B

consist of numerous non-covalent interactions. Figures 10(b1) to 10(b3) shows the

different types of interactions in macrobond A. There is a potential salt bridge and a

potential water-mediated hydrogen bond between the ζ-amino group of Arg70 and the

γ-carboxyl group of Asp151 (Figure 10b1). However, the temperature factor of the

water molecule is large (49.65 Å2) suggesting local disorder and a weaker hydrogen

bond. The ζ-amino group of Arg70 is linked via a potential salt bridge to the δ-

carbonyl group of Gln67. A bromide anion with a temperature factor of 24.54 Å2

forms a potential hydrogen bond to the ζ-amino group of Arg175, which is doubly

hydrogen bonded to the carbonyl group of Leu152 (Figure 10b2). There is a nonpolar

cluster (>5 Å) of residues Ile172, Leu152, Phe82, Phe110 and Ile114 in a background

of specific intermolecular hydrogen bonds and salt bridges (Figure 10b3). The inter-

residue distances are longer than the usual distance of 3.7 Å (Matsuura and Chernov,

2003) and are unlikely to be van der Waals contacts.

The contacting molecules between macrobond C are related by a unit

translation along the c axis followed by another unit translation along the a axis

(Figure 10c). There is a potential direct hydrogen bond between the ζ-amino group of

62

Arg99 and the δ-carbonyl group of Glu161. It also has another potential direct

hydrogen bond between the α-imino of Tyr92 and γ-carbonyl group of Asn162.

Macrobond D comprises contacts made by molecules related by a unit

translation along the c axis (Figure 10d). It has a potential water-mediated hydrogen

bond from the γ-carbonyl group of Asn162 to the ε-amino group of Lys131. However

the sigma-A weighted map (Read, 1986) suggests the side chain of Lys131 is

disordered.

Molecules between macrobond E are related by a lattice translation of one unit

along the a axis followed by a two-fold screw axis along the b axis (Figure 10e). It has

a unique occurrence of a potential bromide-mediated hydrogen bond between the α-

imino group of Pro117 and the ζ-amino group of Arg178.

The total numbers of potential direct hydrogen bonds in the Br-derivative with

a length of less than 3.5 Å (Ippolito et al., 1990) between amino acid residue atoms are

5, 2, 3, 0, and 0, respectively, for macrobonds A, B, C, D and E. The numbers of bound

water molecules involved in the intermolecular hydrogen bonds are 24, 24, 8, 1 and 0,

respectively. In line with the relatively high solvent content of protein crystals, water-

mediated hydrogen bonds are clearly responsible for holding the crystal together. It is

interesting to note that the longest dimension of the unit cell extends in the b axis and

involves macrobonds A and A' which has the most direct hydrogen bonds and potential

water-mediated hydrogen bonds. The favourable formation of the periodic bond chain

(PBC) A-A' results in a tendency to elongate in this direction (Hartman, 1973). PBCs

refer to the intermolecular interactions or bonds occuring periodically across a crystal

that contribute to its growth habit. This was indeed observed during the crystal

screening process when many conditions resulted in crystals with a fibrous

63

morphology. The c axis and a axis have about the same length and involve

macrobonds C and D, and B respectively.

The Hg-derivative shows a similar arrangement of intermolecular contacts

(Figure 10a2). This is to be expected since the crystals were derivatized using crystals

grown under the same conditions. The total numbers of potential direct hydrogen

bonds in the Hg-derivative with a length of less than 3.5 Å between amino acid residue

atoms are 8, 6, 6, 6, and 1, respectively for macrobonds A, B, C, D and E. The numbers

of bound water molecules involved in the intermolecular hydrogen bonds are 13, 20, 9,

20 and 4, respectively. The significant increase in the number of potential direct and

water-mediated hydrogen bonds in macrobond D relative to the Br-derivative

correlates with a significant decrease in unit cell dimensions along the c axis.

64

Figure 10: Intermolecular contacts in a P21 crystal of the Rng2 CH domain. (a1) Molecular packing projected from the c axis (upper) and b axis (lower) directions for the Br-derivative. The unit cell (P21) and its related symmetry elements are coloured red. The macrobonds, labelled A-E are represented by dashed lines and circles for bonds parallel and perpendicular to the projection plane respectively. The colouring scheme of the ribbon model of Rng2 follows figure 8(a). (a2) Same as (a1) for the Hg-derivative. The MAD and SAD data sets for the Br- and Hg-derivatives respectively were independently solved, accounting for the different origins which are related by half a unit translation along the a axis. Blue arrows point at the loop between residues 92-99 of Rng2.

65

Figure 10(b)-(d): The 2mFo - DFc map is contoured at 1σ and superimposed on the coordinates. Grey and green carbons indicate separate molecules. Amino acid codes are shown in lower case. The chain number is in upper case next to the residue number. (b1) Chemical environment at the interface of two molecules along macrobond A of the Br-derivative. There is a potential salt bridge and a potential water-mediated hydrogen bond between the ζ-amino group of Arg70 and the γ-carboxyl group of Asp151 The ζ-amino group of Arg70 is linked via a potential salt bridge to the δ-carbonyl group of Gln67.

66

Figure 10(b2): Macrobond A of the Br-derivative. A bromide anion with a temperature factor of 24.54 Å2 forms a potential hydrogen bond to the ζ-amino group of Arg175, which is doubly hydrogen bonded to the carbonyl group of Leu152.

67

Figure 10(b3): Macrobond A of the Br-derivative. There is a nonpolar cluster (>5 Å) of residues Ile172, Leu152, Phe82, Phe110 and Ile114 in a background of specific intermolecular hydrogen bonds and salt bridges.

68

Figure 10(c): Macrobond C of the Br-derivative. There is a potential direct hydrogen bond between the ζ-amino group of Arg99 and the δ-carbonyl group of Glu161. It also has another potential direct hydrogen bond between the α-imino of Tyr92 and γ-carbonyl group of Asn162.

69

Figure 10(d): Macrobond D of the Br-derivative. It has a potential water-mediated hydrogen bond from the γ-carbonyl group of Asn162 to the ε-amino group of Lys131.

70

Figure 10(e): Macrobond E of the Br-derivative. It has a potential bromide-mediated hydrogen bond between the α-imino group of Pro117 and the ζ-amino group of Arg178.

71

Comparison of Br- and Hg-derivatives of Rng232-189

The Br- and Hg-derivatives of Rng21-189 are somewhat different, with a r.m.s.d.

of 2.14 Å between all 159 atoms and 1.37 Å between the Cα atoms. This is mainly

owing to residues Tyr92 to Arg99, which form a loop that points away from helix a2 in

the Hg-derivative, but points towards helix a2 in the Br-derivative (Figure 11a). The

r.m.s.d. between Cα atoms dropped to 0.52 Å after the loop was omitted from the

calculation. The largest deviation among the backbone atoms was seen at Leu96 with

an overall r.m.s.d. of 11.53 Å. This contrasts with a comparison of the orthorhombic

and tetragonal crystal forms of lysozyme, which have r.m.s.d.s that do not exceed 2 Å

(Oki et al., 1999).

There are only three cysteine residues stretching between residues 32-189 of

Rng2. In the Hg-derivative, they are all found to be associated with a Hg atom within

3.2 Å. They have occupancies between 45-64%, low temperature factors not exceeding

9.51 Å2 and show strong density exceeding 5σ in the 2mFo - DFc map (Read, 1986)

(Figure 11b). The fourth Hg atom (Figure 11b, Hg194) is responsible for the

maintaining the loop in an 'open' conformation relative to the Br-derivative. The

residues surrounding the fourth Hg atom are Asn71 (4.36 Å relative to Cβ), Phe98

(3.69 Å to Cζ) and His100 (2.66 Å to Nε). The bond length between Nε of His100 and

Hg194 is short (2.66 Å), suggesting the presence of a metal-ligand bond in some of the

protein molecules in the crystal. This is consistent with Hg194 having a low occupancy

of 30%, a high temperature factor of 35.7 Å2, together with weak electron density not

exceeding 2σ. The Hg anomalous map (Figure 11b, red density) shows a weak peak

and confirms its presence.

In order to compare the crystal structure of both derivatives, molecular

dynamics (MD) simulation was performed for the aqueous solution system of Rng232-

72

189 in the absence of Br and Hg atoms. After the initial relaxation procedure for 45 ps,

the system representing the conformation seen in the Hg-derivative reached a

stationary state within 50 ps after starting the MD simulations, whereas the Br-

derivative took about 1.9 ns (Figure 11c). Furthermore, the Hg-derivative had a r.m.s.d.

of 1.5 Å whereas the Br-derivative had a r.m.s.d. of about 2.5 Å over the 5 ns

simulation time. Although both derivatives had some conformational differences in the

loop between residues 92-99, both structures remained stable and retained their

differences around residues 92-99 throughout the simulation. The average r.m.s.d. of

each residue over the 5 ns simulation relative to the starting point is presented in

Figure 11(d). The loops clearly had more motion than the α-helices. The largest

motions are seen in the loop between Tyr92-Arg99 of the Br-derivative of Rng2,

whereas the Hg-derivative remained relatively stable. The higher r.m.s.d. seen in the

Br-derivative relative to the Hg-derivative is consistent both with the slightly higher

temperature factors (~ 32-51 Å2) at Asn94 of the Br-derivative, compared to the Hg-

derivative (~ 19-24 Å2); and with funnel diagrams depicting the energetic drive to the

native state (Pande et al., 1998). The crystallographic and MD data both suggest that

residues 92-99 switches between the Br- and Hg-derivative conformations, and that

Hg194 had trapped it in the alternative conformation. The thirty-nine residues (150-

189) carboxyl-terminal to the CH domain maintained its conformation throughout the

simulation. The thirty-one residues at the amino-terminus are disordered and were not

included in the simulation.

73

Figure 11: Comparison between Br- and Hg-derivatives of Rng232-189. (a) Ribbon representation of the Br-derivative (red helices and blue coils) superimposed on the Hg-derivative (pink helices and cyan coils). Hg atoms are shown as spheres coloured according to their B factors.

74

Figure 11(b): Environment around the Hg atom near the variable loop from residues Tyr92-Arg99 in the Hg-derivative. The 2mFo - DFc map is shown contoured at 1σ. The Hg anomalous map (red) is superimposed onto the electron density map.

75

Figure 11(c): Mass weighted Cα r.m.s.d. of Rng232-189 as a function of time over the entire trajectory of a 5 ns molecular dynamics simulation for Rng232-189, in the Br-derivative (red) and Hg-derivative (black).

76

Figure 11(d): Main chain (N, Cα, C and O) r.m.s.d. for each residue as a function of time during the same MD. α-Helical sections of the structure are highlighted in green.

77

Calponin-homology domain phylogeny

A phylogenetic tree of 117 CH domains shared by 73 proteins from human,

chicken, mouse, rat, turkey, slime mold, budding yeast, fission yeast and Arabidopsis

was calculated from a consensus of several tree-building methods (Figure 12). The tree

was rooted on the basis of twelve groups of CH domains and is similar to Elena

Korenbaum and Rivero (2002). However, they reported the presence of a novel group

of CH domains (CHc) found in carnitine palmitoyltransferases (CPT) and carnitine

acetyltransferases (CAT) which could not be verified using either BLAST (Altschul et

al., 1990) or hmmsearch (Eddy, 1998) with a CH HMM (Hidden Markov model)

constructed from my multiple sequence alignment of CH domains. Furthermore, there

seems to be no biological basis for the presence of a CH domain in either CAT or CPT

and they were omitted in my analysis. The CH domains from ASPM, NICAL, NUMA

and SPTBN5, which were omitted from previous phylogenetic analyses (Banuelos et

al., 1998; Stradal et al., 1998; Gimona et al., 2002; Korenbaum and Rivero, 2002),

each formed a separate group.

For those proteins that have tandem CH domains (ABDs), the amino-terminal

CH1 domains are more similar to each other than the carboxyl-terminal CH2 domains.

The CHf1 (CH1.1) and CHf3 (CH2.1) domains of fimbrin clustered close to the F-

actin binding CH1 domains. This agrees with data from a genetic analysis of

Saccharomyces cerevisiae Sac6 which provided indirect evidence for the F-actin

binding ability of fimbrin ABD1 and ABD2 (Brower et al., 1995). Similarly, direct

evidence from experiments on AtFim1 showed that the ABD2 binds F-actin (Nakano

et al., 2001). Thus it makes sense that the CHf1 and CHf3 domain of fimbrins

clustered close to the CH1 domains. The CHf2 (CH1.2) domain of fimbrins grouped

with the CH2 domains, whereas the CHf4 (CH2.2) domains grouped close to the

78

79

microtubule-binding MAPRE CH domains (CHe) (Hayashi and Ikura, 2003). This

agrees with experiments showing that CH2 domains generally bind F-actin weakly

(Corrado et al., 1994; Winder et al., 1995).

Most of the proteins that contain a single CH domain cluster together (Figure

12, CH3). In contrast to Gimona et al., 2002, no conserved residues were observed

although an alternating pattern of hydrophobic and polar/ionic residues was present

(Figure 7a). The Rng2 CH domain grouped together with the F-actin binding IQGAP

CH domains (Fukata et al., 1997) in a separate branch distinct from calponin, VAV,

SM22/TAGLN, smoothelin/SMTN and ARHGEF CH domains that do not bind F-

actin. Similarly, residues 1-200 of the human Gas2-related gene on chromosome 22

(hGAR22), which formed a separate branch, contains a CH domain and cosediments

with F-actin (Goriounov et al., 2003). The fact that F-actin binding CH domains from

IQGAP and Gas2 proteins formed two separate branches can be rationalized from the

ability of the CH domain from IQGAP to bind both F-actin and calmodulin (Ho et al.,

1999), whereas the equivalent domain from the Gas2 orthologue hGAR22 only binds

F-actin (Goriounov et al., 2003). The phylogenetic analysis shows that proteins with

single CH domains are a heterogenous group (CH3) with different ligands and

biological functions.

80

Figure 12: Consensus of phylogenies of 117 individual calponin-homology (CH) sequences from 73 proteins. The species represented include Arabidopsis thaliana (at), Dictyostelium discoideum (dd), Gallus gallus (gg), Homo sapiens (hs), Meleagris gallopavo (mg), Mus musculus (mm), Rattus norvegicus (rn), Saccharomyces cerevisiae (sc) and Schizosaccharomyces pombe (sp). The number appended to the species symbol represents the order of the CH domain with 1, 3 being amino-terminal and 2, 4 being carboxyl-terminal. The CH groups were designated following Korenbaum and Rivero (2002). Rng2 (Rng2_sp) is highlighted in dark grey at the top of the figure. The protein codes for the 9 species, together with their accession numbers, are as follows: Rng2_sp (NP_593860); Iqg1_sc (NP_015082); IQGAP1_hs (NP_003861); IQGAP2 (NP_006624); CNN_sp (NP_593863); CNN1_hs (NP_001290); 1h67_gg (P26932); CNN_mg (P37803); CNN3_hs (NP_001830); CNN2_hs (NP_004359); ARHGEF7b_hs (NP_663788); ARHGEF6_hs (NP_004831); LRRN4_hs (NP_002310); CHDC1_hs (XP_166260); LMO7_hs (NP_005349); VAV1_hs (NP_005419); VAV3_hs (NP_006104); VAV2_hs (NP_003362); TAGLN_hs (NP_003177); NP25_hs (NP_037391); TAGLN2_hs (NP_003555); Scp1_sc (NP_015012); GAS2L1_hs (NP_006469); GAR17_hs (NP_644814); GAS2_hs (NP_005247); ASPM_hs (NP_060606); SMTNb_hs (NP_599031); SMTNa_hs (NP_599032); SMTNc_hs (NP_008863); SMTN_mm (NP_038898); MICAL2_hs (NP_055447); MICAL3_hs (XP_032997); TANGN_hs (XP_170658); MIRAB13_hs (NP_203744); NICAL_hs (NP_073602); NUMA1_hs (NP_006176); 1pa7_hs (NP_036457); MAPRE3_hs (NP_036458); MAPRE2_hs (NP_055083); Sac6_sc1 (NP_010414); Fim1_sp1 (NP_596289); PLS1_hs1 (NP_002661); 1aoa1_hs1 (NP_005023); LCP1_hs1 (NP_002289); AtFim1_at1 (NP_200351); ACTN3_hs1 (NP_001095); ACTN4_hs1 (NP_004915); ACTN1_hs1 (NP_001093); ACTN2_hs1 (NP_001094); FLNB_hs1 (NP_001448); FLNA_hs1 (NP_001447); FLNC_hs1 (NP_001449); FLN_gg1 (BAB63943); ctxA_dd1 (AAB62275); abpC_dd1 (P13466); SYNE1_hs1 (NP_149062); CLMN_hs1 (NP_079010); SYNE2_hs1 (NP_055995); 1dxx1_hs1 (NP_003997); 1qag1_hs1 (NP_009055); MACF1a_hs1 (NP_036222); PLEC_rn1 (NP_071796); 1mb81_hs1 (NP_000436); SPTBN1_hs1 (NP_003119); SPTB_hs1 (NP_000338); SPTBN2_hs1 (NP_008877); SPTBN4_hs1 (NP_079489); SPTBN5_hs1 (NP_057726); NAV2_hs (NP_660093); NAV3_hs (NP_055718); PARVG_hs1 (NP_071424); PARVA_hs1 (NP_060692); PARVB_hs1 (NP_037459).

81

82

Concluding remarks: Rng2 structure and function

Rng2 and its mammalian IQGAP orthologues are multifunctional, multidomain

proteins that link the signal transduction pathways with cytoskeletal reorganization

through many ligands, such as F-actin, EF-hand proteins like calmodulin and Cdc4 and

the GTPase Cdc42 (Eng et al., 1998; Ho et al., 1999; Fukata et al., 1997; D’souza et

al., 2001; Epp et al., 1997; McCallum et al., 1996). The structural and phylogenetic

evidence presented here indicates that Rng2 CH domain is most similar to the F-actin

binding CH1 domain of ABD-containing proteins (Table 5, Figures 7, 8 and 12).

Coupled with the in vivo localisation of GFP-Rng21-189 at the same site as cortical actin

patches (Wachtler et al., 2003), it is likely that residues 1-189 of Rng2 are involved in

binding F-actin structures.

The Rng232-189 structure has two flexible regions: residues 1-31, which are

disordered in the crystal structure; and residues 92-99, which switches between two

stable conformations (Figure 11A). Flexible loops are often found at the interface

between a protein and its respective ligand, which can be a small molecule, nucleic

acid or another protein. In the case of Rng2 with F-actin being the in vivo ligand, it is

possible that a molecular surface inclusive of the two flexible regions would represent

the area where they interact. In this scenario, residues 1-31 might become ordered

upon binding F-actin.

Residues 150-189 of Rng2 have no known homologue or orthologue and adopt

a conformation that resembles neither the extended conformation seen in utrophin, nor

the compact conformation seen in fimbrin, although residues 154-160 forms an

unstructured coil which adopts a substructure similar to dystrophin residues 240-246 in

the carboxyl-terminal portion of the CH2 domain. Residues 150-189 wrap around the

proposed actin-binding site ABS1 and ABS2 (Corrado et al., 1994; Norwood et al.

2000) (Figure 2A), suggesting the putative ABS1 and ABS2 in Rng2 are not involved

in binding F-actin. The Rng2 CH domain within residues 43-148 is also unlikely to

bind F-actin independently, since data from the single CH domain of calponin

expressed without its neighbouring residues showed a lack of F-actin binding capacity

(Gimona and Mital, 1998; Mino et al., 1998; Table 6). In the absence of similar data

from the IQGAP family of proteins, it seems likely that the stretch of residues from

150 to 189 of Rng2 may be important for the F-actin binding ability of Rng21-189. The

flexible loop at residues 92-99 and the α-helical stretch within residues 150-189 face

the same direction, potentially defining the molecular surface where Rng232-189

interacts with F-actin. To test its biological importance, mutations to four residues

conserved within the IQGAP family of proteins (Figure 7c) can be generated and

tested for their ability to disrupt Rng2 localisation to the actomyosin ring - L96D

within the flexible loop, R177D within α-helix a7, R70E and K131E within the basic

patch (Figure 13).

Figure 13: Residues shown in stick representation on the ribbon model of Rng232-189 are conserved within the IQGAP family of proteins (Figure 7c). The transparent surface electrostatic potential highlight the presence of a basic patch where Rng232-189 possibly interacts with F-actin.

83

84

A possible model of Rng2 function derived from currently available evidence

has at least one actin-binding domain within residues 1-189. Previous studies on

calponin (Gimona and Mital, 1998; Gimona and Winder, 1998; Mino et al., 1998) and

SM22 (Fu et al., 2000) showed that residues C-terminal to the CH domain were

essential for their localization to F-actin (Table 6). This suggests the presence of

another actin-binding region downstream of the CH domain. A possible location

comes from the following data. Firstly, the EF-hand proteins Cdc4 (D’souza et al.,

2001) and Cam1 (Eng et al., 1998), have been shown to physically interact with Rng2.

In the case of Mlc1, which is another EF-hand protein, crystal structures of complexes

of Mlc1 with IQ motifs from Myo2 and Iqg1 and known (Terrak et al., 2003). Thus,

the second actin-binding region probably resides within the IQ repeats, and should act

in an indirect fashion via Cdc4 or Cam1. Oligomerisation of Rng2 via its downstream

coiled coil domain might result in the generation of an F-actin binding interface via the

dimerisation of its CH domain. Once the medial F-actin spot has formed during mitosis,

the carboxyl-terminal RasGAP domain potentially initiates the assembly of a broad

actomyosin ring through as yet unidentified effectors. Rng2 CH domain probably

serves as vehicle to bring its carboxy-terminal RasGAP domain to the actomyosin ring

during cell division, and activate its signalling function during cytokinesis.

Appendix

Table 6: Biochemical characteristics of several CH domains. Structures deposited in RCSB are in square brackets. CH domains pI [RCSB]

Binding constants (Kd) F-actin bundling Types of interaction Actin-binding sites (ABS)

Mutants (residues changed)

ACTN CH1 (gg) 9.78 ACTN CH2 (gg) 6.07

120-134 Kuhlman et al., 1992

ACTN1 CH1 9.78 ACTN1 CH2 6.07

actinin 1-269 4 µM Way et al., 1992 actinin 25-257 3 µM Leinweber et al., 1999

ABD binds actin hydrophobically. Kuhlman et al., 1992 actinin 25-257 binds ERK Leinweber et al., 1999




mutants had increased F-actin affinity. Kaplan et al., 2001

228, 232, 235Kaplan et al., 2001

85

86

CH domains pI [RCSB]



FLN CH1 (gg) 9.83 FLN CH2 (gg) 4.73

hydrophobic and electrostatic. Lebart et al., 1994

121-147 Lebart et al., 1994

FLNA CH1 9.76 FLNA CH2 4.93

1-1761 (187kD) 3µM Gorlin et al., 1990

filamin C-terminal (2551-2647) needed to crosslink actin. Gorlin et al., 1990

170, 172, 196, 200, 207, 254, 273 Robertson et al., 2003

FLNB CH1 9.69 FLNB CH2 4.77

FLNC CH1 9.86 FLNC CH2 4.53

SPTB 9.8 SPTB CH2 6.04

47-186 5 µM Karinch et al., 1990

182, 220Hassoun et al., 1997

87




SPTBN1 CH1 9.93 SPTBN1 CH2 7.22 [1AA2/1BKR]

beta-spectrin 1-313 26 µM Li and Bennett, 1996 beta-spectrin 1-281 20 µM CH2 (173-182) does not bind actin. Banuelos et al., 1998 minispectrin 2.5 µM Raae et al., 2003

SPTBN2 CH1 9.75 SPTBN2 CH2 6.56



88




DMD CH1 9.63 [1DXX] DMD CH2 [1DXX] 5.84

dystrophin 2-90 100 nM dystrophin 2-246 150 nM Corrado et al., 1994 dystrophin 2-246 13.7 µM Renley et al., 1998

ABD (1-337) can form dimers and bundle F-actin.Fontao et al., 2001

dystrophin 1-246 binds actin hydrophobically. Amann et al., 1998

dystrophin 2-90 Corrado et al., 1994

UTRN CH1 9.70 [1QAG] UTRN CH2 8.22 [1BHD/1QAG]

utrophin 28-261 67 µM Keep et al., 1999a utrophin 1-261 19 µM utrophin 113-371 275 µM Winder et al., 1995

ABD (28-261) is a monomer in solution. Keep, Winder et al., 1999 Unable to crosslink actin. Winder et al., 1995

utrophin 1-261 binds actin hydrophobically. Winder et al., 1995

PLEC1 CH1 10.06 [1MB8] PLEC1 CH2 [1MB8] 4.95

plectin 1-339 0.3 µM Fontao et al., 2001 plectin 59-293 22.3 µM (open) 59.3 µM (closed) Garcia-Alvarez et al., 2003


plectin 65-302 binds β-4 integrin. Geerts et al., 1999

plectin CH1 10.06 plectin CH2 (rn) 4.95

plectin 1-1129 0.32 µM Andra et al., 1998

89




BPAG1n CH1 10.13 BPAG1n CH2 4.68

full length BPAG1n 0.21 µM Yang et al., 1996


MACF1a CH1 10.43 MACF1a CH2 4.82

MACF1 ABD (74-293)Leung et al., 1999

SYNE1 CH1 10.14 SYNE1 CH2 6.15

SYNE2 CH1 8.76 SYNE2 CH2 6.14

SYNE2 1-285 3.8 µM Zhen et al., 2002

ABD (1-285) bundles F-actin. Zhen et al., 2002

ABP-120 CH1 (dd) 9.40 ABP-120 CH2 4.31

ABP-120 89-115 Bresnick et al., 1991

PARVA CH1 5.36 PARVA CH2 5.63

full length alpha-parvin. 8.4 µM Olski et al., 2001

actopaxin 223-372 (CH2 domain) binds paxillin LD1 and LD4 motifs. Full length actopaxin binds actin. Nikolopoulos and Turner, 2000

PARVB CH1 6.08 PARVB CH2 6.39

90




PARVG CH1 5.47 PARVG CH2 5.00

cortexillin I CH1 9.30 cortexillin I CH2 4.74

cortexillin C-terminal(422-444) bundles F-actin.Stock et al., 1999

cortexillin I (1-233)binds F-actin very weakly. Stock et al., 1999

PLS1 CH1 5.08 PLS1 CH2 5.85

PLS1 CH3 9.37 PLS1 CH4 8.74

LCP1 CH1 5.06 LCP1 CH2 5.34

L-plastin binds unpolymerized vimentin subunits. 0.25 µM Correia et al., 1999

L-plastin 143-188 (CH1 domain) binds vimentin. Correia et al., 1999

L-plastin binds vimentin hydrophobically. Correia et al., 1999

LCP1 CH3 9.72

PLS3 CH1 5.53 [1AOA] PLS3 CH2 6.12 [1AOA]

PLS3 CH3 9.37 PLS3 CH4 5.75

91




Fim1 CH1 5.17 Fim1 CH2 9.07

fimbrin 1-429 showsweak binding. Nakano et al., 2001

Fim1 CH3 6.87 Fim1 CH4 6.94

fimbrin 369-614 shows strong binding. Nakano et al., 2001

AtFim1 CH1 (at) 5.87 AtFim1 CH2 6.08

full length fimbrin 0.45 µM (high affinity) 1.9 µM (low affinity) Kovar et al., 2000

AtFim1 CH3 (at) 9.47 AtFim1 CH4 8.80

Sac6 CH1 (sc) 4.86 Sac6 CH2 8.68

electrostaticHonts et al., 1994

fimbrin 140, 145, 252, 259, 264, 268, 379, 380, 383 Brower et al., 1995

Sac6 CH3 (sc) 5.81 Sac6 CH4 5.73

414, 484, 560, 610 Brower et al., 1995

Rng2 CH (sp) 5.75 [1P2X/1P5S]

Rng2 1-189Wachtler et al., 2003

92




IQGAP1 CH 6.47

IQGAP1 918-1657 binds Cdc42. McCallum et al., 1996 IQGAP1 740-869 binds calmodulin. Ho et al., 1999

IQGAP1 1-216 binds F-actin Fukata et al., 1997 IQGAP1 1-232 binds F-actin and calmodulin. Ho et al., 1999

IQGAP2 CH 7.89

GAR22 CH 4.63

GAR22 1-200 Goriounov et al., 2003

MAPRE1 CH 9.33 [1PA7/1UEG]

MAPRE1 binds microtubules. Hayashi and Ikura, 2003

MAPRE1 1-133 Bu and Su, 2003

93




basic calponin CH (gg) 6.07 [1H67]

full length calponin binds ERK6.4 µM Leinweber et al., 1999 full length calponin binds desmin tetramers 3-15 µM Wang et al., 1996 Mabuchi et al., 1997 full length calponin binds the regulatory light chain of myosin. 4.4 µM Szymanski and Goyal, 1999 calponin 2-51 binds calmodulin. 1.8 µM calponin 2-51 binds caltropin. 1.2 µM calponin 45-228 binds calmodulin and caltropin. 0.07 µM Wills et al., 1994 calponin 145-182 binds myosin HMM - 4 µM; S2 - 4.7 µM; LMM - 2.6 µM Szymanski and Tao, 1997

calponin binds F-actin through electrostatic interactions Tang et al., 1997 calponin 7-182 binds desmin. Wang et al., 1996

94




calponin CH (mg)7.46

calponin 7-91 binds ERK Leinweber et al., 1999 calponin 145-163 binds F-actin. calponin 153-163 binds calmodulin caltropin and tropomyosin. Mezgueldi et al., 1995

acidic calponin CH (rn) 8.83

full length calponin binds microtubules 0.3 µM Fattoum et al., 2003

calponin 145-182 and 183-292 binds microtubules. Fattoum et al., 2003

CNN1 CH 6.46

full length calponin 0.24 µM Mino et al., 1998

calponin 141-160 and172-187 Mino et al., 1998

CNN1 CH (mm) calponin 164-273 Gimona et al., 1998

CNN2 CH (mm) calponin 166-275 Gimona et al., 1998

Scp1 CH (sp) 5.37

full length Scp1 0.7 µM Goodman et al., 2003

Scp1 136-200 crosslinks actin via two distinct sites.Goodman et al., 2003

SM22/TAGLN 4.78

170-186 Fu et al., 2000

SMTN (mm) 5.89

1-134535-682 Quensel et al., 2002

ARHGEF6 4.59

ARHGEF6 56-83interacts with PARVB. Rosenberger et al., 2003

95




VAV 7.59

Vav 1-144interacts with zinc-finger (ZF) domain of VAV. Zugaza et al., 2002

References

Abrahams, J.P. and Leslie, A.G.W. (1996). Methods used in the structure determination of bovine mitochondrial F1 ATPase. Acta Crystallogr. D52, 30-42. Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403-410. Amann, K.J., Renley, B.A. and Ervasti, J.M. (1998). A cluster of basic repeats in the dystrophin rod domain binds F-actin through an electrostatic interaction. J. Biol Chem. 273, 28419-28423. Andra, K., Nikolic, B., Stocher, M., Drenckhahn, D. and Wiche, G. (1998). Not just scaffolding: plectin regulates actin dynamics in cultured cells. Genes Dev. 12, 3442-3451. Arai, R. and Mabuchi, I. (2002). F-actin ring formation and the role of F-actin cables in the fission yeast Schizosaccharomyces pombe. J. Cell Sci. 115, 887-898. Bahler, J. and Nurse, P. (2001). Fission yeast Pom1p kinase activity is cell cycle regulated and essential for cellular symmetry during growth and division. EMBO J. 20, 1064-1073. Bahler, J., Steever, A.B., Wheatley, S., Wang, Y., Pringle, J.R., Gould, K.L. and McCollum, D. (1998). Role of polo kinase and Mid1p in determining the site of cell division in fission yeast. J. Cell Biol. 143, 1603-1616. Banuelos, S., Saraste, M. and Carugo, K.D. (1998). Structural comparisons of calponin homology domains: implications for actin binding. Structure 6, 1419-1431. Bashour, A.M., Fullerton, A.T., Hart, M.J. and Bloom, G.S. (1997). IQGAP1, a Rac- and Cdc42-binding protein, directly binds and cross-links microfilaments. J. Cell Biol. 137, 1555-1566. Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S.R., Griffiths-Jones, S., Howe, K.L., Marshall, M. and Sonnhammer, E.L. (2002). The Pfam Protein Families Database. Nucleic Acids Res. 30, 276-280. Berendsen, H.J.C., Postma, J.P.M., van Gunsteren, W.F., DiNola, A. and Haak, J.R. (1984). Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81, 3684-3690. Berlin, A., Paoletti, A. and Chang, F. (2003). Mid2p stabilizes septin rings during cytokinesis in fission yeast. J. Cell Biol. 160, 1083-1092. Bezanilla, M., Wilson, J.M. and Pollard, T.D. (2000). Fission yeast myosin-II isoforms assemble into contractile rings at distinct times during mitosis. Curr. Biol. 10, 397-400.

96

Bezanilla, M. and Pollard, T.D. (2000). Myosin-II tails confer unique functions in Schizosaccharomyces pombe: characterization of a novel myosin-II tail. Mol. Biol. Cell. 11, 79-91. Bramham, J., Hodgkinson, J.L., Smith, B.O., Uhrin, D., Barlow, P.N. and Winder, S.J. (2002). Solution structure of the calponin CH domain and fitting to the 3D-helical reconstruction of F-actin:calponin. Structure 10, 249-258. Bresnick, A.R., Janmey, P.A. and Condeelis, J. (1991). Evidence that a 27-residue sequence is the actin-binding site of ABP-120. J. Biol. Chem. 266, 12989-12993. Brodersen, D.E., de La Fortelle, E., Vonrhein, C., Bricogne, G., Nyborg, J. and Kjeldgaard, M. (2000). Applications of single-wavelength anomalous dispersion at high and atomic resolution. Acta Crystallogr. D56, 431-441. Brower, S.M., Honts, J.E. and Adams, A.E. (1995). Genetic analysis of the fimbrin-actin binding interaction in Saccharomyces cerevisiae. Genetics 140, 91-101. Brünger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Grosse-Kunstleve, R.W., Jiang, J.S., Kuszewski, J., Nilges, M., Pannu, N.S., Read, R.J., Rice, L.M., Simonson, T. and Warren, G.L. (1998). Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D54, 905-921. Bu, W. and Su, L.K. (2003). Characterization of functional domains of human EB1 family proteins. J. Biol. Chem. 278, 49721-49731. Carugo, K.D., Banuelos, S. and Saraste, M. (1997). Crystal structure of a calponin homology domain. Nature Struct. Biol. 4, 175-179. Case, D.A., Pearlman, D.A., Caldwell, J.W., Cheatham III, T.E., Wang, J., Ross, W.S., Simmerling, C.L., Darden, T.A., Merz, K.M., Stanton, R.V., Cheng, A.L., Vincent, J.J., Crowley, M., Tsui, V., Gohlke, H., Radmer, R.J., Duan, Y., Pitera, J., Massova, I., Seibel, G.L., Singh, U.C., Weiner, P.K., and Kollman, P.A. (2002). AMBER 7 (San Francisco: University of California). Castresana, J. and Saraste, M. (1995). Does Vav bind to F-actin through a CH domain? FEBS Lett. 374, 149-151. Cerutti, L. and Simanis, V. (1999). Asymmetry of the spindle pole bodies and spg1p GAP segregation during mitosis in fission yeast. J. Cell Sci. 112, 2313-2321. Chang, F. (1999). Movement of a cytokinesis factor cdc12p to the site of cell division. Curr. Biol. 9, 849-852. Chang, F., Drubin, D. and Nurse, P. (1997). cdc12p, a protein required for cytokinesis in fission yeast, is a component of the cell division ring and interacts with profilin. J. Cell Biol. 137, 169-182.

97

Corrado, K., Mills, P.L. and Chamberlain, J.S. (1994). Deletion analysis of the dystrophin-actin binding domain. FEBS Lett. 344, 255-260. Correia, I., Chu, D., Chou, Y.H., Goldman, R.D. and Matsudaira, P. (1999). Integrating the actin and vimentin cytoskeletons: Adhesion-dependent formation of fimbrin-vimentin complexes in macrophages. J. Cell Biol. 146, 831-842. Cortes, J.C., Ishiguro, J., Duran, A. and Ribas, J.C. (2002). Localization of the (1,3)beta-D-glucan synthase catalytic subunit homologue Bgs1p/Cps1p from fission yeast suggests that it is involved in septation, polarized growth, mating, spore wall formation and spore germination. J. Cell Sci. 115, 4081-4096. Cowtan, K. (1999). Error estimation and bias correction in phase-improvement calculations. Acta Crystallogr. D55, 1555-1567. D'souza, V.M., Naqvi, N.I., Wang, H. and Balasubramanian, M.K. (2001). Interactions of Cdc4p, a myosin light chain, with IQ-domain containing proteins in Schizosaccharomyces pombe. Cell Struct. Funct. 26, 555-565. Darden, T., York, D. and Pedersen, L. (1993). Particle mesh Ewald: An N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 98, 10089-10092. Dauter, Z., Li, M. and Wlodawer, A. (2001). Practical experience with the use of halides for phasing macromolecular structures: a powerful tool for structural genomics. Acta Crystallogr. D57, 239-249. Dauter, Z. and Adamiak, D.A. (2001). Anomalous signal of phosphorus used for phasing DNA oligomer: importance of data redundancy. Acta Crystallogr. D57, 990-995. Demeter, J. and Sazer, S. (1998). imp2, a new component of the actin ring in the fission yeast Schizosaccharomyces pombe. J. Cell Biol. 143, 415-427. Diederichs, K. and Karplus, P.A. (1997). Improved R-factors for diffraction data analysis in macromolecular crystallography. Nature Struct. Biol. 4, 269-275. Duke, R.E. and Pedersen, L.G. (2003). PMEMD 3 (University of North Carolina-Chapel Hill). Dunphy, W.G. and Kumagai, A. (1991). The cdc25 protein contains an intrinsic phosphatase activity. Cell 67, 189-196. Eck, R.V. and Dayhoff, M.O. (1966). Atlas of protein sequence and structure (Silverspring, Maryland: Natl Biomed. Res. Foundation). Eddy, S.R. (1998). Profile hidden Markov models. Bioinformatics 14, 755-763. Ekström, F., Stier, G. and Sauer, U.H. (2003). Crystallization of the actin-binding domain of human α-actinin: analysis of microcrystals of SeMet-labelled protein. Acta Crystallogr. D59, 724-726.

98

Eng, K., Naqvi, N.I., Wong, K.C.Y. and Balasubramanian, M.K. (1998). Rng2p, a protein required for cytokinesis in fission yeast, is a component of the actomyosin ring and the spindle pole body. Curr. Biol. 8, 611-621. Epp, J.A. and Chant, J. (1997). An IQGAP-related protein controls actin-ring formation and cytokinesis in yeast. Curr. Biol. 7, 921-929. Fabbrizio, E., Bonet-Kerrache, A., Leger, J.J. and Mornet, D. (1993). Actin-dystrophin interface. Biochemistry 32, 10457-10463. Fantes, P.A. and Nurse, P. (1978). Control of the timing of cell division in fission yeast. Cell size mutants reveal a second control pathway. Exp. Cell Res. 115, 317-329. Fattoum, A., Roustan, C., Smyczynski, C., Der Terrossian, E. and Kassab, R. (2003). Mapping the microtubule binding regions of calponin. Biochemistry 42, 1274-1282 Feig, M., Karanicolas, J. and Brooks III, C.L. (2001). MMTSB Tool Set, MMTSB (NIH Research Resource, The Scripps Research Institute). Felsenstein, J. (1989). PHYLIP -- Phylogeny Inference Package (Version 3.2). Cladistics 5, 164-166. Felsenstein, J. (1981). Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368-376. Fenn, T.D., Ringe, D. and Petsko, G.A. (2003). POVScript+: a program for model and data visualization using persistence of vision ray-tracing. J. Appl. Cryst. 36, 944-947. Fitzpatrick, P.A., Ringe, D. and Klibanov, A.M. (1994). X-ray crystal structure of cross-linked subtilisin Carlsberg in water vs. acetonitrile. Biochem Biophys Res Commun. 198, 675-681. Fitzpatrick, P.A., Steinmetz, A.C., Ringe, D. and Klibanov, A.M. (1993). Enzyme crystal structure in a neat organic solvent. Proc. Natl Acad. Sci. U S A. 90, 8653-8657. Fontao, L., Geerts, D., Kuikman, I., Koster, J., Kramer, D. and Sonnenberg, A. (2001). The interaction of plectin with actin: evidence for cross-linking of actin filaments by dimerization of the actin-binding domain of plectin. J. Cell Sci. 114, 2065-2076. Fu, Y., Liu, H.W., Forsythe, S.M., Kogut, P., McConville, J.F., Halayko, A.J., Camoretti-Mercado, B. and Solway, J. (2000). Mutagenesis analysis of human SM22: characterization of actin binding. J. Appl. Physiol. 89, 1985-1990. Fukata, M., Kuroda, S., Fujii, K., Nakamura, T., Shoji, I., Matsuura, Y., Okawa, K., Iwamatsu, A., Kikuchi, A. and Kaibuchi, K. (1997). Regulation of cross-linking of actin filament by IQGAP1, a target for Cdc42. J. Biol. Chem. 272, 29579-29583.

99

Galkin, V.E., Orlova, A., VanLoock, M.S., Rybakova, I.N., Ervasti, J.M. and Egelman, E.H. (2002). The utrophin actin-binding domain binds F-actin in two different modes: implications for the spectrin superfamily of proteins. J. Cell Biol. 157, 243-251. García-Alvarez, B., Bobkov, A., Sonnenberg, A. and de Pereda, J.M. (2003). Structural and functional analysis of the actin binding domain of plectin suggests alternative mechanisms for binding to F-actin and integrin beta4. Structure 11, 615-625. Geerts, D., Fontao, L., Nievers, M.G., Schaapveld, R.Q., Purkis, P.E., Wheeler, G.N., Lane, E.B., Leigh, I.M. and Sonnenberg, A. (1999). Binding of integrin alpha6beta4 to plectin prevents plectin association with F-actin but does not interfere with intermediate filament binding. J. Cell Biol. 147, 417-434. Gimona, M., Djinovic-Carugo, K., Kranewitter, W.J. and Winder, S.J. (2002). Functional plasticity of CH domains. FEBS Lett. 513, 98-106. Gimona, M. and Winder, S.J. (1998). Single calponin homology domains are not actin-binding domains. Curr Biol. 8, R674-R675. Gimona, M. and Mital, R. (1998). The single CH domain of calponin is neither sufficient nor necessary for F-actin binding. J. Cell Sci. 111, 1813-1821. Goldsmith, S.C., Pokala, N., Shen, W., Fedorov, A.A., Matsudaira, P. and Almo, S.C. (1997). The structure of an actin-crosslinking domain from human fimbrin. Nature Struct. Biol. 4, 708-712. Gonzalez, A. (2003). Optimizing data collection for structure determination. Acta. Crystallogr. D59, 1935-1942. Goodman, A., Goode, B.L., Matsudaira, P. and Fink, G.R. (2003). The Saccharomyces cerevisiae calponin/transgelin homolog Scp1 functions with fimbrin to regulate stability and organization of the actin cytoskeleton. Mol. Biol. Cell 14, 2617-2629. Goriounov, D., Leung, C.L. and Liem, R.K. (2003). Protein products of human Gas2-related genes on chromosomes 17 and 22 (hGAR17 and hGAR22) associate with both microfilaments and microtubules. J Cell Sci. 116, 1045-1058. Gorlin, J.B., Yamin, R., Egan, S., Stewart, M., Stossel, T.P., Kwiatkowski, D.J. and Hartwig, J.H. (1990). Human endothelial actin-binding protein (ABP-280, nonmuscle filamin): a molecular leaf spring. J. Cell Biol. 111, 1089-1105. Gouet, P., Courcelle, E., Stuart, D.I. and Metoz, F. (1999). ESPript: multiple sequence alignments in PostScript. Bioinformatics 15, 305-308. Guan, K.L. and Dixon, J.E. (1991). Eukaryotic proteins expressed in Escherichia coli: an improved thrombin cleavage and purification procedure of fusion proteins with glutathione S-transferase. Anal. Biochem. 192, 262-267.

100

Guertin, D.A., Trautmann, S. and McCollum, D. (2002). Cytokinesis in eukaryotes. Microbiol. Mol. Biol. Rev. 66, 155-178. Guertin, D.A., Chang, L., Irshad, F., Gould, K.L. and McCollum, D. (2000). The role of the sid1p kinase and cdc14p in regulating the onset of cytokinesis in fission yeast. EMBO J. 19, 1803-1815. Guex, N. and Peitsch, M.C. (1997). SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modeling. Electrophoresis 18, 2714-2723. Hanein, D., Volkmann, N., Goldsmith, S., Michon, A.M., Lehman, W., Craig, R., DeRosier, D., Almo, S. and Matsudaira, P. (1998). An atomic model of fimbrin binding to F-actin and its implications for filament crosslinking and regulation. Nature Struct. Biol. 5, 787-792. Erratum in: (1998). 5, 924. Hanein, D, Matsudaira, P. and DeRosier, D.J. (1997). Evidence for a conformational change in actin induced by fimbrin (N375) binding. J. Cell Biol. 139, 387-396. Hartman, P. (1973). Ch. 14. In Crystal Growth: An Introduction, P. Hartman, ed. (Amsterdam: North-Holland). Hassoun, H., Vassiliadis, J.N., Murray, J., Njolstad, P.R., Rogus, J.J., Ballas, S.K., Schaffer, F., Jarolim, P., Brabec, V. and Palek, J. (1997). Characterization of the underlying molecular defect in hereditary spherocytosis associated with spectrin deficiency. Blood 90, 398-406. Hayashi, I. and Ikura, M. (2003). Crystal structure of the amino-terminal microtubule-binding domain of end-binding protein 1 (EB1). J. Biol Chem. 278, 36430-36434. Ho, Y.D., Joyal, J.L., Li, Z. and Sacks, D.B. (1999). IQGAP1 integrates Ca2+/calmodulin and Cdc42 signalling. J. Biol. Chem. 274, 464-470. Honts, J.E., Sandrock, T.S., Brower, S.M., O'Dell, J.L. and Adams, A.E. (1994). Actin mutations that show suppression with fimbrin mutations identify a likely fimbrin-binding site on actin. J. Cell Biol. 126, 413-422. Hou, M.C., Salek, J. and McCollum, D. (2000). Mob1p interacts with the Sid2p kinase and is required for cytokinesis in fission yeast. Curr. Biol. 10, 619-622. Ippolito, J.A., Alexander, R.S. and Christianson, D.W. (1990). Hydrogen bond stereochemistry in protein structure and function. J. Mol. Biol. 215, 457-471. James, R.W. (1958). The Optical Principles of the Diffraction by X-rays (London: Bell & Sons). Jones, T.A., Zou, J.Y., Cowan, S.W. and Kjeldgaard, M. (1991). Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Cryst. A47, 110-119.

101

Jorgensen, W.L., Chandrasekhar, J., Madura, J. and Klein, M.L. (1983). Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926-935. Kabsch, W. and Sander, C. (1983). Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577-2637. Kaplan, J.M., Kim, S.H., North, K.N., Rennke, H., Correia, L.A., Tong, H.Q., Mathis, B.J., Rodriguez-Perez, J.C., Allen, P.G., Beggs, A.H. and Pollak, M.R. (2001). Mutations in ACTN4, encoding alpha-actinin-4, cause familial focal segmental glomerulosclerosis. Nature Genet. 24, 251-256. Karinch, A.M., Zimmer, W.E. and Goodman, S.R. (1990). The identification and sequence of the actin-binding domain of human red blood cell beta-spectrin. J. Biol. Chem. 265, 11833-11840. Katayama, S., Hirata, D., Arellano, M., Perez, P. and Toda, T. (1999). Fission yeast alpha-glucan synthase Mok1 requires the actin cytoskeleton to localize the sites of growth and plays an essential role in cell morphogenesis downstream of protein kinase C function. J. Cell Biol. 144, 1173-1186. Keep, N.H., Winder, S.J., Moores, C.A., Walke, S., Norwood, F.L. and Kendrick-Jones, J. (1999). Crystal structure of the actin-binding region of utrophin reveals a head-to-tail dimer. Structure 7, 1539-1546. Keep, N.H., Norwood, F.L.M., Moores, C.A., Winder, S.J. and Kendrick-Jones, J. (1999). The 2.0 Å structure of the second calponin homology domain from the actin-binding region of the dystrophin homologue utrophin. J. Mol. Biol. 285, 1257-1264. Kitayama, C., Sugimoto, A. and Yamamoto, M. (1997). Type II myosin heavy chain encoded by the myo2 gene composes the contractile ring during cytokinesis in Schizosaccharomyces pombe. J. Cell Biol. 137, 1309-1319. Korenbaum, E. and Rivero, F. (2002). Calponin homology domains at a glance. J. Cell Sci. 115, 3543-3545. Kovar, D.R., Kuhn, J.R., Tichy, A.L. and Pollard, T.D. (2003). The fission yeast cytokinesis formin Cdc12p is a barbed end actin filament capping protein gated by profilin. J. Cell Biol. 161, 875-887. Kovar, D.R., Staiger, C.J., Weaver, E.A. and McCurdy, D.W. (2000). AtFim1 is an actin filament crosslinking protein from Arabidopsis thaliana. Plant J. 24, 625-636. Kuhlman, P.A., Hemmings, L. and Critchley, D.R. (1992). The identification and characterisation of an actin-binding site in alpha-actinin by mutagenesis. FEBS Lett. 304, 201-206. Kumar, S., Tamura, K., Jakobsen, I.B. and Nei, M. (2001). MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17, 1244-1245.

102

Kutsche, K., Yntema, H., Brandt, A., Jantke, I., Nothwang, H.G., Orth, U., Boavida, M.G., David, D., Chelly, J., Fryns, J.P., Moraine, C., Ropers, H.H., Hamel, B.C., van Bokhoven, H. and Gal, A. (2000). Mutations in ARHGEF6, encoding a guanine nucleotide exchange factor for Rho GTPases, in patients with X-linked mental retardation. Nature Genet. 26, 247-250. Kyte, J. and Doolittle, R. (1982). A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105-132. La Fortelle, E. de and Bricogne, G. (1997). Maximum-likelihood heavy-atom parameter refinement for multiple isomorphous replacement and multiwavelength anomalous diffraction methods. Methods Enzymol. 276, 472-494. Laskowski, R.A., MacArthur, M.W., Moss, D.S. and Thornton, J.M. (1993). PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 26, 283-291. Lebart, M.C., Mejean, C., Casanova, D., Audemard, E., Derancourt, J., Roustan, C. and Benyamin, Y. (1994). Characterization of the actin binding site on smooth muscle filamin. J. Biol. Chem. 269, 4279-4284. Leinweber, B.D., Leavis, P.C., Grabarek, Z., Wang, C.L. and Morgan, K.G. (1999). Extracellular regulated kinase (ERK) interaction with actin and the calponin homology (CH) domain of actin-binding proteins. Biochem. J. 344, 117-123. Leung, C.L., Sun, D. and Liem, R.K. (1999). The intermediate filament protein peripherin is the specific interaction partner of mouse BPAG1-n (dystonin) in neurons. J. Cell Biol. 144, 435-446. Li, X. and Bennett, V. (1996). Identification of the spectrin subunit and domains required for formation of spectrin/adducin/actin complexes. J. Biol. Chem. 271, 15695-15702. Liu, J., Tang, X., Wang, H., Oliferenko, S. and Balasubramanian, M.K. (2002). The localization of the integral membrane protein Cps1p to the cell division site is dependent on the actomyosin ring and the septation-inducing network in Schizosaccharomyces pombe. Mol. Biol. Cell. 13, 989-1000. Mabuchi, K., Li, B., Ip, W. and Tao, T. (1997). Association of calponin with desmin intermediate filaments. J. Biol. Chem. 272, 22662-22666. Marks, J. and Hyams, J.S. (1985). Localization of F-actin through the cell division cycle of Schizosaccharomyces pombe. Eur. J. Cell Biol. 39, 27-32. Matsuura, Y. and Chernov, A.A. (2003). Morphology and the strength of intermolecular contacts in protein crystals. Acta Crystallogr. D59, 1347-1356. Matthews, B.W. (1968). Solvent content of protein crystals. J. Mol. Biol. 33, 491-497.

103

Matynia, A., Dimitrov, K., Mueller, U., He, X. and Sazer, S. (1996). Perturbations in the spi1p GTPase cycle of Schizosaccharomyces pombe through its GTPase-activating protein and guanine nucleotide exchange factor components result in similar phenotypic consequences. Mol. Cell Biol. 16, 6352-6362. Maundrell, K. (1990). nmt1 of fission yeast. A highly transcribed gene completely repressed by thiamine. J. Biol. Chem. 265, 10857-10864. McCallum, S.J., Wu, W.J. and Cerione, R.A. (1996). Identification of a putative effector for Cdc42Hs with high sequence similarity to the RasGAP-related protein IQGAP1 and a Cdc42Hs binding partner with similarity to IQGAP2. J. Biol. Chem. 271, 21732-21737. McCully, E.K. and Robinow, C.F. (1971). Mitosis in the fission yeast Schizosaccharomyces pombe: a comparative study with light and electron microscopy. J. Cell Sci. 9, 475-507. McGough, A., Way, M. and DeRosier, D. (1994). Determination of the alpha-actinin-binding site on actin filaments by cryoelectron microscopy and image analysis. J. Cell Biol. 126, 433-443. Merritt, E.A. and Bacon, D.J. (1997). Raster3D: Photorealistic Molecular Graphics. Methods Enzymol. 277, 505-524. Mezgueldi, M., Mendre, C., Calas, B., Kassab, R. and Fattoum, A. (1995). Characterization of the regulatory domain of gizzard calponin. Interactions of the 145-163 region with F-actin, calcium-binding proteins, and tropomyosin. J. Biol. Chem. 270, 8867-8876. Miller, R., Gallo, S.M., Khalak, H.G. and Weeks, C.M. (1994). SnB: crystal structure determination via Shake-and-Bake. J. Appl. Cryst. 27, 613-621. Mino, T., Yuasa, U., Nakamura, F., Naka, M. and Tanaka, T. (1998). Two distinct actin-binding sites of smooth muscle calponin. Eur. J. Biochem. 251, 262-268. Moores, C.A., Keep, N.H. and Kendrick-Jones, J. (2000). Structure of the utrophin actin-binding domain bound to F-actin reveals binding by an induced fit mechanism. J. Mol. Biol. 297, 465-480. Moreno, S., Klar, A. and Nurse, P. (1991). Molecular genetic analysis of fission yeast Schizosaccharomyces pombe. Methods Enzymol. 194, 795-823. Motegi, F., Nakano, K. and Mabuchi, I. (2000). Molecular mechanism of myosin-II assembly at the division site in Schizosaccharomyces pombe. J. Cell Sci. 113, 1813-1825. Müller, T. and Vingron, M. (2000). Modeling Amino Acid Replacement. J. Comp. Biol. 7, 761-776.

104

Murshudov, G.N., Vagin, A.A. and Dodson, E.J. (1997). Refinement of Macromolecular Structures by the Maximum-Likelihood Method. Acta Crystallogr. D53, 240-255. Nakano, K., Satoh, K., Morimatsu, A., Ohnuma, M. and Mabuchi, I. (2001). Interactions among a fimbrin, a capping protein, and an actin-depolymerizing factor in organization of the fission yeast actin cytoskeleton. Mol. Biol. Cell. 12, 3515-3526. Nikolopoulos, S.N. and Turner, C.E. (2000). Actopaxin, a new focal adhesion protein that binds paxillin LD motifs and actin and regulates cell adhesion. J. Cell Biol. 151, 1435-1448. Norwood, F.L.M., Sutherland-Smith, A.J., Keep, N.H. and Kendrick-Jones, J. (2000). The structure of the N-terminal actin-binding domain of human dystrophin and how mutations in this domain may cause Duchenne or Becker muscular dystrophy. Structure 8, 481-491. Nurse, P., Thuriaux, P. and Nasmyth, K. (1976). Genetic control of the cell division cycle in the fission yeast Schizosaccharomyces pombe. Mol. Gen. Genet. 146, 167-178. Oki, H., Matsuura, Y., Komatsu, H. and Chernov, A.A. (1999). Refined structure of orthorhombic lysozyme crystallized at high temperature: correlation between morphology and intermolecular contacts. Acta Crystallogr. D55, 114-21. Olski, T.M., Noegel, A.A. and Korenbaum, E. (2001). Parvin, a 42 kDa focal adhesion protein, related to the alpha-actinin superfamily. J. Cell Sci. 114, 525-538. Otwinowski, Z. and Minor, W. (1997). Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276, 307-326. Otwinowski, Z. (1991). Isomorphous Replacement and Anomalous Scattering. In Proceedings of the CCP4 Study Weekend, nos. DL/SCI/R32, ISSN 0144-5677. W. Wolf, P.R. Evans and A.G.W. Leslie, eds. (Warrington: Daresbury Laboratory), pp. 80-86. Pande, V.S., Grosberg, A.Y., Tanaka, T. and Rokhsar, D.S. (1998). Pathways for protein folding: is a new view needed? Curr. Opin. Struct. Biol. 8, 68-79. Perrakis, A., Morris, R. and Lamzin, V.S. (1999). Automated protein model building combined with iterative structure refinement. Nature Struct. Biol. 6, 458-463. Prentice, H.L. (1992). High efficiency transformation of Schizosaccharomyces pombe by electroporation. Nucleic Acids Res. 20, 621. Pringle, J.R., Broach, J.R. and Elizabeth, W.J. (1997). The Molecular and Cellular Biology of the Yeast Saccharomyces Vol. 3, Cell Cycle and Cell Biology (New York: Cold Spring Harbor Laboratory Press).

105

Quensel, C., Kramer, J., Cardoso, M.C. and Leonhardt, H. (2002). Smoothelin contains a novel actin cytoskeleton localization sequence with similarity to troponin T. J. Cell Biochem. 85, 403-409. Raae, A.J., Banuelos, S., Ylanne, J., Olausson, T., Goldie, K.N., Wendt, T., Hoenger, A. and Saraste, M. (2003). Actin binding of a minispectrin. Biochim. Biophys. Acta. 1646, 67-76. Rajagopalan, S., Wachtler, V. and Balasubramanian, M. (2003). Cytokinesis in fission yeast: a story of rings, rafts and walls. Trends Genet. 19, 403-408. Read, R.J. (1986). Improved Fourier coefficients for maps using phases from partial structures with errors. Acta Crystallogr. A42, 140-149. Renley, B.A., Rybakova, I.N., Amann, K.J. and Ervasti, J.M. (1998). Dystrophin binding to nonmuscle actin. Cell Motil. Cytoskeleton. 41, 264-270. Robertson, S.P., Twigg, S.R., Sutherland-Smith, A.J., Biancalana, V., Gorlin, R.J., Horn, D., Kenwrick, S.J., Kim, C.A., Morava, E., Newbury-Ecob, R., Orstavik, K.H., Quarrell, O.W., Schwartz, C.E., Shears, D.J., Suri, M., Kendrick-Jones, J., Wilkie, A.O.; OPD-spectrum Disorders Clinical Collaborative Group. (2003). Localized mutations in the gene encoding the cytoskeletal protein filamin A cause diverse malformations in humans. Nature Genet. 33, 487-491. Rosenberger, G., Jantke, I., Gal, A. and Kutsche, K. (2003). Interaction of alphaPIX (ARHGEF6) with beta-parvin (PARVB) suggests an involvement of alphaPIX in integrin-mediated signaling. Hum. Mol. Genet. 12, 155-167. Rupes, I. (2002). Checking cell size in yeast. Trends Genet. 18, 479-485. Russell, P. and Nurse, P. (1987). Negative regulation of mitosis by wee1+, a gene encoding a protein kinase homolog. Cell 49, 559-567. Russell, R.B. and Barton, G.J. (1992). Multiple protein sequence alignment from tertiary structure comparison. PROTEINS: Struct. Funct. Genet. 14, 309-323. Ryckaert, J.P., Ciccotti, G. and Berendsen, H.J.C. (1977). Numerical integration of the cartesian equations of motion of a system with constraints: Molecular dynamics of n-alkanes. J. Computat. Phys. 23, 327-341. Rzhetsky, A. and Nei, M. (1992). A simple method for estimating and testing minimum evolution trees. Mol. Biol. Evol. 9, 945-967. Saitou, N. and Nei, M. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406-425. Schmidt, H.A., Strimmer, K., Vingron, M. and von Haeseler, A. (2002). TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502-504.

106

Schneider, T.R. and Sheldrick, G.M. (2002). Substructure solution with SHELXD. Acta Crystallogr. D58, 1772-1779. Sheldrick, G.M. (1998). SHELX: Applications to Macromolecules. In Direct Methods for Solving Macromolecular Structures, S. Fortier, ed. (Dordrecht: Kluwer Academic Publishers), pp. 401-411. Sheldrick, G.M. and Schneider, T.R. (1997). SHELXL: high resolution refinement. Methods Enzymol. 277, 319–343. Simerly, C., Dominko, T., Navara, C., Payne, C., Capuano, S., Gosman, G., Chong, K.Y., Takahashi, D., Chace, C., Compton, D., Hewitson, L. and Schatten, G. (2003). Molecular correlates of primate nuclear transfer failures. Science 300, 297. Sohrmann, M., Schmidt, S., Hagan, I. and Simanis, V. (1998). Asymmetric segregation on spindle poles of the Schizosaccharomyces pombe septum-inducing protein kinase Cdc7p. Genes Dev. 12, 84-94. Sparks, C.A., Morphew, M. and McCollum, D. (1999). Sid2p, a spindle pole body kinase that regulates the onset of cytokinesis. J. Cell Biol. 146, 777-790. Stock, A., Steinmetz, M.O., Janmey, P.A., Aebi, U., Gerisch, G., Kammerer, R.A., Weber, I. and Faix, J. (1999). Domain analysis of cortexillin I: actin-bundling, PIP(2)-binding and the rescue of cytokinesis. EMBO J. 18, 5274-5284. Stradal, T., Kranewitter, W., Winder, S.J. and Gimona, M. (1998). CH domains revisited. FEBS Lett. 431, 134-137. Strimmer, K. and von Haeseler, A. (1996). Quartet puzzling: A quartet maximum likelihood method for reconstructing tree topologies. Mol. Biol. Evol. 13, 964-969. Szymanski, P.T. and Goyal, R.K. (1999). Calponin binds to the 20-kilodalton regulatory light chain of myosin. Biochemistry 38, 3778-3784. Szymanski, P.T. and Tao, T. (1997). Localization of protein regions involved in the interaction between calponin and myosin. J. Biol. Chem. 272, 11142-11146. Tang, D.C., Kang, H.M., Jin, J.P., Fraser, E.D. and Walsh, M.P. (1996). Structure-function relations of smooth muscle calponin. The critical role of serine 175. J. Biol. Chem. 271, 8605-8611. Tatebayashi, K., Tani, T. and Ikeda, H. (2001). Fission yeast Mog1p homologue, which interacts with the small GTPase Ran, is required for mitosis-to-interphase transition and poly(A)(+) RNA metabolism. Genetics 157, 1513-1522. Terrak, M., Wu, G., Stafford, W.F., Lu, R.C. and Dominguez, R. (2003). Two distinct myosin light chain structures are induced by specific variations within the bound IQ motifs-functional implications. EMBO J. 22, 362-371.

107

Terwilliger, T.C. (2000). Maximum likelihood density modification. Acta Crystallogr. D56, 965-972. Terwilliger, T.C. and Berendzen, J. (1999). Automated MAD and MIR structure solution. Acta Crystallogr. D55, 849-861. Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F. and Higgins, D.G. (1997). The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 24, 4876-4882. Tomlin, G.C., Morrell, J.L. and Gould, K.L. (2002). The spindle pole body protein Cdc11p links Sid4p to the fission yeast septation initiation network. Mol. Biol. Cell. 13, 1203-1214. Trautmann, S., Wolfe, B.A., Jorgensen, P., Tyers, M., Gould, K.L. and McCollum, D. (2001). Fission yeast Clp1p phosphatase regulates G2/M transition and coordination of cytokinesis with cell cycle progression. Curr. Biol. 11, 931-940. Wachtler, V., Rajagopalan, S. and Balasubramanian, M.K. (2003). Sterol-rich plasma membrane domains in the fission yeast Schizosaccharomyces pombe. J. Cell Sci. 116, 867-874. Wang, C.H., Walsh, M., Balasubramanian, M.K. and Dokland, T. (2003). Expression, purification, crystallization and preliminary crystallographic analysis of the calponin-homology domain of Rng2. Acta Crystallogr. D59, 1809-1812. Wang, J., Cieplak, P. and Kollman, P.A. (2000). How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? J. Comput. Chem. 21, 1049-1074. Wang, Z., Zhu, G., Huang, Q., Qian, M., Shao, M., Jia, Y. and Tang, Y. (1998). X-ray studies on cross-linked lysozyme crystals in acetonitrile-water mixture. Biochim. Biophys. Acta. 1384, 335-344. Wang, P. and Gusev, N.B. (1996). Interaction of smooth muscle calponin and desmin. FEBS Lett. 392, 255-258. Way, M., Pope, B. and Weeds, A.G. (1992). Evidence for functional homology in the F-actin binding domains of gelsolin and alpha-actinin: implications for the requirements of severing and capping. J. Cell Biol. 119, 835-842. Weeks, C.M., DeTitta, G.T., Miller, R. and Hauptman, H.A. (1993). Applications of the minimal principle to peptide structures, Acta Crystallogr. D49, 179-181. Wills, F.L., McCubbin, W.D., Gimona, M., Strasser, P. and Kay, C.M. (1994). Two domains of interaction with calcium binding proteins can be mapped using fragments of calponin. Protein Sci. 3, 2311-2321. Wilson, A.J.C. (1950). Largest likely values for the reliability index. Acta Cryst. 3, 397-398.

108

Winder, S.J., Hemmings, L., Maciver, S.K., Bolton, S.J., Tinsley, J.M., Davies, K.E., Critchley, D.R. and Kendrick-Jones J. (1995). Utrophin actin binding domain: analysis of actin binding and cellular targeting. J. Cell Sci. 108, 63-71. Winder, S.J. and Walsh, M.P. (1990). Smooth muscle calponin. Inhibition of actomyosin MgATPase and regulation by phosphorylation. J. Biol. Chem. 265, 10148-10155. Wu, J.Q., Kuhn, J.R., Kovar, D.R. and Pollard, T.D. (2003). Spatial and temporal pathway for assembly and constriction of the contractile ring in fission yeast cytokinesis. Dev. Cell. 5, 723-734. Xu, J., Wirtz, D. and Pollard, T.D. (1998). Dynamic cross-linking by alpha-actinin determines the mechanical properties of actin filament networks. J. Biol. Chem. 273, 9570-9576. Yang, C., Pflugrath, J.W., Courville, D.A., Stence, C.N. and Ferrara, J.D. (2003). Away from the edge: SAD phasing from the sulfur anomalous signal measured in-house with chromium radiation. Acta Crystallogr. D59, 1943-1957. Yang, Y., Dowling, J., Yu, Q.C., Kouklis, P., Cleveland, D.W. and Fuchs, E. (1996). An essential cytoskeletal linker protein connecting actin microfilaments to intermediate filaments. Cell 86, 655-665. Zhen, Y.Y., Libotte, T., Munck, M., Noegel, A.A. and Korenbaum, E. (2002). NUANCE, a giant protein connecting the nucleus and actin cytoskeleton. J. Cell Sci. 115, 3207-3222. Zugaza, J.L., Lopez-Lago, M.A., Caloca, M.J., Dosil, M., Movilla, N. and Bustelo, X.R. (2002). Structural determinants for the biological activity of Vav proteins. J. Biol. Chem. 277, 45377-45392.

109

Documents

X-RAY CRYSTALLOGRAPHY OF YEAST CYTOSKELETAL PROTEINS · 2018-01-09 · x-ray crystallography of fission yeast rng2 wang chern hoe b.sc. (hons.), nus a thesis submitted for the degree