2
Rosetta is a de novo/comparative protein structure modeling algorithm. As one of the top-performing programs for protein structure prediction, it has predicted protein structures with high backbone accuracies. As a test case, we modeled the N-domain of Troponin C (NTnC) at both 4Ò"f and 30Ò"f, for which NMR structures are known. Rosetta-predicted structures of NTnC align with the NMR-determined structures at these two temperatures with ~3A ˚ backbone root mean square deviation. Our approach should be gener- ally applicable to modeling temperature-dependent protein conformational rearrangements. 1178-Pos Board B70 Classification of Amyloidogenic Hexapeptides with Machine Learning Methods Malgorzata Kotulska, Olgierd Unold, Jerzy Stanislawski. Wroclaw University of Technology, Wroclaw, Poland. Amyloids are proteins that form fibrils. Many of them underlie serious dis- eases, like Alzheimer disease. Recent studies indicate that amyloidogenic properties can be associated with short segments of aminoacids, which trans- form the structure when exposed. A few hundreds of such peptides have been experimentally found; experimental testing of all possible aminoacid combi- nations is currently not feasible. Instead, they can be predicted by computa- tional methods. 3D Profile is a physicochemical method that has generated the most numerous dataset. However, it is computationally very demanding. Here, we show that dataset generation can be accelerated. Two methods to increase the classification efficiency of amyloidogenic candidates are pre- sented and tested: simplified 3D profile generation and machine learning methods. We generated a new dataset of hexapeptides, using modified 3D profile algo- rithm, which showed very good classification overlap with ZipperDB (93.5%). The new part of our dataset contains 1779 segments, with 204 clas- sified as amyloidogenic. The dataset was applied for various machine learn- ing methods. The most effective methods were Multilayer Perceptron and Alternating Decision Tree with areas under ROC curve of 0.96, accuracy of 91%, true positive rate of ca. 80%, and true negative rate 95%. A few other machine learning methods also achieved a good performance. We showed that the simplified profile generation method does not introduce an error with regard to the original method, while increasing the computational effi- ciency. Our new dataset proved representative enough to use simple statistical methods for testing the amylogenicity only based on six letter sequences. Sta- tistical machine learning methods such as Alternating Decision Tree and Mul- tilayer Perceptron can replace the energy based classifier, with advantage of very significantly reduced computational time and simplicity to perform the analysis. Additionally, a decision tree provides a set of very easily interpret- able rules. 1179-Pos Board B71 N-Glycan Structure Modeling and in Silico Glycosylation: Template- Based Structure Prediction of Carbohydrate Structures of Glyco- conjugates Sunhwan Jo, Hui sun Lee, George Li, Jeffrey Skolnick, Wonpil Im. The University of Kansas, Lawrence, KS, USA. Obtaining the crystal structure of glycoconjugate is challenging due to the flex- ibility of the carbohydrate chains. Alternatively, computational modeling, which combines the primary sequence information of glycans determined by the mass spectrometry and known N-glycan structure, is an appealing approach. Here we present a survey of N-glycan structures of 35 different glycan se- quences in the PDB, showing that N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background. This suggests that N-glycan chains can be confidently modeled to a glycoprotein if there exists a template N-glycan structure whose parent glycoprotein shares sequence similarity. On the other hand, N-glycan structures found on non- homologous glycoproteins have not shown significant structure similarity. However, despite that the global N-glycan structures are different, the internal substructure of those N-glycans found on the non-homologous glycoproteins, particularly, the substructure that are closer to the protein, showed significantly similar structure. Increased interaction with protein might be responsible to the restricted conformational space of N-glycan chains. Our results so far suggest that computational structure prediction of N-glycan portion of glycoconjugate using structure database would be effective, but different approaches must be needed depending on the availability of template structure. In addition, we also present a database for PDB glycan structural fragments (substructures) as well as PDB glycan-protein database, which are useful for glycan structure modeling. 1180-Pos Board B72 Adsorption of Bone Sialoprotein on Hydroxyapatite-A Combination Study with Bioinformatics and Molecular Dynamics Simulations Yang Yang 1 , Zhijun Xu 2 , Qiang Cui 3 , Nita Sahai 2 . 1 Rowan University, Glassboro, NJ, USA, 2 The University of Akron, Akron, OH, USA, 3 University of Wisconsin-Madison, Madison, WI, USA. Bone sialoprotein (BSP), an acidic non-collagenous protein specific to bone, is proposed previously to modulate hydroxyapatite (HAP) nanocrystal growth. Two highly conserved phosphorylated acidic amino-acid sequences in BSP are hypothesized as the functional motifs. Specifically, we choose one of them, (Sp) 2 E 8 , where Sp represents a phosphoserine, as a model peptide to study the interactions between BSP and the HAP (001) face. A bioinformatics method helps predict the likely peptide conformations adsorbed on the HAP surface, which, subsequently, is subject to further examinations using molecu- lar dynamics simulations with the explicit solvent model. The bioinformatics method predicts a Sp residue binds strongly to the surface, and the Glu residues show propensity to form a helical conformation. Long-time molecular dynamic simulations observe some variations of the sidechain orientations compared to the bioinformatics-predicted conformation, however, the backbone structure and the major binding features are largely preserved. In addition, no apparent geometrical templating between the peptide residues and the studied HAP sur- face sites is noticed, which implies that adsorption and subsequent crystal growth modulation by BSP may be structurally non-specific. 1181-Pos Board B73 Introducing Dinamo: A Package for Calculating Protein Circular Dichroism using Classical Electromagnetic Theory Boris A. Sango 1 , Neville Y. Forlemu 2 , Sandeep Pothuganti 1 , Rahul Nori 1 , Yvonne E. Bongfen 3 , Kathryn A. Thomasson 1 . 1 University of North Dakota, Grand Forks, ND, USA, 2 Georgia Gwinnett College, Lawrenceville, GA, USA, 3 Oklahoma Baptist University, Shawnee, OK, USA. The dipole interaction model is a classical electromagnetic theory that has suc- cessfully been able to reproduce the experimental circular dichroism (CD) for the p-p* transitions for peptides and proteins. This theoretical model, pio- neered by Jon B. Applequist, has been assembled into a package DInaMo that is written in C and Fortran allowing for treatment of whole proteins. The program reads Protein Data Bank formatted files of structures generated by mo- lecular mechanics and molecular dynamics. Simple crystal structures need to at least be energy minimized for use in the model because they do not contain all the hydrogens. DInaMo reduces all the amide chromophores to points with an- isotropic polarizability and all nonchromophoric aliphatic atoms to points with isotropic polarizability; all other atoms are ignored. By determining the inter- actions among the chromophoric and nonchromphoric parts of the molecule us- ing empirically derived polarizabilities, the rotational and dipole strengths are determined leading to the calculation of the CD spectrum for each molecule. Theoretically predicted CD for a variety proteins (lysozyme, myoglobin, insu- lin, and collagen) are compared with synchrotron radiation CD data. Theory agrees with experiment showing bands with similar morphology and absorption maxima for the p-p* transitions. 1182-Pos Board B74 Bridging the Gap between Sequence and Function Alexander Johs. Oak Ridge National Laboratory, Oak Ridge, TN, USA. Structural genomics initiatives have generated a massive quantity of high res- olution structures and sequenced genomes from archaea, bacteria, viruses and eukaryotes. These numbers are expected to grow rapidly within the coming years. The available data constitute an invaluable resource for the prediction of protein structure and function and will allow us to obtain a more comprehen- sive structure-based understanding of biological function. The acquisition of new biochemical functionality in the course of evolution does not necessarily involve the transfer of whole genes, but may be limited to the transfer of func- tional domains. In fact, most of a protein’s amino acids serve structural roles and may exhibit a low degree of sequence conservation even among closely re- lated genomes. Therefore, it is critical to identify and compare structurally re- lated domains across a wide spectrum of organisms to reveal unique metabolic functionalities. This presentation will outline examples that have combined sequence align- ments, homology modeling and biophysical approaches to predict function. In- tegration of models with existing knowledge about genomic context, biochemical pathways and sparse experimental data, such as small-angle X-ray scattering and spectroscopic data, enables us to accurately identify 230a Monday, February 4, 2013

Bridging the Gap between Sequence and Function

Embed Size (px)

Citation preview

230a Monday, February 4, 2013

Rosetta is a de novo/comparative protein structure modeling algorithm. Asone of the top-performing programs for protein structure prediction, it haspredicted protein structures with high backbone accuracies. As a test case,we modeled the N-domain of Troponin C (NTnC) at both 4�"f and 30�"f,for which NMR structures are known. Rosetta-predicted structures of NTnCalign with the NMR-determined structures at these two temperatures with~ 3 A backbone root mean square deviation. Our approach should be gener-ally applicable to modeling temperature-dependent protein conformationalrearrangements.

1178-Pos Board B70Classification of Amyloidogenic Hexapeptides with Machine LearningMethodsMalgorzata Kotulska, Olgierd Unold, Jerzy Stanislawski.Wroclaw University of Technology, Wroclaw, Poland.Amyloids are proteins that form fibrils. Many of them underlie serious dis-eases, like Alzheimer disease. Recent studies indicate that amyloidogenicproperties can be associated with short segments of aminoacids, which trans-form the structure when exposed. A few hundreds of such peptides have beenexperimentally found; experimental testing of all possible aminoacid combi-nations is currently not feasible. Instead, they can be predicted by computa-tional methods. 3D Profile is a physicochemical method that has generatedthe most numerous dataset. However, it is computationally very demanding.Here, we show that dataset generation can be accelerated. Two methods toincrease the classification efficiency of amyloidogenic candidates are pre-sented and tested: simplified 3D profile generation and machine learningmethods.We generated a new dataset of hexapeptides, using modified 3D profile algo-rithm, which showed very good classification overlap with ZipperDB(93.5%). The new part of our dataset contains 1779 segments, with 204 clas-sified as amyloidogenic. The dataset was applied for various machine learn-ing methods. The most effective methods were Multilayer Perceptron andAlternating Decision Tree with areas under ROC curve of 0.96, accuracyof 91%, true positive rate of ca. 80%, and true negative rate 95%. A few othermachine learning methods also achieved a good performance. We showedthat the simplified profile generation method does not introduce an errorwith regard to the original method, while increasing the computational effi-ciency. Our new dataset proved representative enough to use simple statisticalmethods for testing the amylogenicity only based on six letter sequences. Sta-tistical machine learning methods such as Alternating Decision Tree and Mul-tilayer Perceptron can replace the energy based classifier, with advantage ofvery significantly reduced computational time and simplicity to perform theanalysis. Additionally, a decision tree provides a set of very easily interpret-able rules.

1179-Pos Board B71N-Glycan Structure Modeling and in Silico Glycosylation: Template-Based Structure Prediction of Carbohydrate Structures of Glyco-conjugatesSunhwan Jo, Hui sun Lee, George Li, Jeffrey Skolnick, Wonpil Im.The University of Kansas, Lawrence, KS, USA.Obtaining the crystal structure of glycoconjugate is challenging due to the flex-ibility of the carbohydrate chains. Alternatively, computational modeling,which combines the primary sequence information of glycans determined bythe mass spectrometry and known N-glycan structure, is an appealing approach.Here we present a survey of N-glycan structures of 35 different glycan se-quences in the PDB, showing that N-glycan structures found on homologousglycoproteins are significantly conserved compared to the random background.This suggests that N-glycan chains can be confidentlymodeled to a glycoproteinif there exists a template N-glycan structure whose parent glycoprotein sharessequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins have not shown significant structure similarity.However, despite that the global N-glycan structures are different, the internalsubstructure of those N-glycans found on the non-homologous glycoproteins,particularly, the substructure that are closer to the protein, showed significantlysimilar structure. Increased interaction with protein might be responsible to therestricted conformational space of N-glycan chains. Our results so far suggestthat computational structure prediction of N-glycan portion of glycoconjugateusing structure database would be effective, but different approaches must beneeded depending on the availability of template structure. In addition, wealso present a database for PDB glycan structural fragments (substructures)as well as PDB glycan-protein database, which are useful for glycan structuremodeling.

1180-Pos Board B72Adsorption of Bone Sialoprotein on Hydroxyapatite-A Combination Studywith Bioinformatics and Molecular Dynamics SimulationsYang Yang1, Zhijun Xu2, Qiang Cui3, Nita Sahai2.1Rowan University, Glassboro, NJ, USA, 2The University of Akron, Akron,OH, USA, 3University of Wisconsin-Madison, Madison, WI, USA.Bone sialoprotein (BSP), an acidic non-collagenous protein specific to bone, isproposed previously to modulate hydroxyapatite (HAP) nanocrystal growth.Two highly conserved phosphorylated acidic amino-acid sequences in BSPare hypothesized as the functional motifs. Specifically, we choose one ofthem, (Sp)2E8, where Sp represents a phosphoserine, as a model peptide tostudy the interactions between BSP and the HAP (001) face. A bioinformaticsmethod helps predict the likely peptide conformations adsorbed on the HAPsurface, which, subsequently, is subject to further examinations using molecu-lar dynamics simulations with the explicit solvent model. The bioinformaticsmethod predicts a Sp residue binds strongly to the surface, and the Glu residuesshow propensity to form a helical conformation. Long-time molecular dynamicsimulations observe some variations of the sidechain orientations compared tothe bioinformatics-predicted conformation, however, the backbone structureand the major binding features are largely preserved. In addition, no apparentgeometrical templating between the peptide residues and the studied HAP sur-face sites is noticed, which implies that adsorption and subsequent crystalgrowth modulation by BSP may be structurally non-specific.

1181-Pos Board B73Introducing Dinamo: A Package for Calculating Protein CircularDichroism using Classical Electromagnetic TheoryBoris A. Sango1, Neville Y. Forlemu2, Sandeep Pothuganti1, Rahul Nori1,Yvonne E. Bongfen3, Kathryn A. Thomasson1.1University of North Dakota, Grand Forks, ND, USA, 2Georgia GwinnettCollege, Lawrenceville, GA, USA, 3Oklahoma Baptist University, Shawnee,OK, USA.The dipole interaction model is a classical electromagnetic theory that has suc-cessfully been able to reproduce the experimental circular dichroism (CD) forthe p-p* transitions for peptides and proteins. This theoretical model, pio-neered by Jon B. Applequist, has been assembled into a package DInaMothat is written in C and Fortran allowing for treatment of whole proteins. Theprogram reads Protein Data Bank formatted files of structures generated by mo-lecular mechanics and molecular dynamics. Simple crystal structures need to atleast be energy minimized for use in the model because they do not contain allthe hydrogens. DInaMo reduces all the amide chromophores to points with an-isotropic polarizability and all nonchromophoric aliphatic atoms to points withisotropic polarizability; all other atoms are ignored. By determining the inter-actions among the chromophoric and nonchromphoric parts of the molecule us-ing empirically derived polarizabilities, the rotational and dipole strengths aredetermined leading to the calculation of the CD spectrum for each molecule.Theoretically predicted CD for a variety proteins (lysozyme, myoglobin, insu-lin, and collagen) are compared with synchrotron radiation CD data. Theoryagrees with experiment showing bands with similar morphology and absorptionmaxima for the p-p* transitions.

1182-Pos Board B74Bridging the Gap between Sequence and FunctionAlexander Johs.Oak Ridge National Laboratory, Oak Ridge, TN, USA.Structural genomics initiatives have generated a massive quantity of high res-olution structures and sequenced genomes from archaea, bacteria, viruses andeukaryotes. These numbers are expected to grow rapidly within the comingyears. The available data constitute an invaluable resource for the predictionof protein structure and function and will allow us to obtain a more comprehen-sive structure-based understanding of biological function. The acquisition ofnew biochemical functionality in the course of evolution does not necessarilyinvolve the transfer of whole genes, but may be limited to the transfer of func-tional domains. In fact, most of a protein’s amino acids serve structural rolesand may exhibit a low degree of sequence conservation even among closely re-lated genomes. Therefore, it is critical to identify and compare structurally re-lated domains across a wide spectrum of organisms to reveal unique metabolicfunctionalities.This presentation will outline examples that have combined sequence align-ments, homology modeling and biophysical approaches to predict function. In-tegration of models with existing knowledge about genomic context,biochemical pathways and sparse experimental data, such as small-angleX-ray scattering and spectroscopic data, enables us to accurately identify

Monday, February 4, 2013 231a

domains of unknown function. We also show how minor changes in otherwisehighly conserved active sites can significantly affect functionality. There isa growing need for intelligent prediction-based strategies that can tap intoour enormous genomic and structural databases and help bridge the gap be-tween sequence and function.

1183-Pos Board B75Epileptic Seizures-Induced Structural Changes in Rat Spine Bone Tissues:FTIR Microspectroscopic and Chemometric StudySebnem Garip1,2, Deniz Sahin3,4, Mete Severcan1,5, Feride Severcan1,6.1Middle East Technical University, Ankara, Turkey, 2Department ofBiochemistry, Middle East Technical University, Ankara, Turkey, 3KocaeliUniversity Faculty of Medicine, Kocaeli, Turkey, 4Department ofPhysiology, Kocaeli University Faculty of Medicine, Kocaeli, Turkey,5Department of Electrical and Electronics Engineering, Middle EastTechnical University, Ankara, Turkey, 6Department of Biological Sciences,Middle East Technical University, Ankara, Turkey.Epilepsy is a common serious neurodegenerative disease. Bone disorders due toanti-epileptic drug (AED) therapy in epileptic patients have been reported pre-viously. There is no study in the literature, investigating the independent effectof epileptic seizures on bone tissues. Thus, the side-effects of AEDs on bonetissues could not be differentiated from the effects of the epileptic seizures.The current study provides the first report on clarifying the effects of seizureson bones. The experiments performed on genetically epileptic and healthy rats,give the advantage of studying the effects of seizures alone. Cortical region ofspines were studied by FTIR microscopy to investigate the structural and com-positional changes in bones. Comparison of FTIR images belonged to the min-eral and protein parts of bone clearly showed the difference between healthyand epileptic bone tissues. Mineral content was found to be decreased in epilep-tic group compared to the healthy control. Although total carbonate content wasfound to be decreased, B-type carbonate content which substitutes for phos-phate groups in the mineral part of bone, was shown to be increased in epilepticgroup. The organic matrix of bone is mainly composed of collagen proteinswhose structure is stabilized by several intermolecular crosslinks. Collagencross-links ratio was found to be changed critically in epileptic group, indicat-ing an increase in immature crosslinks in the bones of that group. Crystallinityvalue indicating crystal size was found to be increased in epileptic group com-pared to the healthy control. Decreased mineral content and collagen crosslinksand increased crystal size and carbonate substitution, imply a severe damage onbone tissues. Moreover, the epileptic and control groups were separated fromeach other successfully by principle component analysis (PCA) based on theFTIRM data.

1184-Pos Board B76A Spectroscopic Survey of Substituted Indoles Reveals Effects of 1LB

Transition StabilizationXianwei Meng1, Trisheena Harricharran2, Laura J. Juszczak1.1Brooklyn College/The City University of New York, Brooklyn, NY, USA,2The Graduate Center/The City University of New York, New York, NY,USA.Although tryptophan is a natural probe of protein structure, interpretation ofits fluorescence emission spectrum is complicated by the presence of twoelectronic transitions, 1La and 1Lb. Theoretical calculations show thata point charge adjacent to either ring of the indole can shift the emissionmaximum. This study explores the effect of pyrrole and benzyl ring substitu-tions on the transitions’ energy via absorption and fluorescence spectroscopy,and lifetime measurements. The survey of indole derivatives shows thatmethyl substitutions on the pyrrole ring effect 1La and

1Lb energies in tandemwhile benzyl ring substitutions with electrophilic groups lift the 1La/

1Lb

degeneracy. For 5- and 6-hydroxyindole in cyclohexane, 1La and1Lb transi-

tions are resolved (5-hydroxyindole absor-bance, shown, solid line). This findingprovides for 1La origin assignment in theabsorption and excitation spectra for in-dole vapor. The 5-hydroxyindole excita-tion spectrum (dashed line) shows thatdespite a blue-shifted emission spectrum,both the 1La and

1Lb transitions contributeto emission. 10 0 ns fluorescence lifetimesfor 5-hydroxyindole are consistent with acharge acceptor-induced increase in thenonradiative rate.

Enzymes

1185-Pos Board B77Use Nanomechanical Sensor to Detect Cellulase Activities IncludingEnzymatic Decrystallization and Hydrolytic Cleavage on CelluloseWenjian Du, Liming Zhao, Chi Nguyen, Jun Xi.Drexel University, philadelphia, PA, USA.Cellulase is an interfacial enzyme that catalyzes the hydrolytic degradation ofcellulose at the interface between a liquid phase (enzyme) and a solid phase(cellulose substrate). Prior to the hydrolytic cleavage, cellulase utilizes an ac-tivity known as enzymatic decrystallization to break up the solid aggregateof cellulose molecules. The activity of enzymatic decrystallization has notbeen characterized and its mechanism has not been elucidated because veryfew existing experimental approaches are able to examine interfacial enzymaticactivity on solid substrates. Here, we report the development of a novel strategyfor the real-time detection of cellulase activities including enzymatic decrystal-lization and hydrolytic cleavage on cellulose with the use of a nanomechanicalsensor in a microcantilever. We present both kinetic and physical evidence tosupport the decrystallization as a kinetically viable step of cellulose hydrolysisby cellulase. To our knowledge, this is the first use of a nanomechanical sensorto study mechanistic enzymology and heterogeneous enzymatic catalysis thatinvolves a solid substrate. This nanomechanical sensor-based approach willhelp obtain a comprehensive understanding of cellulase actions on cellulose,which would be essential to the success of the development of new cellulaseswith enhanced efficiency for biofuels production.

1186-Pos Board B78Single Enzyme Studies Reveal the Existence of Discrete Functional Statesfor Monomeric Enzymes and How they are ‘‘Selected’’ upon AllostericRegulationNikos S. Hatzakis1, Li Wei1, Sune K. Jorgensen1, Andreas H. Kunding1,Pierre-Yves Bolinger1, Nicky Ehrlich1, Ivan Makarov1, Michael Skjot2,Allan Svendsen2, Per Hedegard3, Samuel M. Walsh1, Dimitrios Stamou1.1University of Copenhagen, Copenhagen, Denmark, 2Novozymes A/S,Department of Protein Biochemistry, Copenhagen, Denmark, 3Nano-ScienceCenter, Niels Bohr Institute, University of Copenhagen, Copenhagen,Denmark.Allosteric regulation of enzymatic activity forms the basis for controllinga plethora of vital cellular processes. While the mechanism underlying regula-tion of multimeric enzymes is generally well understood and proposed to pri-marily operate via conformational selection, the mechanism underlyingallosteric regulation of monomeric enzymes is poorly understood. Here wemonitored for the first time allosteric regulation of enzymatic activity at the sin-gle molecule level (1). We measured single stochastic catalytic turnovers ofa monomeric metabolic enzyme (Thermomyces lanuginosus Lipase) while ti-trating its proximity to a lipid membrane that acts as an allosteric effector.The single molecule measurements revealed the existence of discrete binaryfunctional states that could not be identified in macroscopic measurementsdue to ensemble averaging. The discrete functional states correlate with the en-zyme’s major conformational states and are redistributed in the presence of theregulatory effector. Thus, our data support allosteric regulation of monomericenzymes to operate via selection of preexisting functional states and not via in-duction of new ones.(1) Hatzakis, N. S.; Wei, L.; Jørgensen, S. K.; Kunding, A., H.; Bolinger, P-Y.;Ehrlich, N.; Makarov, I.; Skjot, M.; Svendsen, A.; Hedegard, P.; Stamou, D.(2012). Single Enzyme Studies Reveal the Existence of Discrete FunctionalStates for Monomeric Enzymes and How They Are ‘‘Selected’’ upon AllostericRegulation. J. Am. Chem. Soc.134 (22), 9296-9302.

1187-Pos Board B79Backbone 1H-13C-15N NMR Assignments and Ligand Binding Study ofOMP Synthase from Saccharomyces CerevisiaeMichael R. Hansen1, Eric W. Barr1, Richard Harris2, Kaj Frank Jensen3,Martin Willemoes3, Hong Cheng4, Mark Girvin2, Charles Grubmeyer1.1Temple University School of Medicine, Philadelphia, PA, USA, 2AlbertEinstein College of Medicine of Yeshiva University, Bronx, NY, USA,3University of Copenhagen, Copenhagen, Denmark, 4Fox Chase CancerCenter and Temple University School of Medicine, Philadelphia, PA, USA.Catalysis in OMP synthase (orotate phosphoribosyltransferase, EC 2.4.2.10) iscoupled to the unstructured-to-structured transition of a 10-residue peptideloop. OMP synthase from yeast is a small, homodimeric (49 kDa) and highlystable domain-swapped enzyme that catalyzes the formation of the UMP