5
Proc. Natl. Acad. Sci. USA Vol. 84, pp. 8335-8339, December 1987 Biochemistry Molecular cloning of matrix Gla protein: Implications for substrate recognition by the vitamin K-dependent y-carboxylase (r-carboxyglutamic acid/bone/protein carboxylation) PAUL A. PRICE*, JAMES D. FRASER, AND GABRIELLE METZ-VIRCA Department of Biology, University of California at San Diego, La Jolla, CA 92093 Communicated by Russell F. Doolittle, August 14, 1987 ABSTRACT Matrix Gla protein (MGP), a low molecular weight protein found in bone, dentin, and cartilage, contains 5 residues of the vitamin K-dependent amino acid y-carboxy- glutamic acid (Gla). We have used antibodies raised against MGP and oligonucleotide probes to screen a Agtll cDNA library constructed from the rat osteosarcoma cells (line ROS 17/2) that- had been pretreated with la,25-dihydroxyvitamin D3. By sequencing several cloned cDNAs, we established a 523-base-pair sequence that predicts an 84-residue mature MGP and a 19-residue hydrophobic signal peptide. The 84- residue mature rat MGP predicted from the cDNA sequence has an additional 5 residues at its C terminus (-Arg-Arg-Gly- Ala-Lys) not seen in the sequence of MGP isolated from bovine bone. The structure of rat MGP provides insight into the mechanisms by which the vitamin K-dependent y-carboxylase recognizes substrate. The present studies show that MGP, unlike other vitamin K-dependent proteins, lacks a propeptide. The absence of an MGP propeptide demonstrates that y- carboxylation and secretion of vitamin K-dependent proteins need not be linked to the presence of a propeptide or to its proteolytic removal. The propeptides of other vitamin K- dependent proteins are structurally homologous, and there is evidence that this homologous propeptide domain is important to substrate recognition by the y-carboxylase. Mature MGP has a sequence segment (residues 15-30) that is homologous to the propeptide of other vitamin K-dependent proteins and probably serves the same role in y-carboxylase recognition. Rat MGP also has a second sequence that has recently been identified in all known vitamin K-dependent vertebrate pro- teins, the invariant unit Glu-Xaa-Xaa-Xaa-Glu-Xaa-Cys (EXXXEXC). Since the glutamic residues in this unit are sites of y-carboxylation, it has been suggested that the EXXXEXC unit could allow the y-carboxylase to discriminate between substrate and product. The demonstration that two structures common to vitamin K-dependent proteins, the homologous propeptide domain and the invariant EXXXEXC unit, are in mature MGP indicates that des-y-carboxy-MGP should be an excellent in vitro y-carboxylase substrate for analysis of mech- anisms involved in substrate recognition and product dissocia- tion. Matrix Gla protein (MGP) is a 79-residue protein, found in bone, dentin, and cartilage, that contains 5 residues of the vitamin K-dependent amino acid, y-carboxyglutamic acid (Gla) (1-3). The C-terminal domain of bovine MGP (residues 31-79) has 20% sequence identity to the 49-residue structure of bone Gla protein (BGP), the only other vitamin K- dependent protein isolated from bone. Although this degree of sequence identity is at the threshold of statistical signifi- cance, the fact that specific amino acid residues conserved in BGP from all species examined to date are also found in MGP indicates that the C-terminal domain of MGP and the secreted form of BGP arose from a common ancestor by gene duplication and subsequent divergent evolution (2). MGP associates with the organic matrix of cartilage and bone and is virtually water-insoluble (<10 Ag/ml) in the absence of denaturants, a property that is noteworthy given its high percentage of hydrophilic amino acids and its small size (1-4). Although the function of MGP is unknown, vitamin K-deficient rats develop excessive mineralization of growth- plate cartilage, which indicates that a vitamin K-dependent cartilage protein, possibly MGP, normally prevents cartilage mineralization (5). In clonal osteoblastic cells, MGP pro- duction is regulated by la,25-dihydroxyvitamin D3 [1,25- (OH)2D3] (6, 7). In the present study, we have undertaken the molecular cloning of rat MGP in order to generate probes for further investigation of its biosynthesis and hormonal regu- lation and in order to obtain the complete amino acid sequence of the initial translation product.t The results identified the probable structures involved in substrate rec- ognition by the y-carboxylase and the relationship between the y-carboxylase binding site and the location of glutamic residues that are modified. EXPERIMENTAL PROCEDURES Antibody Screening. Antibodies raised in rabbits against bovine MGP (4) were purified by affinity adsorption to rat MGP-Sepharose 4B, elution with 4 M guanidine hydrochlo- ride, and dialysis against phosphate-buffered saline at 40C. This antibody preparation, which crossreacts with rat MGP (4), was used at a protein concentration of 1.5 ,ug/ml to screen our rat osteosarcoma (ROS) 17/2 Xgtll cDNA library (8) with the ProtoBlot immunoscreening system (Promega Biotec, Madison, WI). This library was prepared with mRNA from ROS 17/2 cells that had been treated with 1,25-(OH)2D3 at 1 ng/ml for 54 hr (8), a treatment time that we have shown is sufficient to induce MGP expression by ROS 17/2 cells (7). Positive clones were plaque-purified, subcloned into plasmid pUC8, and sequenced by the dideoxynucleotide chain- termination method on double-stranded DNA (9). Screening with Labeled Probes. Additional MGP clones were obtained by screening (8) our ROS 17/2 library with MGP-1 DNA, the largest insert identified by screening with antibody; labeling by nick-translation, nitrocellulose filter preparation, hybridization, and washing have been described (8, 10, 11). In a separate approach to MGP cloning, we also probed our ROS 17/2 Xgtll cDNA library with oligonucle- otides corresponding to segments of the N-terminal sequence of rat MGP. MGP was isolated from rat bone (1, 4) and Abbreviations: MGP, matrix Gla protein; BGP, bone Gla protein; 1,25-(OH)2D3, 1a,25-dihydroxyvitamin D3. *To whom reprint requests should be addressed. tThis sequence is being deposited in the EMBL/GenBank data base (Bolt, Beranek, and Newman Laboratories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg) (accession no. J03026). 8335 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. Downloaded by guest on May 19, 2021

Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

Proc. Natl. Acad. Sci. USAVol. 84, pp. 8335-8339, December 1987Biochemistry

Molecular cloning of matrix Gla protein: Implications for substraterecognition by the vitamin K-dependent y-carboxylase

(r-carboxyglutamic acid/bone/protein carboxylation)

PAUL A. PRICE*, JAMES D. FRASER, AND GABRIELLE METZ-VIRCADepartment of Biology, University of California at San Diego, La Jolla, CA 92093

Communicated by Russell F. Doolittle, August 14, 1987

ABSTRACT Matrix Gla protein (MGP), a low molecularweight protein found in bone, dentin, and cartilage, contains 5residues of the vitamin K-dependent amino acid y-carboxy-glutamic acid (Gla). We have used antibodies raised againstMGP and oligonucleotide probes to screen a Agtll cDNAlibrary constructed from the rat osteosarcoma cells (line ROS17/2) that- had been pretreated with la,25-dihydroxyvitaminD3. By sequencing several cloned cDNAs, we established a523-base-pair sequence that predicts an 84-residue matureMGP and a 19-residue hydrophobic signal peptide. The 84-residue mature rat MGP predicted from the cDNA sequencehas an additional 5 residues at its C terminus (-Arg-Arg-Gly-Ala-Lys) not seen in the sequence ofMGP isolated from bovinebone. The structure of rat MGP provides insight into themechanisms by which the vitamin K-dependent y-carboxylaserecognizes substrate. The present studies show that MGP,unlike other vitamin K-dependent proteins, lacks a propeptide.The absence of an MGP propeptide demonstrates that y-carboxylation and secretion of vitamin K-dependent proteinsneed not be linked to the presence of a propeptide or to itsproteolytic removal. The propeptides of other vitamin K-dependent proteins are structurally homologous, and there isevidence that this homologous propeptide domain is importantto substrate recognition by the y-carboxylase. Mature MGPhas a sequence segment (residues 15-30) that is homologous tothe propeptide of other vitamin K-dependent proteins andprobably serves the same role in y-carboxylase recognition. RatMGP also has a second sequence that has recently beenidentified in all known vitamin K-dependent vertebrate pro-teins, the invariant unit Glu-Xaa-Xaa-Xaa-Glu-Xaa-Cys(EXXXEXC). Since the glutamic residues in this unit are sitesof y-carboxylation, it has been suggested that the EXXXEXCunit could allow the y-carboxylase to discriminate betweensubstrate and product. The demonstration that two structurescommon to vitamin K-dependent proteins, the homologouspropeptide domain and the invariant EXXXEXC unit, are inmature MGP indicates that des-y-carboxy-MGP should be anexcellent in vitro y-carboxylase substrate for analysis of mech-anisms involved in substrate recognition and product dissocia-tion.

Matrix Gla protein (MGP) is a 79-residue protein, found inbone, dentin, and cartilage, that contains 5 residues of thevitamin K-dependent amino acid, y-carboxyglutamic acid(Gla) (1-3). The C-terminal domain of bovine MGP (residues31-79) has 20% sequence identity to the 49-residue structureof bone Gla protein (BGP), the only other vitamin K-dependent protein isolated from bone. Although this degreeof sequence identity is at the threshold of statistical signifi-cance, the fact that specific amino acid residues conserved inBGP from all species examined to date are also found in MGP

indicates that the C-terminal domain ofMGP and the secretedform of BGP arose from a common ancestor by geneduplication and subsequent divergent evolution (2). MGPassociates with the organic matrix of cartilage and bone andis virtually water-insoluble (<10 Ag/ml) in the absence ofdenaturants, a property that is noteworthy given its highpercentage of hydrophilic amino acids and its small size(1-4). Although the function of MGP is unknown, vitaminK-deficient rats develop excessive mineralization of growth-plate cartilage, which indicates that a vitamin K-dependentcartilage protein, possibly MGP, normally prevents cartilagemineralization (5). In clonal osteoblastic cells, MGP pro-duction is regulated by la,25-dihydroxyvitamin D3 [1,25-(OH)2D3] (6, 7). In the present study, we have undertaken themolecular cloning of rat MGP in order to generate probes forfurther investigation of its biosynthesis and hormonal regu-lation and in order to obtain the complete amino acidsequence of the initial translation product.t The resultsidentified the probable structures involved in substrate rec-ognition by the y-carboxylase and the relationship betweenthe y-carboxylase binding site and the location of glutamicresidues that are modified.

EXPERIMENTAL PROCEDURESAntibody Screening. Antibodies raised in rabbits against

bovine MGP (4) were purified by affinity adsorption to ratMGP-Sepharose 4B, elution with 4 M guanidine hydrochlo-ride, and dialysis against phosphate-buffered saline at 40C.This antibody preparation, which crossreacts with rat MGP(4), was used at a protein concentration of 1.5 ,ug/ml to screenour rat osteosarcoma (ROS) 17/2 Xgtll cDNA library (8) withthe ProtoBlot immunoscreening system (Promega Biotec,Madison, WI). This library was prepared with mRNA fromROS 17/2 cells that had been treated with 1,25-(OH)2D3 at 1ng/ml for 54 hr (8), a treatment time that we have shown issufficient to induce MGP expression by ROS 17/2 cells (7).Positive clones were plaque-purified, subcloned into plasmidpUC8, and sequenced by the dideoxynucleotide chain-termination method on double-stranded DNA (9).

Screening with Labeled Probes. Additional MGP cloneswere obtained by screening (8) our ROS 17/2 library withMGP-1 DNA, the largest insert identified by screening withantibody; labeling by nick-translation, nitrocellulose filterpreparation, hybridization, and washing have been described(8, 10, 11). In a separate approach to MGP cloning, we alsoprobed our ROS 17/2 Xgtll cDNA library with oligonucle-otides corresponding to segments of the N-terminal sequenceof rat MGP. MGP was isolated from rat bone (1, 4) and

Abbreviations: MGP, matrix Gla protein; BGP, bone Gla protein;1,25-(OH)2D3, 1a,25-dihydroxyvitamin D3.*To whom reprint requests should be addressed.tThis sequence is being deposited in the EMBL/GenBank data base(Bolt, Beranek, and Newman Laboratories, Cambridge, MA, andEur. Mol. Biol. Lab., Heidelberg) (accession no. J03026).

8335

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Dow

nloa

ded

by g

uest

on

May

19,

202

1

Page 2: Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

Proc. Natl. Acad. Sci. USA 84 (1987)

subjected to N-terminal sequence analysis for 33 steps asdescribed (2). The N-terminal sequence of rat MGP isTyr-Gla-Ser-His-Glu-Ser-Met-Glu-Ser-Tyr-Glu-Val-Ser-Pro-Phe-Thr-Asn-Arg-Arg-Asn-Ala-Asn-Thr-Phe-Ile-Ser-Pro-GOln-Gln-Arg-Trp-His-Ala-. The following oligonucleo-tide probes were synthesized by the University of Californiaat San Diego Oligonucleotide Synthesis Facility and end-labeled (11): 17A, d(GTYGTYGCNACCGTRCG); 17B,d(GTYGTYTCYACCGTRCG); 14C, d(GTRCTYTCRTAC-CT); and 14D, d(GTRCTYAGNTACCT). Probes 17A and Bcorrespond to residues 28-33 in rat MGP, and probes 14A andB correspond to residues 4-8. Y refers to C and T, R to G andA, and N to all four bases, each at equimolar levels. CloneXMGP-6 was purified based on its hybridization with 17B and14C. The hybridization of these probes rather than 17A and14D is in agreement with the final cDNA sequence (Fig. 3).RNA Blot Hybridization. RNA was' isolated from rat

calvaria and from ROS 17/2 cells treated for 6 days with1,25-(OH)2D3 at 1 ng/ml (7) by the guanidinium isothio-cyanate/CsCl method (8, 11). Poly(A)V RNA selection,deadenylylation of mRNA, and blot hybridization ofelectrophoretically fractionated RNA were carried out asdescribed (8, 11, 12).

RESULTSIsolation of cDNA Clones by Antibody Screening. The initial

screening of 5 x 105 recombinants from our ROS 17/2 cDNAlibrary with rat MGP antibodies yielded 3 positive plaques.After plaque purification, the clone containing the largestinsert, designated XMGP-1, was excised with restriction en-donuclease EcoRI, subcloned into pUC8, and sequenced. Tobetter establish the sequence in the middle of this insert,MGP-1 was also cleaved with Stu I and the two fragmentswere subcloned into pUC8 and sequenced. The 495-base-pairinsert of XMGP-1 contains one long open reading framecoding for a polypeptide that exactly corresponds to the 33N-terminal residues of rat bone MGP, which had beendetermined by amino acid sequence analysis (see Experimen-tal Procedures) and matches the amino acid sequence ofMGP purified from bovine bone (2) in 66 out of 79 positions.The open reading frame extends to the 5' end withoutrevealing any ATG triplet that might act as a translationalstart site. In the 3' direction, the open reading frame containsa TAA termination codon followed by 197 nucleotides.

Analysis ofMGPmRNA. In order to estimate the size of thefull-length MGP mRNA, RNAs from rat calvaria (Fig. 1, laneA) and ROS 17/2 cells treated for 6 days with 1,25-(OH)2D3(lane B) were analyzed by gel electrophoresis followed byblot hybridization with 32P-labeled MGP-1 DNA. The proberecognized a single broad band at 660-740 nucleotides in bothRNA samples, indicating that the MGP cDNA we had clonedfrom ROS 17/2 was indeed representative of MGP messagein normal bone but was not full-length. Analysis of deade-nylylated ROS 17/2 RNA yielded a single discrete band at570-580 nucleotides (Fig. 1, lane C). No hybridization wasdetected to RNA from ROS 17/2 cells that had not beentreated with 1,25-(OH)2D3 (data not shown), a finding con-sistent with the absolute dependence of MGP synthesis inthese cells on treatment with 1,25-(OH)2D3 for at least 48 hr(7).

Nucleotide Sequence of Complete MGP cDNA. Since noneof the original 3 clones contained a full-length cDNA insert,32P-labeled MGP-1 DNA was used to identify additional MGPclones in the library by plaque hybridization. This screeningyielded 11 positive clones, none of which proved longer thanMGP-1. Most inserts were subcloned into pUC8 and se-quenced; one of these, designated XMGP10, is shown in Fig.2. Independent screening of 3 x 105 recombinants from ourROS 17/2 cDNA library with two oligonucleotide probes

J

14

C_Uri

A B C

4.4 -

2.4 -

1.4-

FIG. 1. Blot hybridization analysis of MGP mRNA. All mRNAsamples were purified by a single round of oligo(dT)-cellulosechromatography. Lanes: A, 10 /ig of rat calvarial mRNA; B, 2 /Ag ofmRNA from ROS 17/2 cells treated for 6 days with 1,25-(OH)2D3 at1 ng/ml; C, 2 tig of deadenylylated ROS 17/2 mRNA [same1,25-(OH)2D3 treatment as lane B]. A 96-hr autoradiographic expo-sure was used for all lanes. Positions and sizes [in kilobases (kb)] ofRNA standards are indicated.

based on the N-terminal sequence of rat MGP identified 1plaque positive for both probes; this plaque also hybridizedwith 32P-labeled MGP-1. The insert from this clone, desig-nated XMOP-6, was excised with EcoRI, subcloned intopUC8, and sequenced. The cDNA inserts whose sequenceswere used to obtain the cDNA structure of MGP arediagrammed in Fig. 2. The sequences of all other clonesexamined agreed with this structure.

Overlapping sequences were consolidated into the 521-nucleotide' sequence shown in Fig. 3. The first ATG triplet(nucleotide positions 1-3) in the longest open reading framehas been designated the translational start site because thesequence surrounding this triplet matches the proposedconsensus sequence for initiation of eukaryotic translation at6 of 7 bases (13). The C-terminal region of the 103-residue-predicted polypeptide (corresponding to nucleotide positions58-309) has 83% sequence identity with the amino acid

Stul

mRNA 5'- 1pre MGP

3'

XMQP-1 I

XMGP-6|

XMGP-10O

100 bp

FIG. 2. Strategy for determining the nucleotide sequence of MGPcDNA. Inserts of the Xgt11 cDNA clones that were sequenced aredepicted by heavy lines. Horizontal arrows show the direction andextent of sequencing. Vertical arrows indicate Stu I restriction site.bp, Base pairs.

8336 Biochemistry: Price et al.

Dow

nloa

ded

by g

uest

on

May

19,

202

1

Page 3: Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

Proc. Natl. Acad. Sci. USA 84 (1987) 8337

ACACCCGAGACATC

-110 -1 1

Met Lys Ser Leu Leu Pro Leu Ala lle Leu Ala Ala Leu Ala Val Ala Ala Leu Cys Tyr Glu Ser His GluATG AAG AGC CTG CTC CCT CTG GCC ATC CTG GCT GCG CTG GCC GTG GCA GCC CTG TGC TAT GAA TCT CAC GAA

20 40 60

+10 +20Ser Met Glu Ser Tyr Glu Val Ser Pro Phe Thr Asn Arg Arg Asn Ala Asn Thr Phe lle Ser Pro GIn GinAGC ATG GAA TCC TAT GAA GTC AGT CCC TTC ACC AAC CGG AGA AAT GCC AAC ACC TTT ATA TCC CCT CAG CAG

80 100 120 140

*30 +40 +50Arg Trp His Ala Lys Ala GIn Glu Arg Val Arg Glu Leu Asn Lys Pro Ala GIn Glu lie Asn Arg Glu AlaAGA TGG CAC GCT AAA GCC CAG GAA AGA GTC CGG GAA CTC AAC AAG CCT GCC CAG GAG ATC AAC AGG GAG GCC

160 180 200

-60 +70Cys Asp Asp Tyr Lys Leu Cys Glu Arg Tyr Ala Leu lle Tyr Gly Tyr Asn Ala Ala Tyr Asn Arg Tyr PheTGT GAT GAC TAC AAG CTG TGT GAG CGC TAC GCC CTG ATC TAC GGG TAC AAC GCC GCC TAC AAC CGC TAC TTC

220 240 260 280

+80Arg GIn Arg Arg Gly Ala Lys StopAGG CAG CGC CGA GGA GCC AAA TAA GAGCGCAAGGAAACAGTCGCTTTGTTACGGAGTCTGGTTTCATATTCCCCTGCAGT

300 320 340 360

AGCATTACCGATACACAGATTCGTGTTCGTTGCTTGTTCCTCACATGATCTCCTGTCGGCTGGGCCACTCCTTCCTTCCGCGGG380 400 420 440

TTGAAAGTAATGAAAGAGCAGTATTAAGAAATGTGTTTATATATAATAAAATTCTGG460 480 500

FIG. 3. Complete nucleotide sequence ofMGPcDNA and sequence of its predicted polypeptide. The nucleotide sequence is numbered beloweach line, with position 1 corresponding to the start of the protein-coding region. The amino acid sequence is numbered above each line, withposition 1 corresponding to the N-terminal residue of the protein isolated from bone. The polyadenylylation signal is underlined.

sequence of bovine MGP (2) and, in the first 33 amino acidresidues of this region, is identical to the N-terminal sequenceof MGP purified from rat bone. The coding region is termi-nated by a single TAA triplet (positions 310-312). AnAATAAA polyadenylylation signal (14) is found beginning atposition 497.

DISCUSSIONProcessing of the Predicted Polypeptide. An unusual feature

of the 103-residue primary translation product predicted fromthe MGP cDNA is the absence of a propeptide. To ourknowledge, MGP is the only known example of a vitaminK-dependent protein that lacks a propeptide. The first ATGtriplet in MGP is followed by codons for lysine, serine, anda stretch of 16 hydrophobic residues that presumably formthe transmembrane core of a signal peptide (15). Signal-

peptide cleavage is predicted to occur after the cysteineresidue at position -1 (16), and tyrosine at position + 1 is thefirst residue in the N-terminal sequence of mature rat andbovine MGP (Fig. 4).The primary translation product predicts an 84-residue rat

MGP that has an additional 5 amino acids at the C terminusthat are not present in the MGP previously isolated frombovine bone and sequenced (Fig. 4). The first two residues ofthis C-terminal pentapeptide are Arg-Arg, a dibasic sequencethat is often a site of proteolytic cleavage in the processing ofproteins (17). Although it is not known whether proteolyticcleavage to remove the C-terminal pentapeptide precedessecretion from the cell, a closely spaced doublet has beennoted in the protein blot analysis of MGP in guanidineextracts of bone and cartilage from rat and calf (3). If thisdoublet represents the 84-residue protein predicted from the

1 5 10 15 20Rat: Tyr Gla Ser His Glu Ser Met Glu Ser Tyr Glu Val Ser Pro Phe|Thr Asn Arg Arg Asn Ala Asn

Bovine: Tyr Gla Ser His Glu Ser Leu Glu Ser Tyr Glu lie Asn Pro Phe| lie Asn Arg Arg Asn Ala Asn

25 30 35 40Rat: Thr Phe lle Ser Pro GIn GIn Arg Trp His Ala Lys Ala GIn Gla Arg Val Arg Gla Leu Asn Lys

Bovine: Ser Phe lie Ser Pro GIn Gin Arg Trp Arg Ala Lys Ala Gin Gla Arg lie Arg Gla Leu Asn Lys

45 50 55 60 65Rat: Pro Ala Gin a lie Asn Arg Gla Ala Cys Asp Asp Tyr Lys Leu Cys Glu Arg Tyr Ala Leu lle

Bovine. Pro GIn Tyr [la Leu Asn Arg Gla Ala Cys Asp Asp Phe Lys Leu Cys Glu Arg Tyr Ala Met Val

70 75 80Rat: Tyr Gly Tyr Asn Ala Ala Tyr Asn Arg Tyr Phe Arg Gin Arg Arg Gly Ala Lys

Bovine. Tyr Gly Tyr Asn Ala Ala Tyr Asp Arg Tyr Phe Arg Gin

FIG. 4. Comparison of bovine and rat MGP sequences. The bovine MGP sequence was determined by direct amino acid sequence analysis;the position ofeach y-carboxyglutamic (Gla) residue in this sequence is therefore known (2). The rat MGP sequence was deduced from the cDNAsequence and the position of the 5 Gla residues in rat MGP is assumed to be the same as in the bovine protein.

Biochemistry: Price et al.

Dow

nloa

ded

by g

uest

on

May

19,

202

1

Page 4: Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

8338 Biochemistry: Price et al.

cDNA and the 79-residue protein sequenced from bone, thetwo proteins must be present at comparable levels in theextracellular matrix and proteolytic processing probablyoccurs after secretion. It will clearly be of great interest toknow the function of proteolytic processing at the C terminusof MGP and, in particular, whether the extraordinary water-insolubility of the 79-residue MGP isolated from bone, anunusual feature of the protein given its high percentage ofhydrophilic amino acids and small size (1-4), is also aproperty of the 84-residue protein.Mechanisms by Which Substrates Are Recognized by the

y-Carboxylase. Previous work showed that the propeptide ofBGP is homologous to the propeptide of other vitaminK-dependent proteins (8), and it was postulated that thehomologous propeptide domain is a recognition site for they-carboxylase (8, 18). Although MGP has no propeptide,there is a region of the mature protein, residues +15 to +30in Fig. 4, that is homologous to the propeptide of othervitamin K-dependent proteins. The absence of a propeptidein MGP demonstrates that the y-carboxylation and secretionof a vitamin K-dependent protein need not be linked to thepresence of a propeptide or to its proteolytic removal.

If residues 15-30 in MGP correspond to the propeptidecomponent of the binding site for the y-carboxylase, MGP isthe first example of a vitamin K-dependent protein in whicha glutamic residue on the N-terminal side of this y-carbox-ylase binding site is known to be y-carboxylated. PreviousN-terminal sequence analysis of bovine MGP revealed ap-proximately 80% ey-carboxyglutamic acid and 20% glutamicacid at position 2 (2). The other three glutamic residues on theN-terminal side of this y-carboxylase binding site were not'y-carboxylated. In the course of the present studies, wecarried out N-terminal sequence analysis of rat MGP. Theresults support the presence of partial y-carboxylation atposition 2 and no y-carboxylation at positions 5, 8, and 11.Based on extrapolated repetitive yields in the N-terminalsequence of rat MGP, position 2 is 20% glutamic acid and 80%

Proc. Natl. Acad. Sci. USA 84 (1987)

y-carboxyglutamic acid. The consistent finding of a y-carboxyglutamic residue on the N-terminal side of the y-carboxylase binding site demonstrates that glutamic residueson either side of the binding site have access to the active siteof the y-carboxylase. It follows from these observations thatthe y-carboxylase binding site in as-yet-uncharacterizedvitamin K-dependent proteins can be anywhere in the matureprotein and that y-carboxylation of glutamic residues canoccur to either side of the propeptide component of thebinding site.A previously unrecognized invariant structure in the y-

carboxyglutamic acid-containing region of vitamin K-depen-dent vertebrate proteins is the sequence Glu-Xaa-Xaa-Xaa-Glu-Xaa-Cys (EXXXEXC, positions 17-23, Fig. 5). Theseparation between the first glutamic residue in this invariantstructure (at +17) and the previously recognized invariantarginine in the homologous propeptide domain (at -1) is15-17 residues in all cases except mouse BGP, where theseparation is 12 residues. If the sequences of the homologouspropeptide domain and the y-carboxyglutamic acid-contain-ing region ofknown vitamin K-dependent vertebrate proteinsare compared (-16 to +30, Fig. 5), the two vitamin K-dependent bone proteins, BGP and MGP, are each as closelyrelated to coagulation proteins as to each other. There are 2additional residues that are invariant in MGP and in allcoagulation proteins but not in BGP (alanine at -10 andglutamic acid at +30), and 1 additional residue that isinvariant in BGP and in all coagulation proteins but not inMGP (leucine at +6). There are, for comparison, only 2residues that are invariant in BGP and MGP but are not foundin coagulation proteins (arginine at +20 and cysteine at +29).These sequence relationships are consistent with the hypoth-esis that the propeptide and y-carboxyglutamic acid domainunits of known vitamin K-dependent proteins evolved froma common ancestor by gene duplication and subsequentdivergent evolution and indicate that the two bone proteins

-10

hBGP K A F

rBGP K A F

mBGP K A F

bMGP N P F

rMGP S P F

hPT HV F

bPT H VF

hFX SL F

bFX SV F

hFIX TV F

hFVII R V F

hPS AN L

bPS AN F

hPC. SV F

bPC SV F

bPZ

V S K Q E

M S K Q E

G S E V

G S K V

V K R P R

V N R L R

M S K Q E G N K V V N R L R

IN RR NAN S F SP QQ

TN R R N A N T F S P Q Q

LA P Q Q A R S L L Q R V R

LA H Q Q A S S L L Q R A R

IR R E Q A N N L A R V T

LP R DQ A H R V L Q R A R

LD H E N A N K L N R P K

VT Q E E A H G V L H R R R

L S K Q Q A S Q V L V R K R

LS R Q H A S Q V L R R R

SS S E R A H Q V L R IR K

SS S Q R A H Q V L R IR K

-1 1

[R IY L Y

R

R

R

R

R

R

R

R

R

R

R

R

R

R

10

OW L G A P V P Y P D P

lY L N N G L G A P A P Y P D P

lY L G A S V P S P D P

WR A K AQ E R I R E L N K P Q

L

L

L

YI

E

20

P R R

P H R

P T R

L N R

30

E

WH A K A Q ER V R E L N K PA QIEI N R E

lA N T F L E E V R K G N L E RIEIC V E* V R KGlA N K G F L E EVRKGNLER E C L EI I

*I~~~~~~~lA N S F L E E M K K G H L E

1A N S F L E E V K Q G N L E

R

R

lY N S G K L E E F V 0 G N L E R

1A N A F L E E L R P G S L E R**ANSLKQGNLERlA N S L L E ETKQGNLER

E

E

E

IA N T L L E E T K K G N L E R E

lA N S F L E E L R H S S L E RIE

lA N S F L E E L R P G N VE RIE

lA G S Y L L E E L F E G H L E K E

CM E

C L E

C M E

C KE

C E

C IE

CI E

C S E

CW E

E

E

E

E

E

E

El

V

0

A

A

T

P

T

A

K

0

L

L

V

C

C

C

-CC~

C

C

C

C

C

C

C

C

C

~Vn L N P D C D

EL N P N C D

E L N P A C D

D DF K L C E

D DY K L C E

S Y EEA FE

S RE E A F E

S Y E EA R E

SF E EAR E

S F E EAR E

SN E EAR E

N E E A R E

N E E AR E

D F EEA KE

E F EEA RE

V Y EEA RE

FIG. 5. Conserved amino acids in the propeptide and y-carboxyglutamic acid-containing domains of known vertebrate vitamin K-dependentproteins. Amino acid sequence positions are numbered so that 1 corresponds to the first residue of mature BGP. All other sequences have beenaligned to give maximum homology. Arrows preceding residue 1 indicate the site of propeptide cleavage; the absence of an arrow in the MGPstructure is due to the absence of a propeptide in this protein. Sequences are given in standard one-letter symbols [E represents y-carboxyglutamic acid (Gla)]. Sequences (from top to bottom): hBGP, human BGP (19); rBGP, rat BGP (8); mBGP, mouse BGP (19); bMGP,bovine MGP (2); rMGP, rat MGP (Fig. 3); hPT, human prothrombin (20); bPT, bovine prothrombin (21); hFX, human factor X (22); bFX, bovinefactor X (23); hFIX, human factor IX (24); hFVII, human factor VII (25); hPS, human protein S (26); bPS, bovine protein S (27); hPC, humanprotein C (28); bPC, bovine protein C (29); bPZ, bovine protein Z (30).

Dow

nloa

ded

by g

uest

on

May

19,

202

1

Page 5: Molecularcloning Gla Implications substrate K-dependent · ABSTRACT Matrix Glaprotein (MGP),a low molecular weightproteinfoundin bone,dentin, andcartilage, contains5 residues of the

Proc. Natl. Acad. Sci. USA 84 (1987) 8339

probably diverged from this common ancestor at about thesame time as coagulation proteins.There is evidence that the homologous propeptide and

invariant EXXXEXC structures of vitamin K-dependentproteins both play a role in substrate recognition by they-carboxylase. Site-directed mutagenesis studies (31)showed that two conserved amino acids in the propeptideregion of factor IX (phenylalanine at -16 and alanine at -10;Fig. 5) are indeed critical to recognition by the y-carboxylase.Although the importance of the EXXXEXC unit has not yetbeen tested by site-directed mutagenesis, there is indirectevidence to support a role for this unit in y-carboxylasebinding to substrate. A variety of peptides have been syn-thesized that correspond to the sequences of vitamin K-dependent proteins in y-carboxyglutamic acid-containingregions but that lack the invariant EXXXEXC unit (32). Noneof these peptide substrates proved to have Km values lowerthan a few millimolar (32-34). In contrast, both intact BGPand the peptide corresponding to residues 13-29 in prothrom-bin proved to be excellent y-carboxylase substrates (33, 34)after y-carboxyglutamic residues were converted to glutamicresidues by decarboxylation (35). For both decarboxylatedpolypeptides, Km values are 2-3 orders of magnitude lowerthan for the best known synthetic peptide substrate, Phe-Leu-Glu-Glu-Leu (33, 34). Since both decarboxylated BGPand decarboxylated prothrombin-(13-29)-peptide lack thehomologous propeptide domain yet retain the EXXXEXCinvariant structure, the excellent substrate activities of thesedecarboxylated polypeptides is strong evidence for a role ofthe EXXXEXC structure in y-carboxylase binding to sub-strate.

We thank Matthew K. Williamson for carrying out the N-terminalsequence analysis of rat MGP. This work was supported by U.S.Public Health Service Grant AM25921.

1. Price, P. A., Urist, M. R. & Otawara, Y. (1983) Biochem.Biophys. Res. Commun. 117, 765-771.

2. Price, P. A. & Williamson, M. K. (1985) J. Biol. Chem. 260,14971-14975.

3. Hale, J. E., Fraser, J. D. & Price, P. A. (1987) J. Bone Miner.Res. 2, Suppl. 1, A117.

4. Otawara, Y. & Price, P. A. (1986) J. Biol. Chem. 261, 10828-10832.

5. Price, P. A., Williamson, M. K., Haba, T., Dell, R. B. & Jee,W. S. S. (1982) Proc. Natil. Acad. Sci. USA 79, 7734-7738.

6. Fraser, J. D., Otawara, Y. & Price, P. A. (1987) J. Biol.Chem., in press.

7. Fraser, J. D. & Price, P. A. (1987) J. Bone Miner. Res. 2,Suppl. 1, A21.

8. Pan, L. C. & Price, P. A. (1985) Proc. Natil. Acad. Sci. USA82, 6109-6113.

9. Chen, E. Y. & Seeburg, P. (1985) DNA 4, 165-170.

10. Benton, W. D. & Davis, R. W. (1977) Science 196, 180-182.11. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular

Cloning: A Laboratory Manual (Cold Spring Harbor Labora-tory, Cold Spring Harbor, NY), pp. 203-206.

12. Vournakis, J. N., Efstratiadis, A. & Kafatos, F. C. (1975)Proc. Natl. Acad. Sci. USA 72, 2959-2963.

13. Kozak, M. (1986) Cell 44, 283-292.14. Proudfoot, N. J. & Brownlee, G. G. (1976) Nature (London)

263, 211-214.15. Blobel, G., Walter, P., Chang, C. N., Goldman, B. M.,

Erickson, A. H. & Lingappa, R. (1979) in Secretory Mecha-nisms, Society for Experimental Biology Symposium, eds.Hopkins, C. R. & Duncan, C. J. (Cambridge Univ. Press,Cambridge, U.K.), Vol. 33, pp. 9-36.

16. Perlman, D. & Halvorson, H. 0. (1983) J. Mol. Biol. 167,391-409.

17. Lazure, C., Seidah, N. G., Pelaprat, D. & Chrefien, M. (1983)Can. J. Biochem. Cell Biol. 61, 501-515.

18. Pan, L. C., Williamson, M. K. & Price, P. A. (1985) J. Biol.Chem. 260, 13398-13401.

19. Celeste, A. J., Rosen, V., Buecker, J. L., Kriz, R., Wang,E. A. & Wozney, J. M. (1986) EMBO J. 5, 1885-1890.

20. Degen, S. J. F., MacGillivray, R. T. A. & Davie, E. W. (1983)Biochemistry 22, 2087-2097.

21. MacGillivray, R. T. A. & Davie, E. W. (1984) Biochemistry23, 1626-1634.

22. Fung, M. R., Hay, C. W. & MacGillivray, R. T. A. (1985)Proc. Natl. Acad. Sci. USA 82, 3591-3595.

23. Fung, M. R., Campbell, R. M. & MacGillivray, R. T. A.(1984) Nucleic Acids Res. 12, 4481-4492.

24. Kurachi, K. & Davie, E. W. (1982) Proc. Natl. Acad. Sci.USA 79, 6461-6464.

25. Hagen, F. S., Gray, C. L., O'Hara, P., Grant, F. J., Saari,G. C., Woodbury, R. G., Hart, C. E., Insley, M., Kisiel, W.,Kurachi, K. & Davie, E. W. (1986) Proc. Natl. Acad. Sci.USA 83, 2412-2416.

26. Hoskins, J., Norman, D. K., Beckmann, R. J. & Long, G. L.(1987) Proc. Natl. Acad. Sci. USA 84, 349-353.

27. Dahlback, B., Lundwall, A. & Stenflo, J. (1986) Proc. Natl.Acad. Sci. USA 83, 4199-4203.

28. Foster, D. C., Yoshitake, S. & Davie, E. W. (1985) Proc.Natl. Acad. Sci. USA 82, 4673-4677.

29. Long, G. L., Belagaje, R. M. & MacGillivray, R. T. A. (1984)Proc. Natl. Acad. Sci. USA 81, 5653-5656.

30. Hojrup, P., Jensen, M. S. & Petersen, T. E. (1985) FEBS Lett.184, 333-338.

31. Jorgensen, M. J., Cantor, A. B., Furie, B. C., Brown, C. L.,Shoemaker, C. B. & Furie, B. (1987) Cell 48, 185-191.

32. Rich, D. H., Lehrman, S. R., Kawai, M., Goodman, H. L. &Suttie, J. W. (1981) J. Med. Chem. 24, 706-711.

33. DeBoer-van den Berg, M. A. G., Ulrich, M. M. W., Hemker,H. C., Soute, B. A. M. & Vermeer, C. (1985) Biochim. Bio-phys. Acta 831, 94-98.

34. Ulrich, M. M. W., Soute, B. A. M., DeBoer-van den Berg,M. A. G. & Vermeer, C. (1985) Biochim. Biophys. Acta 830,105-108.

35. Poser, J. W. & Price, P. A. (1979) J. Biol. Chem. 254, 431-436.

Biochemistry: Price et al.

Dow

nloa

ded

by g

uest

on

May

19,

202

1