Upload
sote
View
0
Download
0
Embed Size (px)
Citation preview
Evolution of the thyrotropin receptor: a G protein coupledreceptor with an intrinsic capacity to dimerize
Vikt�ooria Kaczur,a,b,1 L�aaszl�oo G. Pusk�aas,b,c M�aaria Tak�aacs,d,2 Istv�aan Andr�aas R�aacz,e
Andrea Szendr}ooi,d,3 S�aara T�ooth,d Zsuzsanna Nagy,d Csaba Szalai,f Csaba Bal�aazs,a
Andr�aas Falus,d Bjarne Knudsen,g and Nadir R. Faridb,*
a Ken�eezy Teaching Hospital of Debrecen, Laboratory for Endocrinology and Human Genetics and Third Department of Internal Medicine,Debrecen, Hungary
b Osancor Bioinformatics Ltd, Watford, Herts, UKc Laboratory of Functional Genomics, Biological Research Center of the Hungarian Academy of Sciences, Szeged, Hungary
d Institute of Genetics, Cell, and Immunbiology, Semmelweis University Medical School, Budapest, Hungarye Department of Evolutionary Zoology and Human Biology, University of Debrecen, Debrecen, Hungary
f Central Laboratory, Heim Pal Hospital of Pediatrics, Budapest, Hungaryg Bioinformatics Research Center, Univeristy of Aarhus, Arhus, Denmark
Received 4 February 2003; accepted 10 February 2003
Abstract
The rapidly escalating number of genome sequences has emphasized the basic tenants of the schema of life. By the same token
comparisons according to specialized function or niche within nature expose genomic strategies to optimize the use of resources and
ensure biological success. Increasing complexity may result from diversification, shuffling, and re-arrangement of an otherwise
limited functional genomic complement. To further test the concept of relative structural plasticity of the TSH receptor we se-
quenced the TSHR gene of two Old World monkey speciesMacaca mulatta and Cercopithecus aethiops, evolutionary removed from
Homo sapiens by >20Myr. Both genes encoded a protein of 764 residues. This structure was 99% homologous between the two
species of Old World monkeys while C. aethiops was 97% and M. mulatta was 96% homologous to H. sapiens. TSHR sequence
comparisons were sought for an additional eight mammals as well as four (two Salmon, Tilapia, and Sea Bass) from teleosts. The
amino-acid sequences of the 14 TSH receptors were similar. The most variable sequences were those of the intracellular tail and the
distal cysteine-rich C-terminus flanking region of the ectodomain, whereas the trans-membrane domain was most preserved. Some
sequences were decidedly H. sapiens specific, while others were primate specific or showed the changes expected of evolutionary
descent. Others, however, exhibited ‘‘cross-species polymorphism,’’ sometimes at quite remarkable evolutionary distances. As op-
posed to H. sapiens the sequence differences may have subtle influences on TSHR function or may affect long-range compensation
for radical changes in adducts. The two Old World monkeys share with other lower mammals the absence of a glycosylation site at
113–115. Sea Bass and Tilapia have four glycosylation sites, whereas the two salmon receptors have only three. Changes in some
critical residues raise questions about variation in function: thus S281 is conserved in all mammals and an important determinant of
negative agonist function of TSHR is replaced by R in Sea Bass. Likewise the K183, found at an important transitional region at
LRR 6 conserved in all mammals, is represented by M in fish and may contribute to TSHR lutenization in fish. There is no evidence
that evolutionary changes in primate receptors are more rapid than that in other mammals and the separation times of different
mammals based on silent nucleotide changes of TSHR are closely parallel to archaeological estimates. Results of correlated mu-
tation analysis, referenced to the rhodopsin crystal structure, affirms dimerization of TSHR transmembrane helices. In addition, it
suggests the involvement of critical lipid-facing residues in the helices in receptor dimerization and oligomerization. We highlight the
Molecular Genetics and Metabolism 78 (2003) 275–290
www.elsevier.com/locate/ymgme
*Corresponding author.
E-mail address: [email protected] (N.R. Farid).1 Present address: Department of Biophysics and Cell Biology, Medical University of Debrecen.2 Present address: Faculty of Veterinary Science, Szent Istvan University, Budapest.3 Present address: Division of Virology, Department of Microbiology and Infectious Disease, National Center for Epidemiology Budapest,
Hungary.
1096-7192/03/$ - see front matter � 2003 Elsevier Science (USA). All rights reserved.
doi:10.1016/S1096-7192(03)00036-2
value of evolutionary informatics and set the stage for dissecting out potential subtle differences in TSHR function associated with
structural variations.
� 2003 Elsevier Science (USA). All rights reserved.
Keywords: TSH receptor; Macaca mulatta; Cercopithecus aethiops aethips; Evolution; Mammals; Teleosts; Dimerization
Introduction
The family of leucine-rich repeat containing G pro-
tein-coupled receptors (LGRs) has recently been ex-
panded by the addition of homologs of as yet unknown
function in mammals and orthologs from lower lifeforms (see [1] for review). This knowledge has provided
an unusual opportunity to pose questions about the
structure, function, and evolutionary history of the
LGRs. These receptors have arisen as a result of
the conjunction of two successful protein–protein in-
teraction and signalling modules, namely leucine-rich
repeats (LRR) and (a bared down) prototypic 7 trans-
membrane G protein-coupled signalling unit (which wehave dubbed signalon). These two modular systems are
bridged by the cysteine-rich C-flanking region (CFR).
The CFRs in most LGRs are anchored by double cy-
steines in their N-termini and PCED (Pro–Cys–Glu–
Asp) motif towards their C-termini. The sequences in
between vary in length and composition among LGRs.
The phylogeny of the CFR suggests that the insertion
herein, considered characteristic of TSHR, to have avery deep evolutionary history is not recent innovation
in TSHR. Indeed, we have recently contended that
remnants of such an insertion may be discerned in LHR
[2].
We report on the sequences of TSHRs from two Old
World monkey tribes to try and span the evolutionary
gap between Homo sapiens and canines, the next highest
order of mammals whose sequence is available to date.Arguably mammalian evolution is no more remarkable
than any other branches of life. The transition from
higher apes to humanoids is arresting not only because
of anthropocentric bias but because of the birth of
cognition, speech, and social bonding in H. sapiens. It is
evident that only a limited number of genes are involved
in this evolutionary transition [3]. It has been suggested
that a post-translational modification of some moleculesarising as a result of a human-specific mutation in a
transferase may be a crucial step in this dramatic tran-
sition [3]. While the Cercopithecidae, we have studied do
not stand at the cusp of this transition, we reasoned that
they would provide an important bridge between human
and lower mammals in evolution of the TSHR. At the
other end of the evolutionary scale there are now avail-
able TSHR sequences from four teleosts, allowing us tostudy the evolutionary history and diversification of the
structure to possibly account for new or additional
function for TSHR in alternate life forms. Even within
mammalia the pleomorphism of growth and prolifera-
tive capacity of thyroid cells from different species has
long been the subject of heated debate [4]. Given on the
one hand the central role of TSH signalling to thyrocyte
physiology and on the other the potential of signallingthrough the receptor by a multitude of G-proteins as
well as other mediators [4–6] raise the possibility that
these variations may be intrinsic to species-specific dif-
ferences in TSHR structure between species.
Materials and methods
RNA purification and cDNA synthesis
Total thyroid RNA was extracted using NucleoSpin
RNA II extraction kit (Macherey-Nagel, D€uuren, Ger-many) according to manufacturer�s instructions. For
cDNA synthesis, 10 lg of total RNA was mixed with
1.5 lg of a HPLC-purified anchored oligo(dT) primer
(50-T25(ACG)-30), and a TSHR gene specific reverseprimer (hTSHR gene nucleotides: 2339–2315; 50-TTATTTTGTAAGTTCCCCTACCATT-30) 40U RNasin
(Fermentas, Vilnius, Lithuania) in a total volume of
14 ll, heated to 75 �C for 5min, and cooled on ice. To
this mixture 4 ll of 5� first strand buffer (Promega,
Madison, WI), 1 ll of 10mM dNTP mix, and 1 ll of200U RNaseH ()) point mutant M-MLV reverse
transcriptase (Promega, Madison, WI) were added in20 ll total volume. Samples were incubated at 42 �C for
2 h. One microliter of cDNA synthesis mix was applied
for PCR.
PCR amplifications, cloning, and sequence determination
Amplification was done under the following condi-
tions: initial denaturation at 95 �C for 1min; 32 cycles ofdenaturation at 95 �C for 35 s, annealing at 58 �C for
40 s, and extension at 72 �C for 1min. The reaction mix
contains 50 nM final concentrations of each primers, 1�KlenTaq PCR buffer (Sigma, Mannheim, Germany),
4U KlenTaq LA DNA polymerase mix (Sigma), and
200 lM dNTP. The primers shown in Table 1 were
utilized in various combinations to generate a series of
overlapping fragments that encompassed the full-lengthTSHR cDNA: 1a and 1b to generate a 1079 nt fragment;
5a and 5b to amplify a 1342 nt long; 3a and 3b to am-
276 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
plify a 422 nt long fragment; 3a and 7b to produce a
796 nt long fragment; and 6a and 2b to amplify a 1030 nt
long fragment.
PCR products were purified with PCR purification kit
(Macherey-Nagel, D€uuren, Germany) according to man-ufacturer�s specifications and cloned with blunt-end li-
gation into pBluescript SK-II ()) plasmid (New England
Biolabs, Hertfordshire, England) that was previouslydigested with EcoRV restriction enzyme (Fermentas,
Vilnius, Lithuania) using standard protocols [7]. Each
amplification was performed in duplicate and cloned
separately. Plasmid midi preparations were done on
Qiagen-tip 100 columns (Qiagen, Hilden, Germany).
Both strands of two positive clones from each cloning
step were sequenced by dye-terminator method with
vector-specific primers (T7 and T3 primers) on an au-tomated sequencer (ABI, Applied Biosystems, Foster
City, CA).
Multiple sequence alignments were performed using
CLUSTAL W (version 1.81), secondary protein struc-
ture prediction using GOR4, and sequence similarity
using Blast2.0 all at SWISS-PORT (http://www.ch.
embnet.org). Mammalian TSH receptor sequences were
submitted to evolutionary rate shift analysis. The evo-lutionary rate shift analysis is based on likelihood ratio
testing and assumes that changes occur at a specific
point in evolution and that these changes are indepen-
dent. The changes are not limited to variable versus
invariable sequences but involves shift between any two
rates [8].
Moreover, mutation in mammalian TSHR predicated
on the rates of silent mutations were computed as inKumar and Subramanian [9] and used to estimate evo-
lutionary distances.
We have taken advantage of the correlated mutation
analysis (CMA) available in the GPCR (http//:
www.gpcr.org, [11]) database on 12 of the 14 receptors
reported here, incorporating the Old World monkey
sequences into it. Likewise we incorporated our data
into analysis performed by Gouldson et al. [12] based onfive TSHR sequences. These investigators embedded
their analysis of the TSHR sequences in the framework
of CMA based on a very large number of receptors from
an additional four GPCR families. Moreover, they re-
lated the placement of conserved and correlated residues
on the TM helices to those in the rhodopsin crystal
structure [13]. Where there was discrepancy in the limits
of the TM domains between our schema [14] and that on
the rhodopsin crystal, we opted for the latter.GenBank Accession Nos. of the TSH receptors stud-
ied were as follows: Rattus norvegicus TSHR M34842,
Mus musculus TSHR U2601, Ovis aries TSHR Y13434,
Bus taurus TSHR U15570, Sus scrofa TSHR AF 338249,
Felis catus TSHR AF218264, Canis familiaris TSHR
X17146, Homo sapiens TSHR M3177, Morone saxatilis
(Striped Sea Bass) TSHR AF239761, Oncorhynchus
rhodurus (Salmon) TSHR_A Q9I949, O. rhodurus TSHR_B Q9I948, Orechromis niloticus (Tilapia) Q98TF4,
Cercopithecus aethiops TSHR AY169399, and Macaca
mulatta AY 169400.
Results and discussion
Receptor divergence over 450Myr
With the description of the two Old World monkey
sequences, we now have information on 14 vertebrate
species including 4 from fish and 8 mammalians. Given
the evolutionary separation of these orders of life (400–
450Myr) it is not surprising that they segregate into two
major phylogenetic groups on the basis of protein se-
quence [15, Figs. 1 and 2]. All four fish species belong tothe ray-fin subgroup of bony fish (teleosts). Even within
this subgroup there is a remarkable degree of species
variation. The case of Tilapia is particularly intriguing.
As a Cichlid subspecies it would have been part of a
major species radiation and specialization after the
emergence from a draught bottle-neck and with the
formation of the Central African lakes [16]. Yet this
burst of speciation occurred only 750,000 years ago.The similarity in regional organization and protein
Table 1
The PCR primers used to amplify TSHR cDNAs from C. a. aethiops and M. mulatta
Primer designation TSHR sequence Primer sequence (50 ! 30) Strand
1a 882–991 TCCTTGAGTCCTTGATGTGT Sense
1b 1961–1941 TGAGAGGCTTGTTCAGAATT Antisense
3a 1505–1530 TTTGCAAGCGAGTTATCGGTGTATA Sense
3b 1927–1903 TAGAATGAGATTGGGGCCATGCATA Antisense
5a )47 to 24 TGAGGCGATTTCGGAGGATGGAGA Sense
5b 1295–1274 ATTGCCCAGGAGAGCCAGCAGA Antisense
6a )6 to 16 TGGAAAATGAGGCCGGCGGACT Sense
2b 1024–999 TTGGACTTTTCCTTGTACCCAACAAT Antisense
7b 2302–2274 ACTTACAAAAC(T/C)GTTTGC(G/C)TATACTCTT Antisense
The numbering of nucleotides is based on hTSHR coding sequence from Nagayama et al. [23]. Combinations of various primers were used to
amplify overlapping PCR fragments of the following sizes. 5a/5b: 1342 bp; 3a/3b: 422 bp; 3a/7b: 796 bp; 1a/1b: 1079 bp; and 6a/2b: 1030 bp. Both
strands were sequenced from at least two independent clones; for most three clones were sequenced.
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 277
Fig. 1. Multiple sequence alignment of TSH receptors from 14 vertebrates. Multiple sequence alignment was performed with Clustal W (1.81) with
gap open penalty of 10.0, gap extension penalty of 0.05, and the BLOSUM 30 protein weight matrix. The following designations are shown: identity
between residues (*), conservative substitutions (:), and substitutions a general category of amino-acids (.). Subsequent tables separately list residues
in these categories, as well as positions in TSHRs that show no homology. For the sake of convenience reference to TSHR residues numbering
throughout the paper is anchored on that in the majority of mammals (764 residues), except when otherwise stated. Among mammals sequence
homology with M. mulatta varied from >99% (C. aethiops) to 87.0% (Bovis). Human TSHR showed 96.9% homology. The four teleost receptors
showed �58.5% homology. Among these receptors less homology than between mammals was observed: 87.3% between the two salmon receptors,
89.5% between Sea Bass and Tilapia and �78% between salmon receptors and the two other teleost TSHRs.
278 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
sequences between Tilapia and Striped Sea Bass suggests
forces other than speciation and thus phylogenetic re-
moval were at play in generating and maintaining the
TSHR sequences. By contrast, in Salmon two TSHR
structures have been identified and that these bear se-
quence differences to suggest that they have different
functions or bind different ligands. This situation is as
yet unprecedented in other species studied. Although
TSHR is expressed in a variety of tissues and given the
variation in mediators or putative mediators is surmised
to subserve different functions [2], all those forms are
transcribed from a single TSHR gene. Variants arise as a
Fig. 1. (continued)
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 281
consequence of alternative splicing or foreshortening[17]. Whether the situation in the Salmon is unique and
thus an evolutionary dead-end experiment or an indi-
cation that as yet unidentified TSHR forms exist cannot
at this time be definitively answered. That the degree of
homology between the four fish sequences are much
lower than those between mammals may either reflect
the time of species separation or possibly a more rapid
rate of change than in mammals. Indeed, the salmonidreceptors approach a degree of homology similar to that
between humans and rodents, a separation estimated at
100Myr.
The basic organization of the TSH receptor has been
well preserved over evolution and indeed it harks back
to that of the earliest LGR forms described [1,2]. The
LRR repeats and transmembrane domains are invariant
in length. The region of most variability is the cysteinerich CFR and less so the intracellular tail. Fish receptors
are characterized by insertions in CFR, exaggerated in
the A form of the salmonid receptor whereas single
residue deletions (at separate locations) are found in cat
and bovine receptors. Fish have 8–10 residue insertions
in the intracellular tail and a longer leader peptide in
salmonid TSHR (Fig. 1).
As anticipated the four teleost sequences showgreater structural affinity to each other than they do
with mammals, particularly in the TM domain (Figs. 1
and 2). The two salmon TSHRs sequence differ princi-
pally through and insertion (residues 321–346) in the
CFR of TSHR-A receptor. Salmonids are the only
vertebrates with two TSHR to be identified to date and
both show divergence from the two other bony fish se-quences (Fig. 2).
TSH receptors of Cercopithecidae
The two Old World monkeys TSHRs were >99%
homologous to each other and 97% to that ofH. sapiens.
The two Cercopithecidae represent two distinct pri-
mate�s families and are presumed to have separatedfrom H. sapiens more than 23Myrs ago [15–18]. The
TSHR of the two Cercopithecidae species are identical
except for 17 bp at the DNA level (10 of which result in
silent mutations) and 6 positions at the protein level
(Fig. 1), and only at position 296 is there a replacement
substitution (Tables 1 and 2). The rate of silent muta-
tions allow us to calculate an evolutionary separation
time of approximately 10 million years between the twoOld World monkey families (also see below).
Macaca is an Asian branch of the Old World monkey
tribe Papioni, the majority of whose clades are found in
Africa. Cercopithecus aethiops aethips represent a dis-
tinct Cercopithecidae tribe from Papioni [18–21]. The
evolutionary separation predicated on the evolutionary
clock of TSHR sequences may not be that inconsistent
with the phylogenetic data, although some nuclear geneswithin Cercopithecidae, especially Macaca, yielded in-
congruent results with a consequent anomalous place-
ment within the Papioni [21].
Table 2
Differences in the mature TSHR sequences between humans and Old
World monkeys
Residue Cercopithecidae Homo sapiens
32 Q H
63 R H
74 S N
83 L V
87 A V
101 N S
113 S N
192 I V
235 H Q
331 D G
333 G S
346 A T
389 V I
393 N S
601a Y H
727 E D
759 T M
Only residues in which Homo differ from both Old World monkeys
are considered.a The placement of H in this position was based on the original
hTSHR sequence reported by Nagayama et al. [23]. Histidine in that
position alters the basal activity of the receptor and its coupling to Gq/
11, analysis of other clones by the same workers and the results of
other investigators affirms a conserved Y in that position in all verte-
brates.
Fig. 2. Phylogenetic relationship between the 14 TSH receptor se-
quences considered in this study. Two major families are distinguished:
teleosts and mammals and within the latter primates show up as a
distinct clade. Clustering of other mammals: rodents, ungulates and
carnivores are in keeping with speciation.
282 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
As the proline 296 was found in two unrelated indi-
vidual M. mulatta it is unlikely to be a mutation in a
particular individual. The other five residue changes
between the two Cercopithecidae are conservative and
cluster in the intracellular domain: 94 (S in aethiops, T
in mulatta), 635 (M in aethiops, I in mulatta), 731 (G in
aethiops D in mulatta), 736 (E in aethiops, Q in mul-
atta), and 741 (I in aethiops, L in mulatta). With theexception of 296 at which L is invariant in all vertebrate
TSHR, these substitutions may not necessarily affect
TSHR secondary structure or function. Residue 296
occurs in the proximal part of CFR, a domain with a
fair degree of structural flexibility. The replacement of L
with a P in this domain is not without precedent, in that
in Mus a canonical L330 is replaced by P and at 347 P is
characteristic of rodent TSHR, whereas in other verte-brates there are a variety of hydrophobic residues.
As far as the differences between human and Old
World monkey TSHR are concerned only four, includ-
ing that at position 296 were their repeat substitutions,
and only one within the mature receptor (T 760 M).
Interestingly, the substitutions in the leader peptide
(V14D), (K22M) were characteristically H. sapiens.
Some conservative or same group residues are charac-
teristically human.
Other changes in residues may reflect evolution-re-
lated changes within primates. Thus their four charac-teristic changes: 294 (T in mulatta, S in aethiops and
Homo), 645 (I in mulatta and Homo M in aethiops), 731
(D in mulatta G in the two other primates), and 736 (Q
in mulatta E in the two other primates). None of these
residues are the sites of recognized TSHR mutations
associated with change in receptor function. We cannot,
however, exclude the possibility that the changes be-
tween primate sequences may not result in subtlechanges in ligand-binding or receptor signalling [22] gi-
ven the differences in volume, potential side-chain in-
teraction, and hydrophobicity.
Table 3
Replacement substitutions of residues in the mature TSHR of 10 mammalian species
Residue Region Mull Aethio Homo Cat Dog Pig Sheep Cow Mus Rat
22 K K M K K K E E K R
23 G G G G G G R R E G
46 NCR R R R R R S R S R G
70 LLR1 H H H R R R R R L L
116 LLR3 Y Y Y Y S Y Y Y Y Y
139 LLR4 K K K G G R R R R R
169 LLR5 V V V A A A A A E E
235 LLR8 H H Q Y Y Y Y Y S S
296 CFR P L L L L L L L L L
307 CFR Q Q Q R R R R W R R
317 CFR A A A A T A A A I V
323 CFR H H H D D Y Y Y Y Y
329 CFR N N N Y Y D D del D G
330 CFR L L L L L L L La P L
334 CFR I I I H H S S Sa S H
337 CFR Y Y Y Y Y N Y Ya Y Y
346 CFR A A T T T T T Ta S G
347 CFR H H H R D H H Qa P P
360 CFR E E E del E E E Ea E E
426 TM1 L L L La L L L La L P
539 TM4 C C C Ya Y Y Y Ya Y Y
548 TM4 C C C Ca C C C Ca S S
588 TM5 T T T La L L L La L L
595 TM5 V V V Ia I T I Ia V V
617 I2 G G G Ga G G G Ga R R
660 TM7 K K K Ka K K K Ka K G
712 ICT P P P Sa S S S Sa C C
726 ICT H H H Ra R Q P Pa Q Q
738 ICT V V V Da E D D Da T T
741 ICT L L L La L L L La L P
743 ICT E E E Ea E E G Ea G G
749 ICT P P P Pa P H P Pa P P
751 ICT K K K Ka K K Q Qa L L
755 ICT I I I Ia I I T Ta I I
760 ICT T T M Na N K K Ka K T
aBecause of respective deletions (del) in residues 329 of cow and 360 of cat, all residues numerically above that position shown by the asterisk should
have one subtracted. Numbering of residues is based on 764 residues in 8 mammals including 21 in the leader peptide. NCR, N-terminus cysteine-rich
region; LLR, leucine-rich repeats; CFR, C-flanking region; I2, second intracellular loop; TM, transmembrane helix; ICT, intracellular tail.
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 283
A substitution of important functional impact is S113
N, the latter creates the NLT N-terminal glycosylation
signal, which appears to only exist in humans (see [17] for
review). This substitution can result potentially from a
single base-pair substitution (AAT ! AGT).
Variations in mammalian TSHR sequences. Any impli-
cations for function?
With the widening of sequence comparisons in all 10
available mammals separated by in excess of 100 millions
of evolution differences in sequences [9,15] would be ex-
pected. As anticipated these predominate in the leader
peptide, intracellular tail, particularly its terminal end.
Some of these changes are primate-specific, some show
evolutionary gradients, and yet some, interestingly, dis-
play across species polymorphism. Most of the latter fall
in the category of conservative or related residue substi-
tutions. Over the 10 TSHR sequences specific positionmay fall into more than one category outlined above.
Residues 2–5 of the leader peptide are instructive in
that the first four residues are identical in primates,
Table 4
Same class substitutions in mammalian TSHRa
Residue Region Mul Aethio Homo Catb Dog Sheep Cowb Pig Mus Rat
10 Leader V V V A A A A A V T
16 Leader P P P P P P P P S P
25 NCR S S S P P P P P A T
49 NCR S S S S T S S P S S
53 NCR S S S S S S S N S S
74 LLR1 S S N N N N N N S S
87 LLR2 A A V A A A A A A A
101 LLR2 N N S S S S S S S S
113 LLR3 S S N S S S S S S S
119 LLR3 P P P P P S S P P P
120 LLR3 D D D G D G G G D D
176 LLR5 C C C C C S C C C C
215 LLR7 V V V A A V V V A A
217 LLR7 D D D D D D G D D D
222 LLR7 G G G G G G A A G G
243 LLR8 S S S S S S S P S S
308 CFR S S S S S G G S N N
315 CFR V V V V V A A V V V
316 CFR N N N N N S S N N N
319 CFR N N N N N N N N R R
320 CFR S S S G G G G G G G
322 CFR L L L F F F F F I V
331 CFR D D G G G G G G G G
333 CFR G G S S S G G S N N
335 CFR V V V A A A A V V V
348 CFR N N N S S S S S S S
392 CFR D D D G G G G G D D
393 CFR N N S N N S S S N N
437 TM1 L L L I I I V V L F
447 TM2 N N N T T T T T T T
540 TM4 A A A A A V V A T T
548 TM4 C C C C C C C C S S
599 TM5 C C C S S A A S S S
615 I2 P P P T P P P P P P
708 ICT G G G G G G G G G A
713 ICT P P P P P S P P P P
716 ICT S S S S S S S S S N
718 ICT D D D G G G G G G G
725 ICT T T T T T P P T P P
744 ICT N N N N N N N N N S
752 ICT G G G S G D D D G G
762 ICT T T T T T T T P T T
763 ICT V V V V V V V V A A
aThese residues as shown as (.) in the multiple alignment printout.bAs previously subtract 1 for cat residues 360 and above and 329 and above for cow. 447 is the first residue of TM2. As an example of what
constitutes this grouping all the residues at position 10 are small; however V is aliphatic, T is polar, and A is non-polar and T is more soluble than the
other two. At position 120 both G and D are small and D but not G is charged. We suggest that these differences in the characteristics of alternative
residues may well influence receptor structure and function.
284 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
position 2 shows R ! S in pig, 3 P ! Q in cat, P ! L in
pig whereas residue 4 which is A in primates may display
a trend to evolutionary gradient in that it is G in ro-
dents, T in ruminants, pig and cat but P in dog. Position
7 shows an aspect of across species polymorphism in
that it is Q in primates, cat, pig, and rat but H in dog, R
in ruminants, and L in mouse. Position 17 is R in all
TSHRs expect ruminants and position 20 is G except forsheep, rat (W), mouse, and pig (R). Because the leader
peptide is free from functional evolutionary pressures,
variation in its sequence gives us some insight into re-
ceptor diversification, within the limits of its short
length. Interestingly, sequence alignment of only the first
60 residues of all 14 TSHR proteins result in phyloge-
netic separation identical to that using the full-length
sequences (not shown).Not surprisingly, there are relatively fewer radical
residue substitutions in the mature receptor, which we
will now consider (Tables 3–5).
Residue 46 is located in the N-terminus cysteine rich
region in a domain important for TSH binding. The
most frequent residue R is replaced by S in cow and pig
but with a relatively hydrophobic G in rat and would be
expected to have an impact on ligand binding [17].
Likewise it is conceivable that replacement of R with G
in rodent second intracellular loop may influence the
association with trimeric G-protein and thus signalling.
Many of the substitutions in LLR fall in the middle part
of a repeat and may thus be buried or constrained to
influence receptor function. It is also worthy to note that
the TM4 is the least mobile of the seven helices and thusthe changes in two cysteines between species is likely to
have less of an effect than if it had occurred elsewhere
[15,24]. All these conjectures can, however, only be tes-
ted by head to head comparisons of homologous and
heterologous TSHR on a slate of expressed recombinant
receptors in cell systems which allow for desensitization,
internalization, and re-cycling.
The sequence flexibility in mammalian TSHR ofcysteine-rich CFR of the ectodomain and the cytoplas-
mic tail reflects that in vertebrates in general. That CFR
represents a bridge between a protein/protein interaction
module and the signalon and that it is cleaved off, at
least in the human receptor (and possibly its dispens-
ability after ligand binding), may justify the marked
structural flexibility. The case of the intracellular
Table 5
Conservative (:) residue substitutions in selected TSH R regionsa
Residue Region Mull Aeth Homo Catb Dog Sheep Cowb Pig Mus Rat
32 NCR Q Q H H R R R H H H
43 NCR D D D D D D D D E E
44 NCR Q Q Q H H Q Q H H H
415 TM1 K K K K K K K R R K
427 TM1 L L L L L L L L L M
433 TM1 I I I I V I I I I V
465 TM2 M M M M M L L M V V
478 E1 H H H H H Q Q Q H H
479 E1 S S S S S S S S S T
496 TM3 T T T A T T T T T T
521 I2 Y Y Y Y Y Y H Y Y Y
528 I2 R R R R R H R R R R
533 I2 I I I M I I I I I I
557 TM4 L L L L L L L L M M
575 E2 E E E E E E E E D D
585 TM5 F F F L L L L L L L
591 TM5 I I I I I I I I V V
592 TM5 V V V V V I I V V V
596 TM5 I I I I I I I I V I
601c TM5 Y Y H Y Y Y Y Y Y Y
612 I3 Q Q Q Q Q H H Q Q Q
621 I3 K K K K K R R K K K
635 TM6 I M I M M M M M M M
648 E3 I I I L L L L L L L
649 E3 L L L M M M M M M M
657 I3 S S S T T T T T T T
661 TM7 I I I I I I I I I V
684 TM7 A A A T A A A A A A
E is external intervening loop.
415 is first residue of TM1, 621 last residue of I3, 648 first residue of E3, and 657 its last residue and 648 is the last residue of TM7.a These residues are indicated by (:) in the multiple alignment output.b Subtract 1 from all sequence numbering starting with 415.cH at 601 in hTSHR is based on Nagayama et al. [23] and was subsequently shown to be Y. It is left here to emphasize the potential deleterious of
what is considered to be an almost equivalent residue.
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 285
domain is, however, less easy to explain on the basis ofcurrent knowledge. The cytoplasmic tail is likely in-
volved in most post-binding events including signalling,
desensitization, internalization, and re-cycling. The
variability in structure between species may imply
qualitative and possibly quantitative differences between
species in these attributes. It has, for example, been
implied that the cytoplasmic tail encodes sequences for
TSHR internalization and re-cycling but the motif in-volved was not disclosed [25]. It is instructive that the
mouse LH receptor has a motif, absent in rat, pig, and
human receptors, in the terminal tail which determines
receptor re-cycling [26]. It is apparent (results not
shown) that hTSHR, which recycles [25], lacks this motif
and must employ other structural cues.
An eighth helical structure predicted by the resolu-
tion of the rhodopsin crystal [13] follows on TM7. Iteffectively forms a fourth intracellular loop as a con-
served cysteine following on that domain is pamuloly-
ated thus attaching it to the membrane. To our
knowledge this covalent linkage has not been directly
demonstrated for the TSHR. We wish to stress the po-
tential relevance of some conservative substitutions on
receptor function. Thus the hTSHR originally reported
by Nagayama et al. [23] and which we have used here inour tables by way of emphasis, designated residue 601 as
an H whereas in other sequences it was Y, confirmed by
the authors when further clones were sequenced. Ap-
parently Y601 forms an important molecular switch,
determining the specificity of engagement with the G
protein. The replacement of Y601 with H abrogates
receptor basal constitutive activity and its ability to
couple to Gq/11. Tyrosine has a larger volume, is moresoluble, and slightly more acidic than H [27,28].
It is worthy to note that although none of many of
the conservative substitutions are found at the beginning
or the ends of TM and inter-helical loops, (Tables 4 and
5), no example of a radical substitution could be dis-
cerned. These substitutions may, nevertheless, influence
the angle/charge of the ends of the TM helices and may
modify binding and/or signal transduction. It is worthyto note that none of the substitutions found correspond
to described sites of mutations or polymorphisms.
Structural motifs and characteristic residues
There are a number of structural motifs, which are
characteristic of TSHR as opposed to LHR and FSHR
or specific regions or residues thought to be essential orimportant for its specific function. Three signature mo-
tifs have been identified for all 14 TSHR sequences and
include: NISRIY predominantly at the beginning of
LLR2 and which comprise the first glycosylation site of
hTSHR, THIEIRN spanning the first eight residues
of LL3, whereas the third LKNPQE [29] occurs just
distal to the end of the 50 residue ‘‘add on’’ in CFR
(CFLANS). These conserved motifs very likely playimportant roles in the packing of the contact surface for
the ligand (the first two) or signalling (the third). While
experimental evidences have not directly identified these
regions as to be that important [17], some clues are being
provided by LLR modelling [30]. Considerations of the
secondary structure of the LLR region of the ectodo-
main indicates that the first eight residues of each LLR
describe a b coil whose external surfaces provide contactwith ligand [30, Fig. 3]
Cysteines
The cysteines in the extra-cellular domain consist of
two paired sets at the N- and C-terminus of the ecto-
domain, a general orientation typical of LLR receptors,
and an orphan at 176 (primate numbering) [17]. The
latter is the only one not preserved though the 14 se-quences: it is replaced by S in sheep and T in teleosts.
The former two cysteine sets are thought to be essential
for maintaining the secondary structure of the ectodo-
main, there is, however some uncertainty with respect to
the pairing of the C-terminus cysteines [2]. The equiva-
lent of primate cysteines 24, 29, 31, 41, 283, 284, 301,
390, 398, and 408 are strictly conserved in all 14 recep-
tors. Cysteines 539 and 599 appear to be strictly primatespecific.
Glycosylation sites differ among species
The hTSHR boasts six potential glycosylation sites.
All six have now been directly demonstrated to be at-
tached to a glycan [17,31]. The equivalents of those at
77, 198, and 302 are preserved in all 14 TSHRs (Figs. 1
and 3). As we show here the glycan attachment site at113 (–115) is apparently specific for H. sapiens, whereas
that at 177 (NET) appears to be characteristic of
mammals. The motif NISR (77 primate numbering) is
present in all vertebrates and its apparent essential role
to receptor trafficking and insertion into the plasma
membrane, appears to have been related to an experi-
mental artefact [31]. Systematic studies indicated that
glycans in four glycosylation sites, irrespective of loca-tion, were necessary for expression and biological ac-
tivity, although efficiency of both processes increased
with the involvement of more sites [31]. Interestingly Sea
Bass and Tilapia (but not Salmon) have a distinctive
potential glycosylation site (NTLT: 276–278 in the for-
mer and 277–279 in the latter species) at the N-terminus
of CRF. Both salmon TSHR_A and _B have only three
glycosylation sites (85–88 NISR; 206–209 NGTR; 310–313 NTLA).
The four site minimum formulation [31] would pre-
clude the two Salmon TSHRs being expressed or to
exhibit biological activity when, in fact, they are effi-
ciently expressed in COS-7 cells and efficiently stimu-
lated by bovine TSH [32], indicating that more than just
the number of glycan adducts must be involved. We
286 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
suggest that modification of ternary structure affected bysequences changes in LLRs (and possibly elsewhere)
may modify glycan orientation such that three sites
would be necessary and sufficient for biological function
of the receptor. Glycan adducts are important for
folding and assembly, secretion, and regulation of dis-
position. Both LH and TSH exhibit a unique sulphated
N-linked oligosaccharides, and these are present in all
vertebrate hormones [33].
Residues apparently important for TSH function
As a result of homologous gene segment exchange,
natural mutations as well as site-specific mutagenesis,
certain extracellular domain regions or residues were
deemed important for TSH association and receptor
activation. These include the regions 201–211 (primate
numbering), 222–230, 295–302, and Y385 [17]. The ty-rosine as position 385 or equivalent is preserved in all
vertebrate receptors. The sequences above are conserved
among the 10 mammalian sequences, but diverge in
teleosts, the least divergent being 387–395 equivalentthat follows on the third signature motif (LKNPQE)
and Y385 in CFR. These comparative studies may
suggest that important contact surfaces for ligand in
mammals and fish may have diverged to accommodate
differences in the details of ligand structure and receptor
function (Fig. 2).
Mutation at human 183 (K183R) results in preg-
nancy limited familial hyperthyroidism, related to in-creased sensitivity of TSHR to CG [34]. This residue is
conserved in the LHR and falls within LLR6 in a
transition region in which domain-exchange experi-
ments between the two receptors suggested a switch in
specificity. K183 is preserved in all mammals but its
equivalent position is substituted for M in teleosts.
Systematic mutagenesis of this residue has now been
carried out to [30] allow the surmise that the substitutionin teleosts would impart lutenizing attributes upon
TSHR. Indeed, the K mutation results in one of the
lowest EC50 for TSH and the lowest for GC [30]. This is
Fig. 3. Sequences constituting b strands at the beginning of the 9 leucine rich repeats (LRR) of vertebrate. TSHR The sequences in LLR3 completely
encompass the TSHR-specific motif, whereas the NISRIY motif overlaps the N-terminus of LLR2b strands. The strand have the following consensussequence: LXXLXLXX where X is any residue and L is leucine or equivalent (isoleucine, valine, or unusually methionine or alanine). Threonine is
used in TSHR in 2b strands. Apparently the side-chains of L or equivalent point towards the hyrophobic protein core, whereas the intervening (Xs)
residues are on the protein surface and thus potentially make contact with the ligand. The b strands of the LLR2 and LLR3 are the most conserved.The fifth position (K in mammals M in fish) of LLR 6 (K183 in mammals) is predicted to form a salt bridge in mammals with E (157 in mammals) in
the similar position in LLR5. The model predicts that the change to M would result in the disruption of that interaction and possible alternative
interactions with more proximal residues [30]. It is likely that teleost TSH may acquire additional and/or different biological functions as a con-
sequence.
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 287
without doubt a factor in the ability of fish TSHR (ex-emplified by Sea Bass) to bind and be activated by LH
[29]. By contrast salmon TSHR is reported not to be
expressed in gonads, but on thyroid epithelial cell at a
critical time of development [36].
Apparently K183 forms a salt bridge with E157, a
residue conserved in all LGRs and mutation of the
former to R results in the disruption of the salt bridge
and establishment of an association with a distal D232;change to M would likely result in the establishment of
alternative partners. These mutations clearly cause dif-
ferences in the ternary structure of adjacent LLR and
electrostatic potential to allow GC to be a more potent
ligand for the TSHR (Fig. 3).
S281 is important in receptor silencing
S281 is part of a motif, HSCCF, conserved in all threeglycoprotein hormone receptors and all TSHRs exam-
ined here except for that of Sea Bass (Fig. 1). S281
maintains the receptor in a ‘‘silent’’ constrained state.
Data from natural and introduced mutations suggest
that the degree to which TSHR is constitutively activated
and related to the measure to which the substituted res-
idue modifies the backbone of the protein [35–38]. Dis-
ruption of the local structure introduced by mutatingC283, which is also important for the maintenance of
TSHR ternary structure, releases the suppressive influ-
ence of S281 [37]. Moreover, long range interactions
between the ectodomain and the external inter-helical
loops make for a composite structural module, which can
be disorganized by S281 mutations [38]. In Sea Bass
TSHR R is substituted for S281, a substitution which
might be expected to be associated constitutive activityof the receptor. Careful examination of the data in
Kumar et al. [29], suggests this to be the case
(M. Szkudlinski, personal communications). The exact
physiological consequence of this observation is unclear.
Receptor cleavage
We also note that all three primates have a distinctive
306–307 MQ doublet as opposed to I and R in all othermammals with the exception of Aries which exhibits W
at 307. As this motif is immediately upstream from site I
for TSHR cleavage [17], and given the role that has been
postulated for the 50 residue ‘‘add on’’ sequence [39] in
CFR (CFLANS) cleavage in the pathogenesis of
Graves� disease, it is possible that Old World monkeys
receptors are cleavable. There is, however, no informa-
tion that primates develop spontaneous autoimmunehyperthyroidism. Adamlysins postulated to be involved
in hTSHR cleavage digest membrane anchored struc-
tures at defined distances from the membrane, have
recognition motifs, although there is less certainty as to
whether they have consensus cleavage sequences [40].
The MQ doublet might be important for enzyme
docking or guidance for downstream cleavage.
Mammalian TSHR gene evolution is average
It is apparent, even though remarkable sequence
conservations have been maintained, the separation of
teleosts from mammals (Fig. 2) allow for independent
change in structure to potentially permit new functional
niches for the receptor. We put to the test the possibility
that there may be differences in the evolutionary rate
shifts between primates and lower mammals [8]. Therewas not enough TSHR sequence divergence between
these mammal groups for the rate shift algorithm to
detect differences in evolutionary rates. There is a notion
that mutation rates among mammalian lineages may be
determined by generation time and that some genes
within different mammalian genomes may display sev-
eral fold differences from those in others. Recent work
comprising 8600 genes among 326 mammals showed nodifference in mutation rates according to generation time
and relatively homogeneous mutation rates among dif-
ferent genes [9]. Because of the acknowledged limited
sensitivity of the evolutionary rate shift algorithm, we
estimated the evolutionary separation times of mammals
on the basis of differences in silent mutations in TSHR
and related our results to those of the reputed archae-
ological separation times. The correspondence betweenthe two sets of figures was uncannily close: thus the
separation of C. aethiops fromM. mulatta was estimated
at 11.5Myr (10), between Homo and mulatta at 25Myr
(23), and between mouse and Homo at 103Myr (109);
the figures in brackets giving the archaeological sepa-
ration times. It is of interest that hominoids show a
higher rate of evolution compared to rodents, when a
separate and distant mammalian lineage is used as anoutgroup [9]. In essence, the evolutionary rate of
mammalian TSHR, including those in primates, is av-
erage for the totality of mammalian genomes.
TSHR is a receptor that dimerizes
Dimerization of TSHR was entertained for some time
on the basis of indirect evidence [41,42], and more re-cently based on direct experimental demonstration [43].
There is also evidence for high order oligomers [42].
Apparently homodimerization occurs after TSH cleav-
age and involves covalent bond formation between the
residual ectodomain stumps [44]. This covalent interac-
tion between receptors likely enhances signalling, thus
compensating for the low abundance of TSHR on thy-
roid cells [44]. There is a large body of evidence tosuggest that activation of many GPCRs involve dimer
formation (see [45,46] for review). Rhodopsin dimer-
ization on membranes was very recently demonstrated
by atomic-force microscopy [47].
The correlated mutation analysis (CMA) [10] we per-
formed affirms dimerization of TSHR transmembrane
helices. We have confirmed with a much larger number
288 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290
TSHR sequences the earlier finding [12] that 21/24 TMmutations are predicted to fall on the external surfaces of
the relevant helices. The distribution on the external as-
pects of the helices also support the notion of contact or
domain-swapped dimers occur between the external as-
pects of helices 5 and 6 [12]. Other helices [2,3,4] may be
involved in high-order oligomerization [12]. We empha-
size that the CMA analysis was performed with reference
to the rhodopsin crystal [13], that the CMA analysisembeds the TSHR helix structure within a large number
of GPCRs, with their correspondingmutations and lastly
CMA is upheld by an independent bioinformatics ap-
proach, evolutionary trace analysis [48].
In a different but related vein, we identified 24 TM
residues whose mutations modified receptor function: 5
inactivating and 19 activating [11,24]. The former were
limited to TM1 through 4. Except for two (420 in TM1and 597 in TM5), all these residues were either strictly
conserved or conservatively substituted in all 14 TSHR
sequences; all 24 were absolutely conserved in mammals.
In teleosts V420 was replaced by S and V597 by I. TM6
exhibited a stretch of five residues (629–633), each of
which when mutated result in a constitutively activated
receptor [24]. The characteristics of this region reflect,
on the one hand, the importance of the wild-type con-formation of this mid-region (�1.5 turn of the a-helix)and on the other a short-range domain effect in helical
orientation transduced through to its cytoplasmic end
and il3 to affect engagement with the trimeric G-protein.
Conclusion
The availability of TSHR sequences including 2 we
now describe from lower primates have allowed us new
insights into TSHR evolution, conservation of essential
function as well as the potential of diversification, pos-
sibly to fill niches of function consummate with survival
and success in a particular environment. A case is made
for the structural determinants for receptor dimerization
as an essential pathway to TSHR signalling.
Acknowledgments
The authors gratefully acknowledge the conscientious
technical assistance of Mrs. Krisztina Farkas and Ce-
cilia Ker�eekgy�aart�oo-Zalka (Institute of Genetics, Cell-andImmunbiology, National Center for Epidemiology,Budapest, Hungary). Blood collection and biopsies from
C. aethiops and M. mulatta were performed by Dr.
L�aaszl�oo Nemes (Chief: Prof. Dr. Gy€oorgy Berencsi) (Na-tional Center for Epidemiology, Budapest, Hungary)
and Dr. Peter Kottra (Chief: Prof. Dr. J�aanos Urb�aan)(Institute for Psychology of Hungarian Academy, Bu-
dapest). The work was supported in part by a special
grant from Osancor Biotech Inc. and from the Hun-garian Ministry of Health OKTA Grant T037184.
References
[1] S.Y. Hsu, A.J.W. Hsueh, Discovering new hormones, receptors
and signalling mediators in the genomic era, Mol. Endocrinol. 14
(2000) 594–604.
[2] V. Kaczur, I.A. Racz, A. Szendroi, M. Tackas, N.R. Farid, The
cysteine-rich C-flanking region of the thyrotropin receptor has
very ancient phylogenetic origins: implications from sequence
analysis, J. Endocr. Genet. 3 (2002) 43–56.
[3] J.G. Hacia, Genomes of the apes, Trends Genet. 17 (2001) 637–
645.
[4] T. Kimura, A. Van Keymeullen, J. Goldstein, A. Fusco, J.E.
Dumont, P.P. Roger, Regulation of thyroid cell proliferation by
TSH and other factors: a critical evaluation of in vitro models,
Endocr. Rev. 22 (2001) 631–656.
[5] K.L. Laugwitz, A. Allgeir, S. Offermanns, K. Spicher, J. Van
Sande, J.E. Dumont, G. Schultz, The human thyrotropin recep-
tor: a heptahelical receptor capable of stimulating members of all
four G protein families, Proc. Natl. Acad. Sci. USA 93 (1996)
116–120.
[6] A. Allgier, K.L. Laugwitz, J. Van Sande, G. Schultze, J.E.
Dumont, Multiple G-protein coupling of the dog thyrotropin
receptor, Mol. Cell. Endocrinol. 127 (1997) 81–90.
[7] J. Sambrook, E.F. Fritsch, T. Maniatis, Molecular Cloning: A
Laboratory Manual, second ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, New York, 1989.
[8] B. Knudsen, M.M. Miyamoto, A likelihood ratio test for
evolutionary rate shifts and functional divergence among proteins,
Proc. Natl. Acad. Sci. USA 98 (2001) 14512–14517.
[9] S. Kumar, S. Subramanian, Mutation rates in mammalian
genomes, Proc. Natl. Acad. Sci. USA 99 (2002) 803–808.
[10] G. Vriend, WHAT IF: A molecular modelling and drug design
program, J. Mol. Graph 8 (1990) 52–56.
[11] http://www.gpcr.org.
[12] P.R. Gouldson, M.K. Dean, C.R. Snell, R.P. Bywater, G.
Gkoutos, C.A. Reynolds, Lipid-facing correlated mutations and
dimerization in G-protein coupled receptors, Protein Eng. 14
(2001) 759–767.
[13] K. Palczewski, T. Kumasaka, T. Hori, C.A. Behnke, H. Moto-
shima, B.A. Fox, I. le Trong, D.C. Teller, T. Okada, R.E.
Stenkamp, M. Yamamoto, M. Miyano, Crystal structure of
rhodopsin: A G protein-coupled receptor, Science 289 (2000) 739–
745.
[14] Z. Lin, A. Shenker, R. Pearlstein, A model of the lutropin/
choriogonadotropin receptor: insights into the structural and
functional effects of constitutively activating mutations, Protein
Eng. 10 (1997) 501–510.
[15] http://tolweb.org/tree/phylogeney.html.
[16] G.F. Turner, O. Seehausen, M.E. Knight, C.J. Allender, R.L.
Robinson, How many species of cichlid fishes are there in the
African lakes? Mol. Evol. 10 (2001) 793–806.
[17] B. Rapoport, G.D. Chazenbalk, J.C. Jaume, S.M. McLachlan,
The thyrotropin (TSH)-releasing hormone receptor: interaction
with TSH and autoantibodies, Endocr. Rev. 19 (1998) 673–
716.
[18] S. Tavare�, C.M. Marshall, O. Will, C. Soligo, R.D. Martin, Using
the fossil record to estimate the age of the last common ancestor of
extant primates, Nature 416 (2002) 726–729.
[19] S.L. Page, C.H. Chui, M. Goodman, Molecular phylogeney of
Old World monkeys (Cercopithecidae) as inferred from c-globulin DNA sequences, Mol. Phylogenet. Evol. 13 (1999)
335–348.
V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290 289
[20] A.C. Van der Kuyl, C.L. Kuikek, J.T. Dekker, J. Goudsmit,
Phlyogeney of African monkeys based on mitochondrial 12S
rRNA sequences, J. Mol. Evol. 40 (1995) 173–180.
[21] E.E. Harris, T.R. Disotell, Nuclear trees and the phylogenetic
relationships of the Mangabeys (Primates: Paionini), Mol. Biol.
Evol. 15 (1998) 892–900.
[22] J.M. Scott, Genetic diversity and disease: opportunities and
challenge, Proc. Natl. Acad. USA 98 (2001) 14754–14756.
[23] Y. Nagayama, K.D. Kaufman, P. Seto, B. Rapoport, Molecular
cloning, sequencing and functional expression of the cDNA for
the human thyrotropin receptor, Biochem. Biophys. Res. Com-
mun. 165 (1989) 1184–1190.
[24] N.R. Farid, V. Kaczur, C. Balazs, The human thyrotropin
receptor is highly mutable: a review of gain-of-function mutations,
Euro. J. Endocrinol. 143 (2000) 25–30.
[25] C. Bratti_Elbaz, N. Ghunea, O. Lahuna, H. Loosfelt, C.
Pichon, E. Milgrom, Internalization and recycling pathways of
the thyrotropin receptor, Mol. Endocrinol. 13 (1999) 1751–
1765.
[26] Kishi, X. Liu, T. Hirakawa, D. Reczek, A. Bretscher, M. Ascoli,
Identification of two distinct structural motifs when added to the
C-terminal tail of the rat LR receptor, redirect the internalised
hormone–receptor complex from degradation to a recycling
pathway, Mol. Endocrinol. 15 (2001) 1624–1635.
[27] H. Biebermann, T. Schoneberg, A. Schultz, G. Krause, A. Guters,
G. Schultz, T. Gudermann, A conserved tyrosine residue (Y601)
in transmembrane domain 5 of the human thyrotropin receptor
serves as a molecular switch to determine G-protein coupling,
FASEB J. 12 (1998) 1461–1470.
[28] O. Karamanoglu Arseven, W.P. Wilkes, J.L. Jameson, P. Kopp,
Naturally occurring variants of the human thyrotropin (TSH)
receptor reveal that tyrosine 601 (Y601) plays a pivotal role in
constitutive basal activity and coupling to Gsa and Gq/11,
Thyroid 10 (1999) 1–3.
[29] R.S. Kumar, K. Ijiri, P. Swanson, A. Dittman, D. Alok, Y. Zoher,
J.M. Trant, Cloning and functional expression of a thyrotropin
receptor from the gonads of a vertebrate (bony fish): potential
thyroid-dependent role for thyrotropin in reproduction, Mol. Cell
Endocrinol. 167 (2000) 1–9.
[30] G. Smits, C. Govaerts, I. Nugourgh, L. Pardo, G. Vassart, S.
Costagliola, Lysine 183 and glutamic acid 157 of the TSH
receptor: two interacting residues with a key role in determining
specificity towards TSH and human CG, Mol. Endocrinol. 16
(2002) 722–735.
[31] Y. Nagayama, E. Nishihara, H. Namaba, S. Yamashita, M. Niwa,
Identification of the sites of aspargine-linked glycosylation of the
humanthyrotropin receptor and studies on their role on receptor
function and expression, J. Pharm. Exp. Therapeut. 295 (2000)
404–409.
[32] Y. Oba, T. Hirai, Y. Yoshiura, T. Kobayashi, Y. Nagahama,
Cloning, functional characterization, and functional expression of
thyrotropin receptors in the thyroid of amago salmon (Onorhyn-
chus rhodurus, Biochem. Biophys. Res. Commun. 276 (2000) 258–
263.
[33] Y. Mi, S.D. Shapiro, J.U. Baenziger, Regulation of lutropin
circulatory half-life by the mannose/N-acetylgalosamine-4-SO4
receptor is critical for implantation in vivo, J. Clin. Invest. 109
(2002) 269–276.
[34] P. Rodien, C. Bremont, M.L. Sanson, J. Parma, J. Van Sande, S.
Costagliola, J.P. Luton, G. Vassart, L. Duprez, Familial gesta-
tional hyperthyroidism caused by a mutant thyrotropin receptor
hypersensitive to human chorionic gonadotropin, N. Eng. J. Med.
339 (1998) 1823–1826.
[35] P. Kopp, S. Muirhead, N. Jourdian, W.X. Gu, J.L. Jameson, C.
Rodd, Congenital hyperthyroidism caused by a solitary toxic
adenoma harboring a novel somatic mutation (serine 281! iso-
leucine) in the extracellular domain of the thyrotropin receptor, J.
Clin. Invest. 100 (1997) 1634–1639.
[36] L. Duprez, J. Parma, S. Costagliola, J. Hermans, J. Van Sande,
J.E. Dumont, G. Vassart, Constitutive activation of the TSH
receptor by spontaneous mutations. Affecting the N-terminal
extracellular domain, FEBS Lett. 409 (1997) 469–474.
[37] S.C. Ho, J. Van Sande, A. Lefot, G. Vassart, S. Costagliola,
Effects of mutations involving the highly conserved S281 HCG
motif in the extracellular domain of the thyrotropin (TSH)
receptor in TSH binding and constitutive activity, Endocrinology
142 (2001) 2760–2767.
[38] V. Vlaeminck-Guillem, S.-C. Ho, P. Rodien, G. Vassart, S.
Costagliola, Activation of the cAMP pathway by the TSH
receptor involves switching of the ectodomain from a tethered
inverse agonist to an agonist, Mol. Endocrinol. 16 (2002) 736–746.
[39] G.D. Chazenbalk, P. Pichuria, C.-R. Chen, F. Latofe, A.P.
Johnstone, S.M. McLachlan, B. Rapoport, Thyroid stimulating
autoantibodies in Graves� disease preferentially recognize the freeA subunit, not the thyrotropin holoreceptor, J. Clin. Invest. 110
(2002) 209–217.
[40] R.A. Black, J.M. White, ADAMS: focus on the protease domain,
Curr. Opin. Cell Biol. 10 (1998) 654–659.
[41] M.N. Islam, N.R. Farid, Structure of the porcine thyrotropin
receptor: a 200 kiloDalton glycoprotein heterocomplex, Experi-
entia 41 (1985) 18–23.
[42] P.N. Graves, H. Vlase, Y. Bobvnikova, T.F. Davies, Multimeric
complex formation by the thyrotropin receptor in solubilized
thyroid membranes, Endocrinology 137 (1997) 3915–3920.
[43] R. Latif, P. Graves, T.F. Davies, Oligomerization of the human
thyrotropin receptor: florescent protein-tagged hTSHR reveals
post-translational complexes, J. Biol. Chem. 276 (2001) 45217–
45224.
[44] M.N. Islam, C.L. Hew, V. Kaczur, N.R. Farid, Cleavage of the
thyrotropin receptor, J. Endocrinol. Invest. 24 (2001) 833–835.
[45] P.R. Gouldson, C. Higgs, R.E. Smith, M.K. Dean, G.V. Gkoutos,
C.A. Reynolds, Dimerization and domain swapping in G-protein-
coupled receptors: a computational study, Neuropsychopharma-
cology 23 (2000) S60–S77.
[46] H.E. Hamm, How activated receptors couple to G proteins, Proc.
Natl. Acad. Sci. USA 98 (2001) 4819–4821.
[47] D. Fotiadis, Y. Liang, Filipek, D.A. Saperstein, A. Engel, K.
Palczewski, Rhodopsin dimers in native disc membranes, Nature
421 (2003) 127–128.
[48] M.K. Dean, C. Higgs, R.E. Smith, R.B. Bywater, C.R. Snell, P.D.
Scott, G.J. Upton, T.J. Howe, C.A. Reynolds, Dimerization of G-
coupled receptors, J. Med. Chem. 44 (2001) 4595–4616.
290 V. Kaczur et al. / Molecular Genetics and Metabolism 78 (2003) 275–290