Upload
others
View
8
Download
0
Embed Size (px)
Citation preview
GHENT UNIVERSITY
FACULTY OF PHARMACEUTICAL SCIENCES
Department of pharmaceutics
Laboratory for Medicinal Chemistry
Master thesis performed at:
UNIVERSITY OF CAMERINO
SCHOOL OF PHARMACY
Department of Chemical Sciences
Academic Year 2011-2012
ANALYSIS AND USE OF ADENOSINE RECEPTOR STRUCTURES WITH MOLECULAR MODELING TOOLS
Pieter CARDOEN
First Master of Pharmaceutical Care
Promoter
Prof. dr. S. Van Calenbergh
Co-promoter
Dr. D. Dal Ben
Commissioners
Prof. dr. S. Van Calenbergh
Prof. dr. B. De Spiegeleer
Dr. M. Risseeuw
GHENT UNIVERSITY
FACULTY OF PHARMACEUTICAL SCIENCES
Department of pharmaceutics
Laboratory for Medicinal Chemistry
Master thesis performed at:
UNIVERSITY OF CAMERINO
SCHOOL OF PHARMACY
Department of Chemical Sciences
Academic Year 2011-2012
ANALYSIS AND USE OF ADENOSINE RECEPTOR STRUCTURES WITH MOLECULAR MODELING TOOLS
Pieter CARDOEN
First Master of Pharmaceutical Care
Promoter
Prof. dr. S. Van Calenbergh
Co-promoter
Dr. D. Dal Ben
Commissioners
Prof. dr. S. Van Calenbergh
Prof. dr. B. De Spiegeleer
Dr. M. Risseeuw
COPYRIGHT
"The author and the promoters give the authorization to consult and to copy parts of this
thesis for personal use only. Any other use is limited by the laws of copyright, especially
concerning the obligation to refer to the source whenever results from this thesis are cited."
May 25, 2012
Promoter
Prof. dr. S. Van Calenbergh
Author
P. Cardoen
SUMMARY
Up to date, eleven crystal structures of the A2AAR co-crystallized with an agonist or
antagonist have been reported in the literature. Firstly, we seek a better understanding of these 3D
structures, which is necessary to carry out the subsequent research that has a threefold objective.
On the basis of the position and the role of receptor residues, we aim at interpreting mutagenesis
data for the A2AAR. When we are able to explain these data, this may lead to more targeted
research on A2AAR ligands. A second objective envisions a limitation of the number of compounds
to be synthesized in drug development through the implication of docking and quantitative-
structure activity relationship methods. In this way research can be made cheaper and less time-
consuming. The last objective is to create homology models all adenosine receptor subtypes.
Discovering differences in binding mode of adenosine, the physiological agonist of the A2AAR, may
lead to the development of subtype selective compounds.
Analysis of the mutagenesis data allowed to confirm earlier suspicions and knowledge on the
A2AAR and to develop new insights. Inserting mutations can cause two effects: destabilization of
the receptor conformation and influencing the ligand affinity. So understanding mutagenesis data
may be useful for the development of A2AAR ligands and for receptor thermostabilization.
The second objective of our research consisted of two parts. The antagonist ZM241385 was
docked on three A2AAR crystal structures that were co-crystallized with it. This allowed identifying
the best placing algorithm-scoring function combination that was subsequently applied on a
dataset of A2AAR antagonist compounds with known affinity. Re-scoring of the best poses carried
out with the quantitative-structure activity relationship method resulted in a model that was
validated using a test set. In this way, a distinction can be made between promising and less
promising compounds, so the synthesis stage in developing new ligands is simplified.
Finally, homology models of the four subtypes of the adenosine receptor were generated.
Differences seemed to be subtle, but the mutation of glutamate 169, an important residue in
ligand binding, into a valine in case of the A3AR, was remarkable. This could be used to create
compounds with a higher affinity for the A3AR with respect to the other subtypes by the
introduction of more hydrophobic substituents on the ligand scaffold.
SAMENVATTING
Tot op heden werden elf kristalstructuren van de A2AAR gerapporteerd in de literatuur. Deze
betreffen een complex van deze receptor met een agonist of een antagonist. Vooreerst trachten
we een beter inzicht te krijgen in deze 3D structuren, wat noodzakelijk is voor het uitvoeren van
het aansluitend onderzoek. Het objectief van dit onderzoek is drieërlei. Aan de hand van de plaats
en de functie van receptorresiduen trachten we de resultaten van mutagenese studies te
interpreteren. Indien we hiertoe in staat zijn, kan dit leiden tot een gerichter onderzoek naar
liganden voor de A2AAR. Een tweede objectief beoogt het beperken van het aantal verbindingen
dat dient te worden gesynthetiseerd tijdens de ontwikkeling van nieuwe geneesmiddelen door
toepassing van docking en QSAR methoden. Zo kan het onderzoek goedkoper en minder
tijdrovend worden gemaakt. Het laatste objectief is het bouwen van homologie modellen van de
vier adenosine receptor subtypes. Het ontdekken van verschillen in het binden van adenosine, de
fysiologische agonist van de A2AAR, kan leiden tot de ontwikkeling van verbindingen met
selectiviteit voor één van de subtypes.
Analyse van de mutagenese data liet toe eerdere kennis en vermoedens betreffende de
A2AAR te bevestigen en nieuwe inzichten te ontwikkelen. Het invoeren van mutaties kan leiden tot
twee effecten: destabilisatie van de receptorconformatie en verandering van de affiniteit voor
liganden. Een beter begrip van mutagenese data kan dus nuttig zijn voor de ontwikkeling van
liganden voor de A2AAR en de mutaties kunnen toegepast worden voor thermostabilisatie.
Het tweede objectief bestond uit twee delen. De antagonist ZM241385 werd gedocked in
drie A2AAR kristalstructuren die gecokristalliseerd werden met deze molecule. Dit liet toe de beste
combinatie placing algorithme-scoring functie te selecteren en vervolgens toe te passen op een
dataset van A2AAR antagonisten met gekende affiniteit. Het herscoren van de beste bindingswijzen
met de QSAR methode resulteerde in een model dat werd gevalideerd gebruikmakende van een
test set. Zo werd een onderscheid gemaakt tussen veelbelovende en minder interessante
verbindingen, waardoor de synthese-inspanningen kunnen gereduceerd worden.
Tenslotte werden homologie modellen van de adenosine receptor subtypes gebouwd.
Hoewel verschillen subtiel zijn, valt vooral de mutatie van glutamaat 169 door valine op in de A3AR,
wat suggereert dat hydrofobe modificatie van liganden kan resulteren in A3AR selectieve
verbindingen.
TABLE OF CONTENTS
1. INTRODUCTION ............................................................................................................. 1
1.1 G PROTEIN-COUPLED RECEPTORS AND THE ADENOSINE A2A RECEPTOR ................................. 1
1.2. A2AAR LIGANDS .................................................................................................................. 2
1.2.1. A2AAR agonists ...................................................................................................................................... 2
1.2.2. A2AAR antagonists ................................................................................................................................ 2
1.3. STRUCTURE OF THE A2AAR .................................................................................................. 4
1.3.1. Molecular modeling studies ................................................................................................................. 4
1.3.2. Adenosine receptor crystal structures ................................................................................................. 6
1.3.2.1. Receptor engineered with T4 Lysozyme ........................................................................................................ 6
1.3.2.2. Receptor engineered through thermostabilization ....................................................................................... 7
1.3.2.3. Receptor engineered with Fab fragment ....................................................................................................... 8
1.3.2.4. A2AAR crystal structures in complex with ZM241385 .................................................................................... 8
1.3.2.5. Adenosine receptor crystal structures in complex with an agonist ............................................................. 12
2. OBJECTIVES.................................................................................................................. 14
3. METHODS .................................................................................................................... 15
3.1. ANALYSIS OF MUTAGENESIS DATA OF THE A2AAR ............................................................... 15
3.2. MAIN PRINCIPLES OF MOLECULAR MECHANICS AND MOLECULAR DOCKING ..................... 15
3.2.1. Molecular mechanics ........................................................................................................................ 15
3.2.2. Molecular Docking ............................................................................................................................. 20
3.2.2.1 Retrieval and analysis of A2AAR crystal structures as docking targets .......................................................... 21
3.2.2.2. Docking analysis .......................................................................................................................................... 23
3.2.2.3. Post-docking analysis: Quantitative structure-activity relationship ............................................................. 27
3.3. HOMOLOGY MODELING ................................................................................................... 30
4. RESULTS AND DISCUSSION ........................................................................................... 32
4.1. ANALYSIS OF MUTAGENESIS DATA OF THE A2AAR ............................................................... 32
4.2. DOCKING AND QSAR ANALYSIS OF DOCKING RESULTS ....................................................... 37
4.3. DEVELOPMENT AND STUDY OF HOMOLOGY MODELS ........................................................ 41
5. CONCLUSION ............................................................................................................... 45
6. BIBLIOGRAPHY ............................................................................................................ 47
LIST OF ABBREVIATIONS
A1AR Adenosine A1 Receptor
A2AAR Adenosine A2A Receptor
A2BAR Adenosine A2B Receptor
A3AR Adenosine A3 Receptor
bacRho Bacterial Rhodopsin
bRho Bovine Rhodopsin
EL Extracellular Loop
ERK Extracellular signal-Regulated Kinase
FDA Food and Drug Administration
GDP Guanosine Diphosphate
GPCR G Protein-Coupled Receptor
GTP Guanosine Triphosphate
IL Intracellular Loop
PDB Protein Data Bank
RMSD Rooth Mean Square Deviation
TM Transmembrane
1
1. INTRODUCTION
1.1 G PROTEIN-COUPLED RECEPTORS AND THE ADENOSINE A2A RECEPTOR
The adenosine receptor is a member of the family of G protein-coupled receptors (GPCRs),
which forms the largest family of drug-targetable cell surface receptors. These are all membrane
spanning proteins consisting of seven transmembrane (TM) α-helices that are connected to each
other by three intracellular (IL) and three extracellular (EL) loops. The N-terminal domain of the
receptor is located in the extracellular environment, the C-terminal domain in the cytosol. The
family of GPCRs is clustered into five classes: the rhodopsin family or family A, the secretin family
or family B, the glutamate family or family C, the adhesion family or family D and the Frizzled/Taste
family or family E. Four subtypes of adenosine receptors, which belong to family A, are
distinguished, adenosine A1 (A1AR), A2A (A2AAR), A2B (A2BAR) and A3 (A3AR) receptors, each of them
showing different pharmacological properties.1’2
Activation of a G protein-coupled receptor induces the exchange of guanosine diphosphate
(GDP) for guanosine triphosphate (GTP) in the heterotrimeric G protein, which is bound to an
intracellular binding pocket. Subsequent dissociation of the G protein into separate Gα and Gβγ
subunits takes place. These subunits are now able to interact with intracellular targets. A2AARs are
coupled to Gs proteins. Activation of the receptor results in stimulation of adenylate cyclase and
subsequent activation of cAMP-dependent protein kinase A or C. This protein kinase
phosphorylates various intracellular targets, among others receptors and ion channels. Other
mechanisms are the stimulation of Golf proteins in brain tissue and the interaction with Ca2+
channels to modulate ERK activity. A variety of proteins, among others α-actinin, calmodulin and β
arrestins, can interact at the exceptionally long C-terminus of the A2AAR.1,3
Adenosine receptors are not only expressed in the monomeric state, they also form dimeric
structures through self-association and hetero-oligomerization. While two identical adenosine
receptor subtypes forms homomers, heteromers are formed through linking of an adenosine
receptor and another receptor. A2AARs are highly expressed in the striatum of the brain, where
they interact with dopamine D2 receptors to form such heterodimers. As the A2AAR antagonists can
simulate the activation of dopamine D2 receptors, they may have potential in the treatment of
Parkinson’s disease and drug addiction.4
2
Within the class of adenosine receptors, a high degree of sequence identity is found, with
the highest conservation between the A2AAR and A2BAR. In particular, the extracellular part of the
helical bundle shows the highest domain similarity, while the loops share less amino acid sequence
identity. Highest sequence similarity is found in the binding pocket region.5
1.2. A2AAR LIGANDS
For several decades, potent and selective adenosine receptor ligands have been the object of
medicinal chemistry research. In what follows, the focus is on the discovery of agonists and
antagonists for the A2AAR. The endogenous purine nucleoside adenosine is the physiological
agonist for the A2AAR. Because the A2AAR is activated by nanomolar concentrations of adenosine,
the receptor is called a high affinity AR subtype.
1.2.1. A2AAR agonists
Due to its limited stability, the physiological agonist adenosine was modified in order to
discover stable and more selective ligands for the ARs. The presence of the adenosine scaffold is of
major importance and has to be conserved to obtain efficient and selective agonists. The scaffold
can be modified in the 3’ and 4’ positions of the ribose and the 2 and N6 positions of the purine in
order to find metabolically stable agonists. Lexiscan, an Ado derivative that was substituted at the
2 position, was approved by the Food and Drug Administration (FDA) for use as a pharmacological
stress agent.6
1.2.2. A2AAR antagonists
The A2AAR antagonists exhibit a high structural variability. Three major groups can be
distinguished: xanthine analogs, adenine analogs and atypical heterocyclic compounds. Fifteen
years ago it was reported that caffeine (Fig. 1.1A) possesses a high affinity for the A2AAR and its
interaction with this receptor mediates the behavioral stimulation typically observed by caffeine
intake. Hence, the xanthine scaffold was modified at the positions 1, 3, 7 and 8 in order to yield
some potent and selective A2AAR antagonists. 3,7-dimethyl-1- propargylxanthine (DPMX, Fig. 1.1B)
was the first weakly potent xanthine analogue designed as A2AAR antagonist. Subsequent research
into 1,3,8-substituted xanthines resulted in the discovery of the first strongly active and selective
A2AAR antagonists, such as 3-chlorostyrylcaffeine (CSC). However, the use of these xanthine
3
analogs for studying the A2AAR was limited because of photoisomerization and their low water
solubility. To overcome the phenomenon of isomerization, the styryl moiety of styrylxanthines can
be replaced by another functional group or the compounds can be administered as solid
substances. The substitution of the styryl moiety, however, led to a decrease in affinity.
Modification of the styrylxanthines was suggested to improve the water solubility and for example
the compound MSX-3, the prodrug of MSX-2 (Fig. 1.1C), seemed to be more efficient than the drug
itself.6
Figure 1.1: Structure of some A2AAR agonists and antagonists.
Considering the problems of low water solubility and photoisomerization of the xanthine
derivatives, the focus has shifted to the development of non-xanthine analogues. Promising A2AAR
antagonists are the non-xanthine imidazopyrimidine (purine)-type structures, such as ANR-152
(Fig. 1.1E) and VER-6947 (Fig. 1.1F) that are endowed with enhanced binding affinities. Further
research resulted in the synthesis of adenine analogs bearing an alkynyl chain in the 2 or 8
position. The highest affinities at the A2AAR were observed for the 2-alkynyl derivatives. It was
decided to further explore the 2 position of 9-ethyladenine derivatives by introducing
phenylethylamino or phenylethoxy groups. These ligands bear a bromine atom in the 8 position,
4
which results in ligands showing nanomolar Ki values. Replacing the 8-bromine atom by a 2-furyl
group increased selectivity for the A2AAR. 9-propyladenines substituted with bulky chains at the N6
position show improved binding affinities at the A1AR and A3AR, and subsequently selectivity for
these receptors. However, introduction of a chlorine atom in the 2 position leads to compounds
with increased A2AAR selectivity.6
Another class of A2AAR antagonists is that of the heterocyclic compounds, like CGS 15943
(Fig. 1.1D) and ZM241385 (Fig. 4) that are very potent as A2AAR antagonist. Also these analogs did
not show favorable water solubility. Other derivatives, such as benzothiazole and 1,2,4-triazole
analogs, can be used for generating therapeutic A2AAR antagonists.6
1.3. STRUCTURE OF THE A2AAR
1.3.1. Molecular modeling studies
Early approaches to describe the structural features of adenosine receptors with
computational tools afforded a “negative image” of the binding pockets. The analyses were carried
out by superimposing and comparing various antagonists, so the key factors for the affinity could
be clarified and a 3D structure made up.
The second generation of adenosine receptor 3D models was obtained by using homology
modeling. This technique aims at obtaining the three-dimensional structure of the receptor using
the structure of a comparable receptor as template. The sequence of the homologue receptor, the
adenosine receptor, is aligned with that of the template and subsequently converted into a three-
dimensional model. The better the resolution of the template and the sequence similarity, the
higher the accuracy of the predicted structure will be. So, in order to obtain trustable GPCR
models, suitable templates need to be available.
Solving the three-dimensional crystal structure of membrane proteins is quite complex. Two
important factors contribute to this difficulty in crystallizing transmembranal receptors. Firstly,
membrane proteins are present in very low concentrations in tissues; secondly their surface is
amphipathic. The surface, in contact with the intracellular and extracellular aqueous phases, is
polar while the transmembranal region is hydrophobic.
5
Figure 1.2: Representation of the structure of the human A2AAR. Colours of the ribbon vary
from dark blue to dark red. Dark blue represents the extracellular N-terminal tail of the receptor,
dark red the intracellular C-terminal tail. The seven helical segments are shown. In the upper
part of the picture, the binding pocket (yellow) and co-crystallized ZM241385 (orange) are
depicted.
Despite the absence of any sequence homology with bacterial rhodopsin (bacRho), the
bacRho structure obtained by cryo-electron microscopy studies was used for the generation of the
early adenosine receptor homology models. This procedure consists of the replacement of the
bacRho amino acids by the corresponding residues of the adenosine receptor. The models
obtained by this method were used to perform mutagenesis studies, which aim at highlighting the
role for ligand interaction of the residues present in the binding pocket.
The 3D structure of bovine rhodopsin (bRho) was solved through X-ray diffraction in 2000.7
Adenosine receptor homology models could be built from this template, which also provided the
structure of the intracellular and extracellular loops. The structural models obtained by homology
6
modeling were more accurate than the first models and even results of previous mutagenesis
studies could be rationalized.
In late 2007 both the crystal structure of the human and the turkey β2 adrenergic receptor
were reported.8,9 The human receptor was engineered with the insertion of a T4 Lysozyme
segment instead of the IL3. A different way of transmembranal helix packing, EL2 fold and a
different number of disulfide links, is observed between the bRho and adrenergic β2 receptor.
These differences result in another orientation of the binding site amino acids and a particular
binding pose and orientation of the ligands.
1.3.2. Adenosine receptor crystal structures
Tremendous progress was made by the crystallization of the human A2AAR (Fig 1.2).10 To
date, 11 crystal structures of human A2AAR have been obtained and published.10-15 In contrast to
the bRho and adrenergic β receptor binding sites, the binding pocket of A2AAR is located differently
and the receptor-ligand contacts are provided by residues of different transmembranal helices and
extracellular loops. Major causes for this different receptor-ligand interaction pattern are the
dissimilar ligand pose and orientation, and the different EL2 fold. In bRho the binding site is
covered by EL2, which restricts the insertion of ligands in the binding cavity. So, in bRho-based
adenosine receptor models bulky ligands are not able to enter the binding pocket.5 A first
comparative analysis of the human A2AAR crystal structures can be made considering the methods
used to stabilize and crystallize the receptor-ligand complex. In all cases, engineered receptors
were used because of the thermolability of the wild-type adenosine receptor. So, the goal of
engineering the receptors is to create stable receptor variants, which can undergo solubilization,
purification and X-ray studies.
1.3.2.1. Receptor engineered with T4 Lysozyme
In order to obtain the crystal structure, the wild-type A2AAR was engineered by substituting
most of the third cytosolic loop, which is of major importance for G-protein binding, with a T4
Lysozyme fragment. Further stabilization of the receptor was achieved by replacing a part of the C-
terminal domain with a histidine purification epitope. Both procedures result in increasing the
likelihood of crystal formation, because now some flexible regions of the wild-type receptor are
removed. In the purification step, the addition of modulators such as sodium chloride and
7
cholesterol, and a receptor-saturating concentration of theophylline is seen as vital. Additionally,
modification of this construct at the amino terminus was performed by addition of a
hemagglutinin signal sequence and a FLAG-M2 detection tag. This method was used to obtain two
crystal structures, one in complex with the high affinity antagonist ZM241385, the other one in
complex with the potent agonist UK432097.11 Given the significant modifications applied to the
wild-type receptor, it is appropriate to characterize the pharmacological properties of the
construct with respect to the wild-type receptor properties.10,16
1.3.2.2. Receptor engineered through thermostabilization
Further A2AAR crystal structures were obtained by using conformational thermostabilization,
a mutagenic strategy that consists of introducing some point mutations into the receptor sequence
(Fig 1.3). This method aims at obtaining both higher receptor stability and agonist/antagonist
selectivity. Each point mutation is inserted to maintain interaction with one ligand species and to
reduce the binding of the other one, by changing the equilibrium between agonist and antagonist
conformation. When the bound ligand is an antagonist, the equilibrium will be altered in the
antagonist conformation. If it concerns an agonist, the equilibrium will be driven towards the
agonist conformation. The thermostabilization allows the use of short-chain detergents during
purification, crystallization and structure determination This method was used to obtain 7 crystal
structures of human A2AAR, in complex with agonist (Adenosine, NECA) or antagonist (ZM241385,
Caffeine, XAC, and two triazine derivatives) compounds.12,13,15
Figure 1.3: Effects and location of mutations into the A2AAR receptor. A. Thermostability
plots B. Sequence with mutated residues (red cercles)
8
1.3.2.3. Receptor engineered with Fab fragment
Stabilization of the receptor was achieved also by generating a complex of the receptor with
a mouse monoclonal antibody Fab fragment. In fact, the complementarity-determining region of
the Fab fragment binds to an intracellular binding pocket formed by TM2, TM3, TM6, and TM7.
This pocket is the critical site for transferring signals between the activated adenosine receptor and
the G protein. In contrast to the above described crystal structures, this does not cause disruption
of IL3. Two crystal structures were obtained, in both cases in complex with ZM241385
compound.14 The raising of Fab2838 against the intracellular surface of the A2AAR locks the
receptor in an inactive state configuration. This results in inhibition of agonist binding, while
antagonist binding is not affected.14
1.3.2.4. A2AAR crystal structures in complex with ZM241385
ZM241385 (Fig 1.4), which is co-crystallized with the A2AAR in different crystal structures, has
a prototypical AR antagonist structure. The bicyclic triazolotriazine unit, which is positioned in the
center of the binding pocket, is the core of the molecule. This scaffold bears a furan ring and a 4-
hydroxyphenylethyl side chain. The furan ring is located in the lower part of the binding pocket,
while the 4-hydroxyphenylethyl side chain points to the extracellular environment.1
Figure 1.4: Structure of the selective A2AAR antagonist ZM241385
Residues from different transmembranal helices and loops are involved in binding the
antagonist. The residues involved in the most significant interactions are described below. First,
some residues of extracellular loop 2 are involved in high-affinity ligand binding. Phenylalanine 168
and glutamate 169 interact with ZM241385 through an aromatic π-stacking interaction with the
triazolotriazine core and hydrogen bonding respectively. The side chain of phenylalanine 168 is
9
essential in binding ZM241385. The bicyclic system and the NH group of ZM241385 are coplanar. In
the 3EML (Fig 1.5) and the 3VG9 (Fig 1.7) crystal structure, where the NH group points towards the
receptor core, ZM241385 is found in the extended conformation and the molecule is orientated
perpendicular to the plane of the membrane bilayer. Here, glutamate 169 forms an intermolecular
hydrogen bond with the NH2 group. When the NH group points towards the extracellular side, as
seen in the 3PWH crystal structure (Fig 1.6), the ZM241385 ligand is found in the folded
conformation and glutamate forms a hydrogen bond with the NH group itself. These are the two
main conformations possible, because the co-planarity allows only few conformations. As the
glutamate residue interacts in the three crystal structures with ZM241385 through a hydrogen
bond, the results of mutagenesis studies are easily understandable. Mutation of glutamate 169 by
an alanine results in a reduction of the antagonist binding.17
Figure 1.5: A. Representation of the A2AAR binding pocket of the 3EML crystal structure.
Critical residues in ligand binding are colored in yellow, ZM241385 in orange; hydrogen bonds
are purple lines B. Schematic representation of ligand-receptor interactions.
Residues from TM6 are also involved in ligand binding. Hereby, asparagine 253 is of critical
importance. The NH2 group of ZM241385 forms a hydrogen bond with the carboxamide carbonyl
10
group of this asparagine. An additional hydrogen bond with the oxygen of the furyl group of
ZM241385 is made, even if it is not clearly observed in all A2AAR-ZM241385 crystal structures. The
furan ring and the triazolotriazine are positioned similarly in all of the three crystal structures. The
furyl moiety points to TM5 and TM7 and makes a hydrophobic interaction with tryptophane 246 at
the bottom of the binding pocket. The 4-hydroxyphenylethyl side chain, which is positioned
towards the extracellular environment, shows higher structural flexibility with respect to the furyl
and triazolotriazine moieties.
Figure 1.6: A. Representation of the A2AAR binding pocket of the 3PWH crystal structure.
Critical residues in ligand binding are colored in yellow, ZM241385 in orange; hydrogen bonds
are purple lines B. Schematic representation of ligand-receptor interactions.
The A2AAR binding cavity is composed of two hydrophobic walls between which the
ZM241385 scaffold is inserted in a well-defined way. The triazolotriazine two-ring π system is
orientated parallel to the one, consisting of phenylalanine 168, and the other, consisting of leucine
249, histidine 250, leucine 267 and methionine 270. A third wall is composed of some hydrophilic
residues, glutamate 169 and asparagine 253, which interact with the amine groups and nitrogen
atoms of the ligand through the formation of hydrogen bonds. Finally, a wall that is made up by
11
some water molecules fills up the cavity upwards to the TM5 helix. These water molecules can
interact with the ligand nitrogens. Also at the roof of the binding pocket, especially hydrophobic
interactions contribute to the binding of the ligand. At the bottom of the binding pocket, a
tryptophane is found. This tryptophane may be involved in receptor activation through a process
referred to as ‘toggle switch’.
Figure 1.7: A. Representation of the A2AAR binding pocket of the 3VG9 crystal structure.
Critical residues in ligand binding are colored in yellow, ZM241385 in orange; hydrogen bonds
are purple lines B. Schematic representation of ligand-receptor interactions.
A fourth crystal structure of the A2AAR in complex with ZM241385 was obtained through
raising a Fab fragment against the intracellular surface of the receptor, as for the 3VG9 crystal
structure. The only difference with the 3VG9 crystal structure is that the 3D structure is revealed
by X-ray diffraction with a lower resolution, 3,1 Å instead of 2,7 Å for the 3VG9 crystal structure.
So, the 3VGA structure is not discussed.
12
1.3.2.5. Adenosine receptor crystal structures in complex with an agonist
In comparison with the many crystal structures of the A2AAR solved in complex with an
antagonist, less crystal structures of the A2AAR in complex with an agonist are elucidated, i.e. three
to date. These are the pdb-deposited crystal structures 2YDO, 3QAK and 2YDV.
The way in which agonists and antagonists are positioned into the A2AAR binding pocket is
quite similar. The physiological agonist adenosine, co-crystallized with the receptor in the 2YDO
crystal structure, is composed of a heterocyclic purine core, the adenine, and a sugar, the ribose.
The hydrophobic adenine is found in almost every A2AAR agonist and is inserted into the binding
site in an analogous manner as the bicyclic core of ZM241385. The contribution of phenylalanine
168 to the hydrophobic interaction between this scaffold and the receptor is essential, as
demonstrated by mutagenesis data. In case the phenylalanine is mutated into an alanine, which is
a small apolar residue, the response observed by binding of the agonist CGS21680 was 63-fold
lower than for the wild-type receptor. Other hydrophobic interactions are provided by leucine 249,
a residue that is found opposite to phenylalanine 168.16,18
Figure 1.8: A. Binding pose of NECA in the 2YDV crystal structure B. Binding pose of
adenosine in the 2YDO crystal structure.
13
The exocyclic nitrogen of the adenine is comparable to that of ZM241385 and binds to
glutamate 169 and asparagine 253 through polar interactions. The hydrophilic ribose moiety,
found in nearly all A2AAR agonists, is orientated in a slightly different way in comparison with the
furan ring of ZM241385. This ribose interacts with threonine 88, serine 277 and histidine 278,
three residues that are located in the bottom of the binding pocket. Activation of the receptor
would be mediated by binding to these residues, since they are not involved in binding
antagonists. The 3’-OH group interacts with both serine 277 and histidine 278, while the 2’-OH
group only interacts with the histidine. In both the NECA and UK432097 co-crystallized A2AARs,
respectively 2YDV and 3QAK, the ligand interacts with threonine 88 through the N2 of the 5’-N-
ethyluronamide group. The bulky substituents on the adenine moiety of UK432097 interact with
residues such as tyrosine 9 and glutamate 169. This results in the high potency of this ligand.11,12,16
14
2. OBJECTIVES
In this work, three objectives are defined, all of them departing from the crystal structure of
the adenosine A2A receptor, the only adenosine receptor of which the crystal structure is
elucidated up to date. The results are achieved using computational methods.
First, we aim at analysing mutagenesis data reported in the literature. The goal of this part is
to understand why mutation of a residue causes a decrease or increase of agonist or antagonist
affinity. In detail, we try to deduct which residues are important in ligand binding, their impact on
ligand binding, and in which way the mutation switches the binding affinity. Even more, these data
may highlight the contribution of certain residues to receptor stabilization. So, explaining
mutagenesis data could be of importance in the future development of A2AAR antagonists or in
refining stabilization methods of the other adenosine receptors to obtain new crystal structures.
Secondly, we try to develop a tool that allows us to distinguish between promising and less
promising A2AAR antagonists. Towards this end we compose a database of 77 compounds with
known affinity (training set). First, we determine which is the best placing algorithm-scoring
function combination. This is carried out by docking ZM241385 on the three crystal structures that
were co-crystallized with ZM241385. Now, the dataset is docked using the placing-scoring
combination that generated the best poses in docking ZM241385. Subsequently, re-scoring of the
dataset is performed with the quantitative structure-activity relationship method and the
developed model is validated using a test set of 10 A2AAR antagonists. The aim of re-scoring is to
obtain an approximation, and not a prediction, of the pKi values of a database with A2AAR
antagonists of interest. Only molecules assigned with a pKi that satisfies the predefined criteria, i.e.
a range in between which the pKi has to lie, are retained. So, the number of compounds that has to
be synthesized in the following stage of research could be restricted. This results in a cheaper and
more simplified development process of drug candidates for the treatment of Parkinson’s disease.
Finally, we use the crystal structure of the A2AAR in complex with adenosine, the
physiological agonist of the adenosine receptor, to generate homology models of the different
adenosine receptor subtypes. This allows us to compare the corresponding residues of the four
subtypes and to detect subtle differences in binding adenosine. The method permits to distinguish
between the binding manner of adenosine in the several subtypes and to synthesize ligands that
are more potent for the one subtype than for the other, so to synthesize selective ligands.
15
3. METHODS
3.1. ANALYSIS OF MUTAGENESIS DATA OF THE A2AAR
The first part of our computational research focuses on comparing the eleven crystal
structures of the A2AAR co-crystallized with a ligand, which is an agonist or antagonist. Therefore,
the receptors are superimposed. Subsequently, the eleven ligands are selected and all the receptor
residues which are situated within a distance of 4,5 Å are specified. When a residue is located in
the environment of one ligand, but not in that of another ligand, that residue also is selected in all
of the other crystal structures. Now, we determine residue per residue, for all the crystal
structures, what the function is of that residue in the receptor. Is the residue involved in ligand
binding? Does it influence the conformation or stability of the receptor through interaction with
other residues? After performing this analysis and making up a database of the results, we try to
interpret the results of previous mutagenesis studies carried out on the A2AAR. When for example
a decrease in agonist affinity is observed when a specific residue is substituted by another more
hydrophilic or lipophilic residue, we try to interpret mutagenesis data according to the position
and the role of the residue observed in the considered crystal structures.
3.2. MAIN PRINCIPLES OF MOLECULAR MECHANICS AND MOLECULAR DOCKING
3.2.1. Molecular mechanics
The molecular mechanics model is a way to describe molecules, ranging from very simple to
quite complex structures, in a more simple way. The method is used in conformational analysis,
molecular dynamics and docking, techniques for which energy calculations are needed. The key
principles of the molecular mechanics are some classical mechanics laws. First, nuclei and
electrons of an atom have to be treated as a unit. Secondly, atoms may be seen as spherical
objects and they have a well-defined charge. Both the sphere radius and the atomic charge are
calculated using quantum mechanics. Finally, classical mechanics laws describe the interactions
between the atoms and these interactions result in defining the molecules geometry and energy.
The interaction between the atoms of a compound are described using a molecular function based
on these previously listed classical mechanics principles. The empirical function of the potential
energy of a compound as function of its atoms coordinates is called the force field. Energy values
that are calculated using this function have no physical meaning, but when the energies are
16
calculated for different conformations, for example obtained by using the docking method, the
relative energies of these different conformations may be compared.
The formula used to calculate the potential energy consists of terms corresponding with
energies, which refer to bonding and nonbonding interaction. Each of the energy terms in formula
(3.1) is the sum of some particular kinds of atomic interactions.
𝐸𝑝𝑜𝑡 = 𝐸𝑠𝑡𝑟 + 𝐸𝑎𝑛𝑔 + 𝐸𝑠𝑡𝑏 + 𝐸𝑜𝑜𝑝 + 𝐸𝑡𝑜𝑟 + 𝐸𝑣𝑑𝑤 + 𝐸𝑒𝑙𝑒 + 𝐸𝑠𝑜𝑙 (3.1)
The bond stretch energy 𝐸𝑠𝑡𝑟 is the sum of the stretch energy of all the bonds i-j and is
calculated by the following equation:
𝐸𝑠𝑡𝑟 = 𝑤𝑠𝑡𝑟 𝑘𝑖𝑗𝑖−𝑗
𝑟𝑖𝑗 − 𝐿𝑖𝑗 2
+ 𝑘𝑖𝑗′ 𝑟𝑖𝑗 − 𝐿𝑖𝑗
3+ 𝑘𝑖𝑗
′′ 𝑟𝑖𝑗 − 𝐿𝑖𝑗 4
(3.2)
Where: 𝑤𝑠𝑡𝑟 : a weight
𝑘𝑖𝑗 , 𝑘𝑖𝑗′ and 𝑘𝑖𝑗
′′ : force constants
𝐿𝑖𝑗 : equilibrium bond length
When the Morse potential is expanded in a Taylor series about the equilibrium bond length
and truncated to the quartic term, it gives a good approximation of the bond stretch energy. The
linear term is zero at equilibrium and the constant term is negligible. The bond angle energy 𝐸𝑎𝑛𝑔
extends over all bond angles i-j-k and is given by the following equation:
𝐸𝑎𝑛𝑔 = 𝑤𝑎𝑛𝑔 𝑘𝑖𝑗𝑘 𝑑𝑖𝑗𝑘2
𝑖−𝑗−𝑘
+ 𝑘𝑖𝑗𝑘′ 𝑑𝑖𝑗𝑘
3 + 𝑘𝑖𝑗𝑘′′ 𝑑𝑖𝑗𝑘
4 (3.3)
Where: 𝑤𝑎𝑛𝑔 : a weight
𝑘𝑖𝑗 , 𝑘𝑖𝑗′ and 𝑘𝑖𝑗
′′ : force constants
17
Depending on the functional form chosen for the angle interactions, 𝑑𝑖𝑗𝑘 is either 𝑎𝑖𝑗𝑘 − 𝐴𝑖𝑗𝑘
(angle form) or cos 𝑎𝑖𝑗𝑘 − cos𝐴𝑖𝑗𝑘 (cosine form). 𝐴𝑖𝑗𝑘 is the equilibrium bond angle (in radians)
and 𝑎𝑖𝑗𝑘 is given by the following equation:
𝑎𝑖𝑗𝑘 =𝑥𝑗𝑖𝑇𝑥𝑗𝑘
𝑟𝑗𝑖 𝑟𝑗𝑘 , 𝑥𝑗𝑖 = 𝑥𝑗 − 𝑥𝑖 (3.4)
For sp-hybridized compounds, the quartic expression is replaced by 𝑘𝑖𝑗𝑘 (1 + cos 𝑎𝑖𝑗𝑘 ), for
octahedral and square-planar geometries 𝑘𝑖𝑗𝑘 (1 − cos(4𝑎𝑖𝑗𝑘 )) is used. The stretch-bend energy
extends over all bond angles i-j-k, except of (near-)linear equilibrium angles and is given by formula
(3.5):
𝐸𝑠𝑡𝑏 = 𝑤𝑠𝑡𝑏 𝑘𝑖𝑗𝑘 𝑟𝑖𝑗 − 𝐿𝑖𝑗 + 𝑘𝑘𝑗𝑖 𝑟𝑗𝑘 − 𝐿𝑗𝑘 𝑑𝑖𝑗𝑘𝑖−𝑗−𝑘
(3.5)
Where: 𝑤𝑠𝑡𝑏 : a weight
𝑘𝑖𝑗𝑘 and 𝑘𝑘𝑗𝑖 : stretch-bend force constants
Other terms are similar to those in equation (2.2) and (2.3)
The out-of-plane energy is a sum which extends over all atoms I with three neighbours j, k
and l and is given by equation (3.6):
𝐸𝑜𝑜𝑝 = 𝑤𝑜𝑜𝑝 𝑘𝑖;𝑗𝑘𝑙𝑖;𝑗𝑘𝑙
𝑋𝑖;𝑗𝑘𝑙2 (3.6)
Where 𝑤𝑜𝑜𝑝 : a weight
𝑘𝑖;𝑗𝑘𝑙 : the out-of-plane force constant
𝑋𝑖;𝑗𝑘𝑙 : the Wilson angle (the angle between the bond il and the plane ijk)
18
Depending of the force field used to calculate the 𝐸𝑜𝑜𝑝 , the 𝐸𝑜𝑜𝑝 equation can be the same
as that of the torsion energy. Then, the only difference is that instead of using four consecutive
bonded atoms i-j-k-l, the sum extends over atoms i with three bonded neighbours j, k and l. The
torsion energy Etor is a sum that extends over all sets of i-j-k-l atoms bonded in sequence:
𝐸𝑡𝑜𝑟 = 𝑤𝑡𝑜𝑟 𝑘𝑛 ;𝑖𝑗𝑘𝑙 cos 𝑛𝑇𝑖𝑗𝑘𝑙 − 𝑃𝑛 ;𝑖𝑗𝑘𝑙 (3.7)
6
𝑛=0𝑖−𝑗−𝑘−𝑙
Where: 𝑤𝑡𝑜𝑟 : a weight
𝑘𝑛 ;𝑖𝑗𝑘𝑙 : force constants
𝑃𝑛 ;𝑖𝑗𝑘𝑙 : phase angle parameters
𝑇𝑖𝑗𝑘𝑙 =(𝑥𝑗𝑖 𝑥𝑗𝑘 )𝑇(𝑥𝑗𝑘 𝑥𝑙𝑘 )
𝑥𝑗𝑖 𝑥𝑗𝑘 𝑥𝑗𝑘 𝑥𝑙𝑘 , 𝑥𝑖𝑗 = 𝑥𝑗 − 𝑥𝑖
The van der Waals energy 𝐸𝑣𝑑𝑤 is given by the following equation:
𝐸𝑣𝑑𝑤 = 𝑤𝑣𝑑𝑤 𝑒𝑖𝑗 1 + 𝑎 𝑅𝑖𝑗
𝑟𝑖𝑗 + 𝑎𝑅𝑖𝑗
𝑛𝑖𝑗
𝑛𝑖𝑗
𝑚𝑖𝑗
1 + 𝑏 𝑅𝑖𝑗
𝑚 𝑖𝑗
𝑅𝑖𝑗
𝑚 𝑖𝑗 + 𝑏𝑅𝑖𝑗
𝑚 𝑖𝑗−𝑚𝑖𝑗 + 𝑛𝑖𝑗
𝑚𝑖𝑗 𝑠 𝑟𝑖𝑗 𝑇𝑖𝑗 𝑙𝑖𝑗
𝑣𝑑𝑤 (3.8)
𝑖<𝑗
Where: 𝑤𝑣𝑑𝑤 : a weight
𝑒𝑖𝑗 , 𝑅𝑖𝑗 , 𝑚𝑖𝑗 and 𝑛𝑖𝑗 : force field parameters
𝑎, 𝑏: buffering constants
𝑠 𝑟 =
1 𝑖𝑓 𝑟 < 𝑟0
1 − 𝑝 𝑟 − 𝑟0
𝑟1 − 𝑟0 𝑖𝑓 𝑟 ∈ 𝑟0, 𝑟1
0 𝑖𝑓 𝑟 > 𝑟1
(3.9)
Where: 𝑝 𝑥 = 𝑥3(6𝑥2 − 15𝑥 + 10)
By setting the cut-off parameters 𝑟0 and 𝑟1, a variety of
smooth tapering functions that are continuous in both their
first and second derivatives can be created.
19
𝑙𝑣𝑑𝑤 : interaction scale factor (0 for 1-2 and 1-3 interactions, a parameter
set-dependent scale value for 1-4 interactions and 1 for other interactions)
and 𝑇𝑖𝑗 : factor used to scale particular non-bonded interactions.
𝑇𝑖𝑗
𝑇𝑙𝑖𝑘𝑒 𝑖𝑓 𝑇𝑖 = 𝑇𝑗
𝑇𝑤𝑖𝑙𝑑 𝑖𝑓 𝑇𝑖 ≠ 𝑇𝑗 𝑎𝑛𝑑 𝑇𝑖 = 0 𝑜𝑟 𝑇𝑗 = 0
𝑇𝑢𝑛𝑙𝑖𝑘𝑒 𝑜𝑡𝑒𝑟𝑤𝑖𝑠𝑒
(3.10)
Where: 𝑇𝑙𝑖𝑘𝑒 , 𝑇𝑢𝑛𝑙𝑖𝑘𝑒 and 𝑇𝑤𝑖𝑙𝑑 : state parameters
The electrostatics energy Eele is given by the following equation:
𝐸𝑒𝑙𝑒 =
𝑤𝑒𝑙𝑒 𝑒
2
4𝜋휀0𝑑 𝑞𝑖𝑞𝑗
1
𝑟𝑖𝑗 +𝑏𝑒𝑙𝑒 𝑠(𝑟𝑖𝑗 )𝑇𝑖𝑗 𝐼𝑖𝑗
𝑒𝑙𝑒𝑖<𝑗 𝐶𝑜𝑢𝑙𝑜𝑚𝑏
𝑤𝑒𝑙𝑒 𝑒2
4𝜋휀0𝑑 𝑞𝑖𝑞𝑗
1
(𝑟𝑖𝑗 +𝑏𝑒𝑙𝑒 )2 𝑠(𝑟𝑖𝑗 )𝑇𝑖𝑗 𝐼𝑖𝑗𝑒𝑙𝑒
𝑖<𝑗 𝐷𝑖𝑠𝑡𝑎𝑛𝑐𝑒 𝑑𝑒𝑝𝑒𝑛𝑑𝑒𝑛𝑡 𝑑𝑖𝑒𝑙𝑒𝑐𝑡𝑟𝑖𝑐
𝑤𝑒𝑙𝑒 𝑒2
4𝜋휀0𝑑 𝑞𝑖𝑞𝑗
1
𝑟𝑖𝑗 +𝑏𝑒𝑙𝑒−
𝛼𝑟𝑖𝑗2
𝑅𝑐3 −
(1−𝛼)
𝑅𝑐 𝑠(𝑟𝑖𝑗 )𝑇𝑖𝑗 𝐼𝑖𝑗
𝑒𝑙𝑒𝑖<𝑗 𝑅𝑒𝑎𝑐𝑡𝑖𝑜𝑛 𝑓𝑖𝑒𝑙𝑑,𝛼 =
𝑑−𝑑𝑥
𝑑+2𝑑𝑥
Where: 𝑤𝑒𝑙𝑒 : a weight
𝑑: the dielectric constant in the interior of the solute
𝑑𝑥 : the dielectric constant of the solvent
𝑠 and 𝑇: equations (2.9) and (2.10)
𝑞𝑖 : the partial charge on atom i
𝑏𝑒𝑙𝑒 : a buffering constant to prevent zero denominators
𝐼𝑒𝑙𝑒 : similar to 𝑙𝑣𝑑𝑤
Esol is the implicit solvation energy calculated using the Generalized Born model (GB/VI):
𝐸𝑠𝑜𝑙 = −𝑤𝑠𝑜𝑙𝑊 𝑑−1 − 𝑑𝑥−1
𝑒2
4𝜋휀0
1
2
𝑞𝑖𝑞𝑗 𝐺𝑖𝐺𝑗
𝑦𝑖𝑗 +exp −𝑦𝑖𝑗
4
𝑛𝑗=1
𝑛𝑖=1 𝑠 𝑟𝑖𝑗 𝑇𝑖𝑗 , 𝑦𝑖𝑗 = 𝑟𝑖𝑗
2𝐺𝑖𝐺𝑗
Where: wsol and W: weights
(3.11)
(3.12)
20
d and dx: given in (3.11)
s and T: equations (3.9) and (3.10)
Gi is the self energy of atom i and is defined to be
𝐺𝑖 = −1
2
1
𝑅𝑖3 − 𝑉𝑖
1
3
, 𝑉𝑖 ≈ 𝑥 − 𝑥𝑖 −6𝛿 𝑥 − 𝑥𝑖 > 𝑅𝑖 𝑑𝑥 (3.13)
𝑥∈𝑠𝑜𝑙𝑢𝑡𝑒
Where Ri: the solvent model radius of atom i
Vi is an approximation to the Born Integral
Another important concept of molecular mechanics is that of the atom types. The atoms are
considered as charged spheres and they are characterized by the chemical situation in which they
are found. For example, the AMBER force field defines five different atom types for oxygen: a
carbonyl oxygen (O), alcohol oxygen (OH), carboxylic acid or phosphate oxygen (O2), ester or ether
oxygen (OS) and water oxygen (OW). In order to make the energy calculations, different force fields
are used for both small organic molecules and biological macromolecules. We use the Merck
Molecular Force Field (MMFF) for the organic molecules and the Assisted Model Building with
Energy Refinement (AMBER) force field for the receptor.
3.2.2. Molecular Docking
The aim of docking is to find the binding geometry of two interacting molecules. We
distinguish two different types of docking, local and global docking. In local docking the binding
site in the receptor is known. The ligand has to find its position in the known binding pocket. In
global docking the binding site is unknown, so here we have to search for the binding site and the
position of the ligand in the binding pocket, which can be performed sequentially or
simultaneously. During docking simulation, the receptor is usually kept rigid, whereas for the
ligand flexibility is allowed. This docking strategy is called semi-flexible docking. The number of
degrees of freedom is 6 + Nfree. The goal of the testing docking method is to reproduce a known
complex, for example the A2AAR – ZM241385 complex as observed from the experimental crystal
structure data. The starting structures for both the receptor and the ligand are taken from the
minimized crystal structure.
21
In docking, 2 factors are of major importance. First, there is the algorithm that determines
the conformation of the ligand in the receptor. Second, there is the scoring function that scores the
different conformations that were generated by the algorithm. Once all possible docking
conformations are generated, a guide criterion or score is needed, which allows us to define which
of these conformations is the most energetically stable and can probably represent the real
binding situation. There are basically two major methods to assign a score during docking protocol,
energy score and grid score.
The energy score of a ligand within the binding site can be calculated by using molecular
mechanics force field. The calculated interaction energy values are usually negative to indicate a
stabilization of the complex with respect to the isolate ligand and target, and the absolute value
indicates the stabilization degree of the system. These values don’t have a thermodynamic
meaning as they are derived using a mechanic force field. The grid score method is based on the
calculation of electrostatic interaction energy between the ligand and the binding site. To calculate
such energy, the binding site is enclosed in a 3D grid with regular intervals.
3.2.2.1 Retrieval and analysis of A2AAR crystal structures as docking targets
3.2.2.1.1 Downloading the crystal structure
Crystal structures of biological macromolecules, such as proteins and nucleic acids, can be
found in the protein data bank (PDB). Searching for the A2AAR results in a number of crystal
structures, from which we download only the files containing the crystal structure of the A2AAR in
complex with the antagonist ZM241385. The higher the resolution that the crystal structure is
obtained with, the more accurate the crystal structure will be. So, the resolution indicates the
quality of the crystallographic data. Subsequently, protein data obtained from the PDB have to be
corrected for errors and prepared for further computational research.
3.2.2.1.2. Correcting for mistakes
A major problem of solving the crystal structures of macromolecules is the presence of data-
related issues such as missing atoms or poorly resolved atomic data. Mistakes in the bonding
patterns of co-crystallized ligands are another issue. Bonding orders are predicted from the
interatomic distances and angles of heavy atoms, so flexible regions complicate inferring the
bonding pattern. Also knowledge of hydrogen atoms, which are difficult to be located, is of major
22
importance in deriving the bond orders of non-amino acids. So, the inspection of the correctness
of bond orders, charges and protonation states is an important step in preparing the crystal
structures for further analysis. Another characteristic that is of great importance is the geometric
quality of the protein. Bond lengths, bond angles and torsion angles are checked for the entire
receptor. For example, a Ramachandran phi-psi plot may highlight structural problems. In order to
solve these problems, the outliers in the Ramachandran dihedral plot are selected and an energy
minimization is carried out for these residues. So, strain can be maneuvered away and the
geometric quality is restored. Furthermore, hydrogen atoms have to be added to the crystal
structure, because performing X-ray crystallography leads to a crystal structure for which
information about hydrogens is not available. Macromolecular protonation can be obtained by
using the Protonate 3D application of MOE.
3.2.2.1.3. Replacing missing data
When this first step of preparing the protein-ligand complex for docking simulation is
completed, the primary sequence of the human adenosine A2A receptor is downloaded from the
Universal Protein Resource database (Uniprot database; http://www.uniprot.org). With the aid of
the primary sequence, missing structural data can be corrected. Sometimes, atoms cannot be
resolved, regions of the protein are to flexible or regions are invisible for the X-ray diffraction
technique. To fill in the missing residues, the Homology Model application of MOE is used. The
original human sequence, downloaded from the Uniprot database is used as target to make up this
self-homology model. The force field parameterized for proteins, AMBER99, is selected for scoring
the obtained models. Twenty-five different models are generated by the application and
subsequently scored by the AMBER99 force field. The top-scored model will be loaded in MOE.
After completing the crystal structure, a minimization of the restored residues is performed to
eliminate the strain that may be present because no refinement step was carried out.
3.2.2.1.4. Minimizing the energy
Errors in the ZM241385 and the receptor structure are corrected, missing residues inserted
and hydrogens added, so a next step in preparing the protein-ligand complex may be carried out.
The partial charges of the structure are calculated and a molecular mechanics refinement in the
form of a tethered energy minimization of the receptor in combination with the ligand is
performed.
23
3.2.2.2. Docking analysis
Once the receptor and the ligand are prepared for further computational analysis, the ligand
is docked into the binding pocket. Figure 2.1 provides a schematic overview of the different steps
of the docking methodology. Step 1 (Figure 3.1) is not carried out in our case. We use the
conformation of the ligand as it appears in the crystal structure. The algorithm or placing method
positions the ligand in various ways in the binding pocket (step 2, Fig 3.1). All the poses obtained
by the placing method in step 2 are scored by both the algorithm itself and other scoring functions.
The best poses are assigned the lowest (most negative) scores. Subsequently, a refinement step
(step 4, Fig 3.1) is performed using force field or the grid-based energetics method. A
pharmacophore constraint may be defined to filter the conformations. Only conformations that
satisfy the pharmacophore model are then retained. The refined poses are rescored using the
same scoring method as the one used in step 3. Finally the top scoring positions are output in a
database, subjected to optional duplicates removal. Are the placement methods of the software
(MOE) able to reproduce the binding position of the ligand as seen in the crystal structure? Are the
scores assigned by the scoring methods within our expectations, i.e. are the best poses assigned
the lowest values? We will evaluate the different placing methods and scoring functions in their
ability to generate good poses and to assign good scores. First, the ligand ZM241385 is removed
from the minimized crystal structure and the conformation of the 4-hydroxyphenylethyl side chain
is slightly changed. Also water molecules present in the receptor are removed. The partial charges
of both the receptor and the ligand are set. For the receptor, a macromolecule, the AMBER99 force
field is used, while the MMFF94 force field is used for the ligand. The poses of ZM241385 are
generated by different placement methods: Alpha Triangle, Alpha PMI, Proxy Triangle and Triangle
Matcher. Using the Alpha Triangle method, ligand atom triplets are superposed to triplets of
receptor site points, which are alpha sphere centers. Each pose is generated by selecting a random
conformation, a random triplet of ligand atoms and a random triplet of alpha sphere centers. The
Alpha PMI method is fast and appropriate for small binding pockets. Here, the principal moments
of inertia of the ligand conformation are aligned to a randomly generated subset of alpha spheres
in the receptor site to generate poses. The Proxy Triangle method is able to position larger ligands,
which may occur in a large number of conformations. Before placing the ligands in the binding
pocket, conformers are superposed to save computational time. The Triangle Matcher method
aligns ligand atom triplets on triplets of receptor site points, as in the Alpha Triangle method, but
in a more systematic way.
24
Figure 3.1: Stages in the Dock Algorithm
Once the ligand is positioned with different orientations into the binding pocket, poses are
scored. The lower is the score, the more favorable is the pose of the ligand. Four scoring functions
are used: ASE Scoring, Affinity dG Scoring, Alpha HB Scoring and London dG Scoring.
The score using the ASE Scoring method is obtained by calculating the sum of the Gaussians
(0,035kcal/mol)R1R2exp(-0,5d2) over all ligand atom-receptor atom pairs and ligand atom-alpha
sphere pairs. The radii, R1 and R2, are expressed in Å; for alpha spheres R is -1,85. The distance d
between the atoms or alpha spheres also is expressed in Å.
The Affinity dG Scoring function uses a linear function, which is given in (3.14), to estimate
the enthalpic contribution to the free energy of binding.
𝐺 = 𝐶𝑏𝑓𝑏 + 𝐶𝑖𝑜𝑛𝑓𝑖𝑜𝑛 + 𝐶𝑚𝑙𝑖𝑔 𝑓𝑚𝑙𝑖𝑔 + 𝐶𝑓 + 𝐶𝑝𝑓𝑝 + 𝐶𝑎𝑎𝑓𝑎𝑎 (3.14)
Where: Cx: weight coefficient
fx: value that represents the atomic contacts of a specific type
x: interaction type; hb = hydrogen bond; ion = ionic interaction; mlig = metal
ligation; hh = hydrophobic interaction; hp = interaction between hydrophobic
and polar atoms; aa = interaction between any two atoms
25
The score obtained by the Alpha HB Scoring function is a linear combination of two terms.
The first term is a measurement of the geometric fit of the ligand to the binding pocket, which is
composed of an attractive and a repulsive part. Each ligand atom that is within 3 Å of an alpha
sphere center contributes -0,6845exp(-0,5d2) to the attractive part, where d is the distance
between the ligand atom and the nearest sphere center. When atomic overlap between ligand and
receptor atoms is observed, this pair contributes to the repulsive part of the first term. The value
of this contribution varies between 0 and 1 depending on the severity of the overlap. Hydrogen
bonding effects are represented by the second term. If a non-sp3 donor or acceptor is present,
hydrogen bonding sites are projected from the atom. When the projection site contains a receptor
atom that is favorable for hydrogen bonding, a score of -2 is assigned. When the projection site
contains any other atom, a score of +1 is assigned. If a sp3 donor or acceptor is present, favorable
atoms within 3,5 Å contribute a score of -1 while all other atoms contribute +1. Metals are seen as
acceptors and their contribution is multiplied with three. Both parts are summed over all the
atoms in the ligand.
As the Affinity dG Scoring function, the London dG Scoring function uses equation (3.15) to
estimate the free energy of binding of the ligand.
∆𝐺 = 𝑐 + 𝐸𝑓𝑙𝑒𝑥 + 𝑐𝐻𝐵𝑓𝐻𝐵 + 𝑐𝑀𝑓𝑀 + ∆𝐷𝑖 (3.15)
𝑎𝑡𝑜𝑚𝑠 𝑖𝑚−𝑙𝑖𝑔−𝑏𝑜𝑛𝑑𝑠
Where: ΔG: free energy of binding
C: average gain/loss of rotational and translational entropy
Eflex: energy due to the loss of flexibility of the ligand
fHB: value that represents geometric imperfections of hydrogen bonds
CHB: energy of an ideal hydrogen bond
fM: value that represents geometric imperfections of metal ligations
CM: energy of an ideal metal ligation
Di: desolvation energy of atom i
The difference in desolvation energies is calculated according to formula (3.16).
26
∆𝐷𝑖 = 𝑐𝑖𝑅𝑖3 𝑢 −6𝑑𝑢 −
𝑢∉𝐴 𝐵
𝑢 −6𝑑𝑢
𝑢∉𝐵
(3.16)
Whereby: A: volume of the protein
B: volume of the ligand
i: atom belonging to volume B
Ri: solvation radius of atom i
Ci: desolvation coefficient of atom i
After positioning the ligand in the binding pocket and scoring the obtained poses with one of
the four scoring functions, a refinement step is performed. In fact, the refinement step is a method
to minimize the energy of the system. Two methods are available: force field refinement and
gridmin refinement. The force field refinement scheme is more time-consuming, but also more
accurate than the gridmin scheme.
An additional filtering tool is the insertion of a pharmacophore query during docking
analysis. The IUPAC definition of a pharmacophore is the following: "A pharmacophore is an
ensemble of steric and electronic features that is necessary to ensure the optimal supramolecular
interactions with a specific biological target and to trigger (or block) its biological response".19 So, a
pharmacophore is a 3D model that describes both the types and locations of ligand-receptor
interactions. In MOE, a pharmacophore is represented by a pharmacophore query that is
composed of a number of query features each one assigned of a tolerance radius. When a
pharmacophore query is specified, matches are retained; poses that do not satisfy the
pharmacophore are eliminated. The restraints set are used during both the refinement stage and
the pharmacophore-based filtering stage.
Subsequently, duplication removal is performed. This method does not use the conventional
rooth mean square deviation (RMSD) cutoff approach to exclude duplicates, but it considers the
hydrogen bonds and hydrophobic interactions between ZM241385 and the A2AAR. If the distance
between appropriate ligand-receptor atom types is less than 3,5 Å, it is supposed that a hydrogen
bond is present. Hydrophobic interactions between appropriate ligand atom types-receptor
residues are considered when the distance is less than 4,5 Å. If the set of ligand-receptor atoms
27
that are involved in hydrogen bonds and the set of ligand atoms-receptor residues that are
involved in hydrophobic interactions are the same for both the observed poses, the poses are
considered as duplicates.
Finally, the results are output in a database, which lists up only the top scoring poses. Firstly,
docking is performed for the 3EML, 3PWH and 3VG9 crystal structures. Sixteen different
combinations of algorithm-scoring function are tested on their ability to generate good poses and
to assign good scores. Secondarily, these sixteen combinations are tested again, but now with the
use of a pharmacophore query. After performing docking, the results of each algorithm-scoring
function combination with and without pharmacophore constraint are subjected to a data-
analysis. In addition to the RMSD of the total conformation, the RMSD of the scaffold, which is an
estimation of the distortion of the bicyclic scaffold in Å, is calculated. The ten best poses, which are
given in the database, are evaluated on both the RMSDtotal_conf and RMSDscaffold. The cut-off value
for the RMSDtotal_conf is set at 2 Å, while the value for the RMSDscaffold is set at 1 Å. Results are drawn
from this data-analysis. The two crystal structures, the three scoring functions and the placing
algorithm that lead to the best results are used to carry out the docking of a dataset of antagonists
in the next step of the research.
3.2.2.3. Post-docking analysis: Quantitative structure-activity relationship
The biological activity of a ligand is a function of the structure as described by its
physicochemical and structural properties. More than a century ago, Meyer and Overton observed
independently of each other that the depressant action of a group of organic compounds increases
linearly with their oil/water partition coefficients. The initial postulate of quantitative structure-
activity relationship states “activity was a function of structure as described by electronic
attributes, hydrophobicity, and steric properties”.20
The objective of quantitative structure-activity relationship (QSAR) is to search for
relationships between the values of some descriptors and the known experimental data of a set of
chemical entities that is referred to as training set. This results in a numerical model, a linear
regression line, on the basis of which a dataset of compounds for which the binding data are not
known, the so-called test set, may be evaluated. The results of the numerical model may not be
seen as a predicted value for the biological activity, but the goal is rather to distinguish between
acceptable and unsatisfactory values. In this way one can decide which compounds have to be
28
synthesized and submitted to subsequent testing procedures, among others in vitro and in vivo
studies.
The experimental data may be biological parameters, such as the Ki or Kd, physical property
measurements or high throughput screening results. For each compound of the training set, a
numerical representation is constructed. This representation is provided by molecular descriptors,
quantities that depend on the molecule. Three classes of numerical descriptors can be
distinguished. 2D descriptors only contain information about the atoms and connections present in
the compound, while 3D coordinates and conformations are not taken into account. The i3D or
internal 3D descriptors contain information about the 3D coordinates of each compound, but
information about rotations and translations of the individual conformations is not considered. On
the other hand, x3D or external 3D descriptors consider the 3D coordinate information. For the
calculation of these descriptors, an absolute frame of reference is needed, for example the binding
pocket of the A2AAR.
The obtained model is a numerical formula of which the parameters are estimated using the
experimental data and the descriptors of the molecules of the training set. The experimental
results of linear regression models are expressed as a linear combination of the molecular
descriptors and a constant. Once a satisfactory model has been built, it can be used to predict
experimental data for new molecules. As previously mentioned, these predicted experimental
values are only estimations and are used only to distinguish between compounds which are useful
or not useful for further research.
𝑌𝑖 = 𝐴 + 𝐶1𝑃11 + 𝐶2𝑃21 + 𝐶3𝑃31 + ⋯ (3.17)
Where: 𝑌𝑖 : predicted experimental result for compound i
𝐴: constant
𝐶𝑖 : parameterized coefficient of a descriptor Pj
𝑃𝑖𝑗 : value of a descriptor j for compound i
Because usually a linear correlation is observed between response and log dose in the mid
region of the log dose-response curve, the biological endpoints are transformed to a logarithmic
29
scale. The use of inverse logarithms leads to higher values for more effective compounds. It’s also
of great importance that the experimental data of the training set are spread over a wide range of
values in order to generate a model with a high predictive power. Of course, the interest of
researchers goes to the development of compounds with high activity, but also the compounds
which are of lower interest have to be included in the data set.20
3.2.2.3.1. QSAR study of A2AAR antagonists
After docking ZM241385 in the three crystal structures a database of seventy-seven A2AAR
antagonists is composed. The pKi values of these ligands are very different, ranging from 4.63 to
9.40. The pKi is the negative logarithm of the Ki, a value that designates the potential of the
antagonist to bind the receptor instead of the bound agonist. In fact, the Ki is the concentration of
antagonist needed to remove half of the agonist concentration initially bound to the receptor. So,
the lower the Ki, the higher is the affinity of the antagonist for the receptor.
The database of antagonists is docked in the two crystal structures that led to the best
results in the previous docking study. The three best scoring functions and the best placing method
are used for docking. Some of the seventy-seven A2AAR antagonists can be protonated. For this
reason a second database is created in which structures are depicted like they should behave in an
environment of pH 7.4. So, the situation in the body is simulated. Docking is performed using both
the force field refinement and gridmin refinement. The placing method-scoring function
combination that leads to the best results is used for the QSAR study; both 3EML and 3VG9,
gridmin and force field, neutral and protonated are considered.
For each of the seventy-seven compounds that are docked, the best ten poses are retained in
a database. A selection of descriptors is calculated for all of these ligand poses. One of these
descriptors is the Eoop, a value that indicates the distortion of the scaffold. Only compounds with
an Eoop lower than three are retained. Five other descriptors and the known biological activities of
the seventy-seven A2AAR antagonists of the training set are used to estimate a QSAR model with
the partial least square (PLS) method. For each compound pred and xpred values are calculated
with the use of the obtained model. For each compound ten poses were obtained through
docking, these ten poses are assigned a pred and xpred value, only the one with the best xpred
value is selected for each compound. Now the xpred values are plotted against the pKi values for
all of the compounds left of the training set. The most significant outliers of the plot are removed
30
and the regression line is re-predicted by parameterizing using the molecules of the first prediction
but without the outliers.
Finally, a test set of ten A2AAR antagonists is composed. The obtained model is validated by
applying it to the test set. The ten compounds of the test set are first docked and pred values are
calculated for all ten poses of each compound. The best pose out of ten is selected for each
compound and pred values are plotted against the pKi. This will tell if the model is able to predict
which are good or bad compounds.
3.3. HOMOLOGY MODELING
When there is no three-dimensional structure available for a protein, its 3D structure can be
built using the homology modeling technique. As refolding of denatured proteins into their native
3D structure is possible in particular conditions, it means that the primary structure of a protein
contains information related to the secondary and tertiary structure. So, usually the less sequence
similarity between two proteins, the less similarity will be observed between their secondary and
tertiary structure. Homology modeling is based on this principle.
The homology modeling technique analyses structural similarity between proteins and
subsequently builds the 3D structure of a protein using the 3D structure of a template. In the
search for a suitable template it is of great importance to know the biological function of the target
protein. Since we wish to obtain a model for three subtypes of the adenosine receptor, the A1AR,
A2BAR and A3AR, it is logical to use the homologue A2AAR, the only crystallized adenosine receptor,
as template. The primary structure of the three target proteins is set in space using the A2AAR as
template. The obtained 3D structure is refined using molecular mechanics or molecular dynamics.
The first stage in our homology modeling procedure is aligning the primary structure of the
protein that has to be modeled and the template protein. Therefore, the relationship between
both sequences is determined with the aim at obtaining a maximum similarity degree. The global
and local alignment approaches are the two available methods to align sequences. Global
alignment considers each amino acid of both sequences; local alignment aims at finding the
subsequence of one strand that shows the highest similarity with a subsequence of the other
strand. In both global and local alignment, both sequence identity and similarity have to be
considered in the evaluation of the alignment. The sequence identity is the number of identical
residues found at the same relative position after alignment, while sequence similarity is the
31
number of residues with comparable physicochemical properties found at the same relative
position.
As previously mentioned, the template used is the A2AAR, more specifically the 2YDO crystal
structure. Here, adenosine, the physiological agonist of the adenosine receptor, is co-crystallized
with the receptor. The models are built in the presence of the ligand and co-crystallized water
molecules. The following computational steps are performed with the aim at obtaining a homology
model. The sequence of the receptor subtype that has to be modeled is aligned to the
corresponding template chain. Subsequently, it is checked for insertions and deletions. In the case
of insertions, there are more residues for the model sequence with respect to the template. In the
case of deletions, there are more residues in the template. Water molecules, which are located in
the neighborhood of intracellular loop 4, are deleted because they are not involved in ligand
binding and a homology model is made up by the MOE homology model function. Once the model
is built, it is refined through minimization and geometry check. Finally, we compare the binding
pose of adenosine in the four obtained models.
32
4. RESULTS AND DISCUSSION
The results are represented and discussed in different sections, according to the three aims
of this work: the analysis of mutagenesis data of the A2AAR, the QSAR analysis of docking results
and the study of self-made homology models.
4.1. ANALYSIS OF MUTAGENESIS DATA OF THE A2AAR
We compared the eleven A2AAR crystal structures solved to date. All the receptor residues
that are within a distance of 4,5 Å of one of the eleven ligands are listed in Table 3.1. Most of
them are similar for both agonist and antagonist bound crystal structures, but some are typically
found within a distance of 4,5 Å only in the agonist or in the antagonist bound crystal structures.
For each of the residues listed in Table 4.1, its involvement in ligand-receptor interactions and the
interaction energy values between residue and ligand are given. These values are calculated with a
force field, so they may be seen as relative and not absolute values. When mutagenesis data are
available for a residue that is listed in the table, we try to interpret these data according to the
position and the role of the residue observed in the considered crystal structures. When we go
through the A2AAR receptor sequence, a first residue for which mutagenesis data are known is
glutamate 13. Here both agonist and antagonist affinity are decreased when mutated into a
glutamine. When mutated, this residue cannot interact with histidine 278 at the expense of
stabilization. So, this could be an explanation for the decrease in affinity caused by the mutation
into glutamine. For threonine 88, a mutation into alanine, arginine or serine leads to a decrease of
affinity for both agonists and antagonists, probably due to the fact that the threonine doesn’t
stabilize the receptor conformation anymore. Normally, the threonine interacts with one of the
scaffold nitrogens of agonists, but when substituted by aspartic or glutamic acid, this interaction is
not present, so agonist affinity decreases. Mutation of glutamine 89 into alanine or aspartic acid
leads to an increased affinity. This is difficult to understand, certainly because also mutation into
alanine results in an increased affinity. Also the shift in affinity in case of the mutation of glutamate
151 and 161 is not easy to explain. Both these residues are located in EL2 and don’t interact with
the ligand. The loss of affinity that is observed when glutamate 169 is mutated into alanine is easy
to understand. The glutamate interacts with both agonists and antagonists through the formation
of hydrogen bonds. However, the alanine is not able anymore to undergo these interactions, so
loss of affinity is logically explainable. However, in case of mutation of the glutamate into a
glutamine, no variation in binding affinity is observed. The side chain of glutamine is similar to that
33
of glutamate, but it is not electrically charged. So, glutamine is still able to form the hydrogen
bonds seen in presence of a glutamate. Mutation of asparagine 181 in a serine results in a reduced
agonist affinity. A possible reason for this shift in affinity is that the presence of a hydrogen bond
network is important in agonist binding. Mutation of the asparagine breaks the network and
agonist affinity consequently is diminished. When phenylalanine 182 is mutated into alanine, loss
of affinity is observed for both agonists and antagonists. This could be explained in the following
way: the ring scaffold of the adenosine receptor ligands is inserted in a hydrophobic, aromatic
binding pocket. Among others phenylalanine is part of this hydrophobic cluster. When mutated by
an alanine, the hydrophobicity of this cluster decreases and so does the binding affinity. Even
when mutated in other hydrophobic residues with an aromatic side chain, affinity decreases.
Tryptophane and tyrosine possess more bulky side chains what possibly results in a distortion of
the other residues located in the hydrophobic binding pocket. The next residue for which
mutagenesis data are known is histidine 250. This residue interacts with asparagine 181 and 253
through hydrogen bond formation with a water molecule and keeps asparagine 253 in a
conformation needed for ligand binding. When mutated into a hydrophobic residue, the
asparagine loses its appropriate conformation and affinity for both agonists and antagonists
decreases. Mutation of asparagine 253 into alanine results in decrease of affinity, which is easy to
explain. Asparagine is directly involved in ligand binding, where alanine doesn’t give the possibility
to form a hydrogen bond. The reason of loss of affinity when phenylalanine 257 is substituted by
alanine is the same as for mutation of phenylalanine 182. Also isoleucine 274 makes part of the
hydrophobic binding pocket. When this residue is mutated, interaction of the hydrophobic scaffold
is more difficult and affinity for both agonists and antagonists is decreased. Serine 277 interacts
with agonists through the formation of hydrogen bonds with the ribose group of agonists. So,
mutation affects only the affinity for agonists. The positively charged side chain of histidine 278
normally interacts with the negatively charged glutamate 13, which stabilizes the receptor
conformation. When histidine is substituted by an alanine, interaction is impossible and ligand
affinity decreases. When mutated by an aspartate or a glutamate, affinity is not influenced. Here,
the possibility exists that the presence of positively charged ion, such as Na+ or Mg2+, still allows
the interaction. A last residue for which mutagenesis data are available is serine 281, a residue that
interacts with histidine 278 in a hydrogen bond network. This histidine interacts with the hydroxyl
groups of agonists, but when the network is disturbed, the interaction may not take place
anymore.
34
Table 4.1: List of A2AAR residues. Each listed residue is located within a distance of 4,5 Å of at least one of the eleven co-crystallized ligands.
Mutagenesis data are represented if available. O: no interaction ligand-receptor; X: residue not in environment (4,5 Å) of ligand; NO: residue not
crystallized; H20: water molecule within 4,5 Å; L: ligand-receptor interaction; numerical value: relative interaction energy calculated with MOE
tool. Ballesteros and Weinstein numbering system is used, whereby for example 3.50 is the most conserved residue of TM3.
Res Res Num
Mutagenesis data 3EML 3PWH 3REY 3RFM 3VG9 3VGA 3UZA 3UZC 3QAK 2YDO 2YDV
int if int if int if int if int if int if int if int if int if int if int if
Gly 1.31 5
O
NO
NO
NO
NO
NO
NO
NO
O
NO
O
Ser 1.32 6
O
NO
NO
NO
O
O
NO
NO
O
H20
Val 1.34 8
O
O
O
O
O
O
O
O
O
O
O
Tyr 1.35 9
H2O
O
O -12,8 O
O
O
O
O
O -11,4 O
H20
Glu 1.39 13 E13Q: reduced
agonists/antagonist affinity
O
O -7,0 O
O
O
O
O
O
O
O
O
Ala 2.57 59
H20
O
O
O
O
O
O
O -3,2 O
O
O
Ile 2.58 60
O
O
O
O
O
O
O
O
O
O
O
Ala 2.61 63
H2O
O -9,7 O -5,2 O -2,2 O
O
O -3,7 O -4,5 H20 -6,6 O -4,7 H20 -3,7
Ile 2.62 64
O
O -8,3 O
O
O
O
O
O
O
O
O
Ile 2.64 66
O -4,9 O -9,0 O -7,6 O -3,6 O -4,8 O
O -3,7 O -3,5 O -16,1 O -3,1 O -2,4
Ser 2.65 67
O
L -21,4 O -9,6 O
O
O
O
O
O -10,4 O
O
Thr 2.66 68
X
X
X
X
X
X
X
X
O
O
O
Ile 3.28 80
H2O
O
O -2,9 O
O
O
O
O
O
O
O
Ala 3.29 81
H2O
O
O -4,9 O
O
O
O -2,7 O
O
O
O
Cys 3.30 82
O
O
O -3,1 O
O
O
O
O
O
O
O
Phe 3.31 83
O
O
O
O
O
O
O
X
X
X
Val 3.32 84 V84L: marginal
variation H20 -3,9 O -3,6 O -6,5 O -5,5 O -3,5 O -3,6 O -9,1 O -13,2 O -15,0 O -11,1 O -14,1
Leu 3.33 85
O -7,3 O -4,9 O -9,4 O -2,6 O -7,8 O -5,0 O -8,7 O -10,7 O -21,8 O -17,0 O -24,2
Val 3.34 86
X
X
X
X
X
X
X
X
O
O
O
Leu 3.35 87
X
X
X
X
X
X
X
X
O
O
O
35
Thr 3.36 88 T88A/R/S: reduced
affinity; T88D/E: loss of agonists binding
O -2,1 O
O
O
O -2,4 O -2,2 O
O
L -25,9 O -12,5 L -26,7
Gln 3.37 89 Q89A/D: increased
affinity X
X
X
X
X
X
X
X
O -16,4 O
O -6,8
Ile 3.40 92
X
X
X
X
X
X
X
X
O -3,3 O
O -3,3
Gly 4.57 136
X
X
X
X
X
X
X
X
O
O
O
Glu 151 151 E151A/D/Q: loss of
affinity X
X
X
X
X
X
X
X
X
X
X
Glu 161 161 E161A: increased
antagonists affinity X
X
X
X
X
X
X
X
X
X
X
Leu 167 167
O -3,0 O
O
O
O
O -3,7 O
O
O -19,1 O
O
Phe 168 168 F168A: loss of affinity L -19,3 L -16,7 L -13,0 L -7,9 L -13,9 L -17,1 O -13,0 L -16,9 L -20,1 H20/
L -20,8 L -19,2
Glu 169 169 E169A: loss of affinity;
E169Q: no variation L -45,2 L -31,3 O
O -7,3 L -40,1 L -48,5 O -9,1 O -8,3 L
-111,
3 H20 -33,4
H20/L
-33,2
Asp 170 170 D170K: no variation H20
O
O
O
O
O
O
O
O
O
O
Val 172 172
O
O
O
O
O -9,2 O
O
O
O
O
O
Met 174 174
O
O -4,2 O -3,0 O
O
O -5,9 O -6,4 O -4,2 O -7,9 O
O
Met 5.38 177
H20 -6,3 L -5,3 O -3,6 O -3,2 O -4,4 O -3,3 O -8,5 O -8,0 O -8,1 O -9,1 O -9,3
Val 5.39 178
O
O
O
O
O
O
O
O
O
O
O
Asn 5.42 181 N181S: reduced agonists affinity
H20 -6,0 O
O -4,4 O
O -5,9 O -6,2 O -6,4 O -6,1 O -12,1 O -12,5 O -17,2
Phe 5.43 182 F182A: loss of affinity;
F182Y/W: reduced agonists affinity
O
O
O
O
O
O
O
O
O
O
O
Cys 5.46 185
X
X
X
X
X
X
X
X
O -2,1 O
O -3,7
Val 5.47 186
X
X
X
X
X
X
X
X
O -5,8 O
O -5,3
Phe 6.44 242
X
X
X
X
X
X
X
X
O
O
O
Try 6.48 246
O -6,3 O -4,5 O
O -2,5 O -7,6 O -6,0 O -11,5 O -12,7 O -23,4 O -16,4 O -21,2
Leu 6.51 249
O -15,4 L -14,4 O -11,1 O -6,5 O -12,7 O -13,1 O -6,1 O -12,6 O -28,1 O -22,7 O -22,0
His 6.52 250
H250A: loss of affinity; H250F/Y: decreased
agonists affinity; H250N: increased agonists affinity
H2O -6,5 L -7,1 O -7,6 O
O -5,3 O -5,5 O -12,6 O -12,4 O -21,3 H20 -10,1 H20 -17,8
Ile 6.54 252
O
O -4,8 O
O
O
O
O
O -4,7 O -13,5 O
O
Asn 6.55 253 N253A: loss of affinity H20/
L -42,2 L -33,5 L -26,5 L -14,8 L -37,7 O -27,7 L -48,5 L -44,7 L -56,1
H20/L
-41,3 H20/
L -49,0
36
Thr 6.58 256
O
O
O
O
O
O
O
O
O -20,2 H2O
H20
Phe 6.59 257 F257A: loss of affinity X
X
X
X
X
X
X
X
X
X
X
Cys 262 262 C262G: no variation X
X
X
X
X
X
X
X
O
O
O
Ser 263 263
X
X
X
X
X
X
X
X
O
O
O
His 264 264
H2O -11,5 O
O
O
O -8,3 O -9,6 O
O
O -8,2 O
O
Ala 265 265
O -7,3 O
O
O
O -8,0 O -7,5 O
O
O
O
O
Pro 266 266
O
O
O
O
O -4,7 O
O
O
O
O
O
Leu 267 267
O -15,8 O -10,6 O -18,7 O
O -10,3 O
O
O
O -19,3 O
O
Try 7.33 268
O
O
O -6,6 O
O
O
O
O
O
O
O
Leu 7.34 269
X
X
X
X
X
X
X
X
O
O
O
Met 7.35 270
O 0,0 O -9,7 O -3,9 O -3,9 O -10,3 O -7,8 O -6,8 O -8,1 O -18,8 O -8,6 O -8,9
Tyr 7.36 271
O -17,3 O -12,1 O -14,9 O
O -16,5 O -13,2 O
O
L -56,0 O
H20
Ala 7.38 273
O
O
O
O
O
O
O -1,4 O
O -2,0 O
O
Ile 7.39 274 I274A: loss of affinity O -8,9 L -18,4 O -9,6 L -8,5 O -8,4 O -8,0 O -12,3 O -14,1 O -25,6 O -21,6 O -18,6
Val 7.40 275
O
O
O
O
O
O
O -2,8 O
O
O
O
Ser 7.42 277
S277A/C: reduced agonists affinity; S277N/E/T: no
variation
O
O
O
O
O
O
O -1,8 O -2,1 L -23,0 L -26,4 L -24,4
His 7.43 278
H278A: loss of affinity; H278D/E: no variation;
H278Y: decreased affinity
O
O
O
O -3,4 O
O
O -8,8 L -24,9 L -31,5 L -26,0 L -23,5
Ser 7.46 281
S281A: loss of affinity; S281T: increased affinity; S281N:
increased agonists affinity, decreased antagonists affinity
X
X
X
X
X
X
X
X
X
X
X
37
4.2. DOCKING AND QSAR ANALYSIS OF DOCKING RESULTS
Data-analysis of ZM241385 docking results of the sixteen possible combinations of placing
algorithm and scoring function revealed that Trianglematcher is the most appropriate algorithm.
The use of Trianglematcher in combination with the Alpha HB Scoring function delivered the best
results, even without applying pharmacophore restrain. Hence, for subsequent docking of both the
training and test set, the combination Trianglematcher/Alpha HB was used. The results obtained by
using the combination Trianglematcher/Affinity dg and trianglematcher/London DG were also
acceptable, so these combinations were considered to support the docking results of the
Trianglematcher/Alpha HB docking combination. The docking of both training and test set were
carried out with the 3EML and the 3VG9 crystal structure because the best results were obtained
with these two structures. It appears in fact that in the case of the 3PWH crystal structure, the
Trianglematcher algorithm is apparently able to obtain some good conformations, but the scoring
functions rejected these results because of the low similarity with the folded conformation of the
4-hydroxyphenylethyl side chain of ZM241385 found in the crystal structure. Indeed, the docking
algorithm posed the ligand in the extended conformation as found in the other crystal structures.
Positioning ZM241385 in the extended conformation delivers good scores for the 3EML and 3VG9
crystal structures, but unfavorable scores for the 3PWH crystal structure.
Figure 4.1: A. Plot of XPRED of the best-scored position of each compound (except of outliers) of the training set versus the experimental pKi B. Plot of PRED of the best-scored position of each compound (except of outliers) of the test set versus the experimental pK i. Equation and cross-validated R2 are given in both A and B plots. Poses are obtained using 3EML crystal structure and force field refinement.
y = 0,5343x + 3,5311R² = 0,49518
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
XP
RED
pKiA
y = 0,2987x + 5,1961R² = 0,4596
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
PR
ED
pKiB
38
Table 4.2: QSAR model obtained for training set using conformations generated by using 3EML crystal structure and force field refinement (left) or gridmin (right).
Activity Field: pKi Weight Field: Condition Limit: 1e+006 Component Limit: 0 Observations: 72 Descriptors: 5 ROOT MEAN SQUARE ERROR (RMSE): 0,77193 CORRELATION COEFFICIENT (R
2): 0,57539
CROSS-VALIDATED RMSE: 0,84425 CROSS-VALIDATED R
2: 0,49518
ESTIMATED LINEAR MODEL pKi = 5,00213 +0,14078 * dock_HB +0,18596 * dock_HYD -0,56233 * E_oop -0,09373 * E_rele ESTIMATED NORMALIZED LINEAR MODEL pKi / SD(pKi) = 4,22254 -0,17742 * dock_ENT / SD(dock_ENT) +0,03189 * dock_HB / SD(dock_HB) +0,19069 * dock_HYD / SD(dock_HYD) -0,16392 * E_oop / SD(E_oop) -0,63651 * E_rele / SD(E_rele) RELATIVE IMPORTANCE OF DESCRIPTORS 0,278741 dock_ENT 0,050098 dock_HB 0,299594 dock_HYD 0,257534 E_oop 1,000000 E_rele
Activity Field: pki Weight Field: Condition Limit: 1e+006 Component Limit: 0 Observations: 68 Descriptors: 5 ROOT MEAN SQUARE ERROR (RMSE): 0,97487 CORRELATION COEFFICIENT (R
2): 0,38310
CROSS-VALIDATED RMSE: 1,06308 CROSS-VALIDATED R
2: 0,27514
ESTIMATED LINEAR MODEL pKi = 5,07995 -1,34162 * dock_ENT -0,07026 * dock_HB +0,05255 * dock_HYD -2,11463 * E_oop -0,07815 * E_rele ESTIMATED NORMALIZED LINEAR MODEL pKi / SD(pKi) = 4,09279 -0,57520 * dock_ENT / SD(dock_ENT) -0,01852 * dock_HB / SD(dock_HB) +0,06209 * dock_HYD / SD(dock_HYD) -0,34687 * E_oop / SD(E_oop) -0,57818 * E_rele / SD(E_rele) RELATIVE IMPORTANCE OF DESCRIPTORS 0,994856 dock_ENT 0,032025 dock_HB 0,107388 dock_HYD 0,599931 E_oop 1,000000 E_rele
The training set of 77 A2AAR antagonists from literature (molecular structure reported in
ANNEX 1 Table)6,21-40 was docked twelve times, considering 3 algorithm/scoring function
combinations, 2 crystal structures, and 2 refinement methods. The pharmacophore created for
docking ZM241385 was used. It has to be remarked that the protonated database leads to better
results than the database with neutral compounds. Here, it seems that not protonation in se is
important, but rather the modified distribution of the partial charges when the compounds are
protonated. Hence, direct charge interactions seemed to be not of big importance in obtaining the
observed improvement of the results. For this reason, the protonated database was used for the
docking analysis of the 77 A2AAR antagonists. Subsequently, QSAR analysis of the docking result
39
was carried out. Only the results for Trianglematcher/Alpha HB are represented and discussed,
because the results obtained with Affinity DG and London DG were not good. Tables 4.2 and 4.3
show the QSAR models obtained from the training set. The number of observations, i.e. the
number of compounds left from the original dataset, used for creating these models is given.
Figure 4.2: A. Plot of XPRED of the best-scored position of each compound (except of outliers) of the training set versus the experimental pKi B. Plot of PRED of the best-scored position of each compound (except of outliers) of the test set versus the experimental pKi. Equation and cross-validated R2 are given in both A and B plots. Poses are obtained using 3EML crystal structure and gridmin refinement.
Figure 4.3: A. Plot of XPRED of the best-scored position of each compound (except of outliers) of the training set versus the experimental pKi B. Plot of PRED of the best-scored position of each compound (except of outliers) of the test set versus the experimental pK i. Equation and cross-validated R2 are given in both A and B plots. Poses are obtained using 3VG9 crystal structure and force field refinement.
y = 0,3239x + 5,1467R² = 0,2751
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
XP
RED
pKiA
y = 0,2909x + 5,5088R² = 0,2013
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
PR
ED
pKiB
y = 0,3826x + 4,6516R² = 0,3252
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
XP
RED
pKiA
y = 0,2196x + 5,8478R² = 0,0941
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
PR
ED
pKiB
40
Some compounds were removed during post-docking QSAR analysis as outliers and their
removal resulted in a better QSAR model. The linear combination of five descriptors (Dock_ENT,
Dock_HYD, Dock_HB, Eoop and Erele) led to the model.
Table 4.3: QSAR model obtained for training set using conformations generated by using 3VG9 crystal structure and force field refinement (left) or gridmin (right).
Activity Field: pKi
Weight Field: Condition Limit: 1e+006 Component Limit: 0 Observations: 70 Descriptors: 5 ROOT MEAN SQUARE ERROR (RMSE): 0,92574 CORRELATION COEFFICIENT (R
2): 0,44080
CROSS-VALIDATED RMSE: 1,02455 CROSS-VALIDATED R
2: 0,32521
ESTIMATED LINEAR MODEL pKi = 5,08753 -0,42035 * dock_ENT -0,79438 * dock_HB +0,18342 * dock_HYD -0,46403 * E_oop -0,11690 * E_rele ESTIMATED NORMALIZED LINEAR MODEL (SD = Standard Deviation) pKi / SD(pKi) = 4,10962 -0,18436 * dock_ENT / SD(dock_ENT) -0,27994 * dock_HB / SD(dock_HB) +0,21437 * dock_HYD / SD(dock_HYD) -0,09831 * E_oop / SD(E_oop) -0,72014 * E_rele / SD(E_rele) RELATIVE IMPORTANCE OF DESCRIPTORS 0,256006 dock_ENT 0,388738 dock_HB 0,297685 dock_HYD 0,136511 E_oop 1,000000 E_rele
Activity Field: pKi Weight Field: Condition Limit: 1e+006 Component Limit: 0 Observations: 71 Descriptors: 5 ROOT MEAN SQUARE ERROR (RMSE): 0,86451 CORRELATION COEFFICIENT (R
2): 0,50210
CROSS-VALIDATED RMSE: 0,94355 CROSS-VALIDATED R
2: 0,41161
ESTIMATED LINEAR MODEL pki = -0,35786 * dock_ENT -0,15842 * dock_HB +0,38960 * dock_HYD -1,75549 * E_oop -0,06437 * E_rele ESTIMATED NORMALIZED LINEAR MODEL (SD = Standard Deviation) pKi / SD(pKi) = 3,96930 -0,17855 * dock_ENT / SD(dock_ENT) -0,05556 * dock_HB / SD(dock_HB) +0,45020 * dock_HYD / SD(dock_HYD) -0,45910 * E_oop / SD(E_oop) -0,62436 * E_rele / SD(E_rele) RELATIVE IMPORTANCE OF DESCRIPTORS 0,285978 dock_ENT 0,088986 dock_HB 0,721062 dock_HYD 0,735311 E_oop 1,000000 E_rele
The best cross-validated R2 for 3EML is 0,4952 (Fig 4.1, Table 4.2). It was obtained when using
the 3EML/Trianglematcher/AlphaHB/Forcefield combination. Also the number of compounds of
the original dataset that was taken into account is the highest for this combination. This QSAR
41
model also resulted performing the best prediction of pKi values for the test set of 10 literature
A2AAR antagonists (molecular structure reported in ANNEX 2 Table).6,21,23,41,42 A disadvantage of this
model is the relative low score assigned to the compound with the highest pKi (red data point in
Fig. 4.1). We also have to remark that the pharmacophore used in this case only shares one feature
with ZM241385 and the receptor was slightly modified.
In case of 3VG9, the best result was found with the 3VG9/Trianglematcher/AlphaHB/GridMin
combination. Here, the pharmacophore shares two features with ZM241385, the receptor did not
have to be modified, and the ligand of the test set with the highest pKi were also scored as the best
compounds. So, generally we can state that the obtained 3VG9/Trianglematcher/AlphaHB/GridMin
QSAR model was the best one among the developed models. Even the correlation coefficient for
this combination is relatively low. This is allowed because we do not aim at predicting the pKi of
the test set compounds, but only try to determine which compounds are lying within a certain
range and are candidates to synthesize in a next stage.
Figure 4.4: A. Plot of XPRED of the best-scored position of each compound (except of outliers) of the training set versus the experimental pKi B. Plot of PRED of the best-scored position of each compound (except of outliers) of the test set versus the experimental pKi. Equation and cross-validated R2 are given in both A and B plots. Poses are obtained using 3VG9 crystal structure and gridmin refinement.
4.3. DEVELOPMENT AND STUDY OF HOMOLOGY MODELS
A third part of this work is the generation of homology models of the four adenosine
receptor subtypes of which the alignment is given in Table 4.4. We compared the binding of
adenosine, the physiological ligand, in all four models. Adenosine is bound in a similar way in the
four adenosine receptor subtype homology models. This is logically explainable, since the
y = 0,4556x + 4,0601R² = 0,4116
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
XP
RED
pKiA
y = 0,3762x + 4,702R² = 0,2992
4,00
5,00
6,00
7,00
8,00
9,00
10,00
4,00 6,00 8,00 10,00
PR
ED
pKiB
42
adenosine used in the homology models has the same conformation as that of the pdb-deposited
2YDO crystal structure. The most noteworthy difference is the presence of a valine instead of
glutamate 169 in the A3AR homology model.
Table 4.4: Sequence alignment for the four human adenosine receptor subtypes (*
designates sequence identity in all the four subtypes, and : indicate the presence of residues
with comparable physicochemical characteristics, residues marked in yellow are of importance in
ligand binding and are discussed in the text).
hA1 1 ---MPPSISAFQAAYIGIEVLIALVSVPGNVLVIWAVKVNQALRDATFCFIVSLAVADVA 57
hA2A 1 ------MPIMGSSVYITVELAIAVLAILGNVLVCWAVWLNSNLQNVTNYFVVSLAAADIA 54
hA2B 1 -----MLLETQDALYVALELVIAALSVAGNVLVCAAVGTANTLQTPTNYFLVSLAAADVA 55
hA3 1 MPNNSTALSLANVTYITMEIFIGLCAIVGNVLVICVVKLNPSLQTTTFYFIVSLALADIA 60
. *: :*: *. :: ***** .* *: * *:**** **:*
hA1 58 VGALVIPLAILINIGPQTYFHTCLMVACPVLILTQSSILALLAIAVDRYLRVKIPLRYKM 117
hA2A 55 VGVLAIPFAITISTGFCAACHGCLFIACFVLVLTQSSIFSLLAIAIDRYIAIRIPLRYNG 114
hA2B 56 VGLFAIPFAITISLGFCTDFYGCLFLACFVLVLTQSSIFSLLAVAVDRYLAICVPLRYKS 115
hA3 61 VGVLVMPLAIVVSLGITIHFYSCLFMTCLLLIFTHASIMSLLAIAVDRYLRVKLTVRYKR 120
** :.:*:** :. * : **:::* :*::*::**::***:*:***: : : :**:
hA1 118 VVTPRRAAVAIAGCWILSFVVGLTPMFGWNNLSAVER----AWA---ANGSMGEPVIKCE 170
hA2A 115 LVTGTRAKGIIAICWVLSFAIGLTPMLGWNN-------CGQPKEGKNHSQGCGEGQVACL 167
hA2B 116 LVTGTRARGVIAVLWVLAFGIGLTPFLGWNSKDSATNNCTEPWDGTTNESCC---LVKCL 172
hA3 121 VTTHRRIWLALGLCWLVSFLVGLTPMFGWNMKLTSEYH-------------RNVTFLSCQ 167
:.* * :. *:::* :****::*** : *
hA1 171 FEKVISMEYMVYFNFFVWVLPPLLLMVLIYLEVFYLIRKQLNKKVSAS--SGDPQKYYGK 228
hA2A 168 FEDVVPMNYMVYFNFFACVLVPLLLMLGVYLRIFLAARRQLKQMESQPLPGERARSTLQK 227
hA2B 173 FENVVPMSYMVYFNFFGCVLPPLLIMLVIYIKIFLVACRQLQRTEL----MDHSRTTLQR 228
hA3 168 FVSVMRMDYMVYFSFLTWIFIPLVVMCAIYLDIFYIIRNKLSLNLSN---SKETGAFYGR 224
* .*: *.*****.*: :: **::* :*: :* .:*. :
hA1 229 ELKIAKSLALILFLFALSWLPLHILNCITLFCPSC--HKPSILTYIAIFLTHGNSAMNPI 286
hA2A 228 EVHAAKSLAIIVGLFALCWLPLHIINCFTFFCPDC-SHAPLWLMYLAIVLSHTNSVVNPF 286
hA2B 229 EIHAAKSLAMIVGIFALCWLPVHAVNCVTLFQPAQGKNKPKWAMNMAILLSHANSVVNPI 288
hA3 225 EFKTAKSLFLVLFLFALSWLPLSIINCIIYFNG----EVPQLVLYMGILLSHANSMMNPI 280
*.: **** ::: :***.***: :**. * . * :.*.*:* ** :**:
hA1 287 VYAFRIQKFRVTFLKIWNDHFRCQPAPPIDEDLPEE------------------------ 322
hA2A 287 IYAYRIREFRQTFRKIIRSHVLRQQEPFKAAGTSARVLAAHGSDGEQVSLRLNGHPPGVW 346
hA2B 289 VYAYRNRDFRYTFHKIISRYLLCQADVKSGNGQ----------AGVQPALGVGL------ 332
hA3 281 VYAYKIKKFKETYLLILKACVVCHPSDSLDTSIEKNSE---------------------- 318
:**:: :.*: *: * . :
hA1 323 ----------RPDD---------------------------------------------- 326
hA2A 347 ANGSAPHPERRPNGYALGLVSGGSAQESQGNTGLPDVELLSHELKGVCPEPPGLDDPLAQ 406
hA2B 333 ------------------------------------------------------------
hA3 319 ------------------------------------------------------------
43
Figure 4.5: Schematic overview of interactions between adenosine and the A. A1AR, B.
A2AAR, C. A2BAR and D. A3AR.
This residue substitution could explain why some A3AR selectivity of ligands is obtained
through introduction on the 6-NH2 group of the adenine moiety of a small alkyl group.43-46 This
group is more hydrophobic and may easily interact with the valine residue. In the A1AR homology
model, serine 277 seen in the three other subtypes is mutated by a threonine. This mutation has
no influence on binding the adenosine, because the characteristics of threonine are plus minus the
same as those of serine.
Also two residues of the hydrophobic pocket, where the scaffold is inserted, are mutated. In
the A2BAR model, we observe substitution of leucine 249 into a valine, which is also a hydrophobic
residue. So, this will not have a great influence on binding adenosine. Methionine 270 is replaced
by a threonine in the A1AR model and by a leucine in the A3AR, also this may not result in
significant consequences. Asparagine 181 is replaced by a serine in the A3AR. As discussed in the
study of mutagenesis data, this may reduce affinity for adenosine, because the hydrogen bond
network is not present anymore.
44
Figure 4.6: Representation of the binding pocket of the homology models in presence of
adenosine: A. A1AR, B. A2AAR, C. A2BAR and D. A3AR.
45
5. CONCLUSION
In the first part of this work, we analysed mutagenesis data of the A2AAR reported in the
literature. All residues in proximity of the co-crystallized ligands were discussed. This analysis
makes it evident which are the key residues involved in agonist or antagonist interaction, like for
example phenylalanine 168, glutamate 169, and asparagine 253. In the case of the agonist-bound
crystal structures, also serine 277 and histidine 278 seem to contribute significantly in ligand
binding. So, in case of mutation of these residues, the loss of binding affinity is due to a direct
influence on ligand-receptor interactions. Also effects on hydrogen bonding or hydrophobic
contacts between residues may cause an increase or decrease of ligand affinity. This was observed
when residues such as asparagine 181 and phenylalanine 257 were substituted. Furthermore,
some residues may cause stabilization of the receptor conformation. This was seen for example for
glutamate 13 and histidine 278, two residues that interact with each other through hydrogen
bonding. In some cases the results of mutagenesis data did not find a clear explanation in our
analysis, for example the mutation of glutamate 89 into alanine or aspartate.
Our second objective was to search for a docking-scoring tool able to give an approximate
estimation of the biological activity of an arbitrary A2AAR antagonist. Therefore, we built a
database of 77 antagonists from literature. These compounds were docked with different
algorithms and the best-scored conformation of each compound was used to develop a scoring
model that was applied to a set of 10 test compounds (again from literature). The prediction values
obtained by the model were plotted against the experimental pKi affinity data. The results of this
analysis allowed us to define a placement-scoring algorithm combination of MOE software able to
provide a reasonable estimation of activity at A2AAR. It must be underlined that this tool is not
aimed at predicting the pKi values but rather at generally distinguishing the analysed compounds in
active vs. not active derivatives.
The third goal of this work was to build homology models of the four AR subtypes and to
check for eventual relevant differences comparing the binding sites of these models. These
variations could be considered to develop compounds that are selective for one of the subtypes.
The comparative analysis of the developed models highlights subtle discrepancies in the binding
region of adenosine. For example, a relevant difference among the AR subtypes is the presence of
a valine residue in EL2 of A3AR that is substituted by a glutamate in all the other three subtypes
(i.e. glutamate 169 in A2AAR). While glutamate is able to form a hydrogen bond with the exocyclic
46
amino group of adenosine, the side chain of valine is hydrophobic and hence not able to provide
an analogue interaction. This could explain the selectivity of N6-substituted adenosine derivatives
for A3AR with respect to the other AR subtypes.
47
6. BIBLIOGRAPHY
1. Jaakola, V.-P. & IJzerman, A. P. The crystallographic structure of the human adenosine A2A
receptor in a high-affinity antagonist-bound state: implications for GPCR drug screening and
design. Current Opinion in Structural Biology 20, 401-414 (2010).
2. Rodriguez, D. et al. Molecular Dynamics Simulations Reveal Insights into Key Structural
Elements of Adenosine Receptors. Biochemistry 50, 4194-4208 (2011).
3. Verzijl, D. & IJzerman, A. P. Functional selectivity of adenosine receptor ligands. Purinergic
Signalling 7, 171-192 (2011).
4. Fredholm, B. B. et al. International Union of Basic and Clinical Pharmacology. LXXXI.
Nomenclature and Classification of Adenosine Receptors‚ An Update. Pharmacological
Reviews 63, 1-34 (2011).
5. Dal Ben, D. et al. Adenosine Receptor Modeling: What Does the A2A Crystal Structure Tell
Us? Current Topics in Medicinal Chemistry 10, 993-1018 (2010).
6. Cristalli, G. et al. Vol. 193 Handbook of Experimental Pharmacology (eds Constance N.
Wilson & S. Jamal Mustafa) 59-98 (Springer Berlin Heidelberg, 2009).
7. Palczewski, K. et al. Crystal structure of rhodopsin: A G protein-coupled receptor. Science
289, 793-745 (2000).
8. Rasmussen, S. G. et al. Crystal structure of the human beta2 adrenergic G-protein-coupled
receptor. Nature 450, 383-387 (2007).
9. Warne, T. et al. Structure of a beta-adrenergic G-protein-coupled receptor. Nature 454, 486-
491 (2008).
10. Jaakola, V.-P. et al. The 2.6 Angstrom Crystal Structure of a Human A2A Adenosine Receptor
Bound to an Antagonist. Science 322, 1211-1217 (2008).
11. Xu, F. et al. Structure of an agonist-bound human A2A adenosine receptor. Science 332, 322-
327 (2011).
12. Lebon, G. et al. Agonist-bound adenosine A2A receptor structures reveal common features
of GPCR activation. Nature 474, 521-525 (2011).
13. Doré, A. S. et al. Structure of the Adenosine A2A Receptor in Complex with ZM241385 and
the Xanthines XAC and Caffeine. Structure 19, 1283-1293 (2011).
14. Hino, T. et al. G-protein-coupled receptor inactivation by an allosteric inverse-agonist
48
antibody. Nature 482, 237-240 (2012).
15. Congreve, M. et al. Discovery of 1,2,4-Triazine Derivatives as Adenosine A2A Antagonists
using Structure Based Drug Design. Journal of Medicinal Chemistry 55, 1898-1903 (2012).
16. Lane, J. R. et al.. in Advances in Pharmacology Vol. 61 (eds A. Jacobson Kenneth & Linden
Joel) Ch. 1, 1-40 (Academic Press, 2011).
17. Kim, J. et al. Site-directed mutagenesis identifies residues involved in ligand recognition in
the human A2A adenosine receptor. Journal of Biological Chemistry 270, 13987-13997
(1995).
18. Jaakola, V.-P. et al. Ligand Binding and Subtype Selectivity of the Human A2A Adenosine
Receptor. Journal of Biological Chemistry 285, 13032-13044 (2010).
19. Wermuth, C. G. et al. Glossary of terms used in medicinal chemistry (IUPAC
Recommendations 1998). Pure and Applied Chemistry 70, 1129-1143 (1998).
20. Selassie, C. D. in Burger's Medicinal Chemistry and Drug Discovery Vol. 1 (ed D. J. Abraham)
(John Wiley and Sons, 2003).
21. Stefanachi, A. et al. 1,3-Dialkyl-8-(hetero)aryl-9-OH-9-deazaxanthines as potent A2B
adenosine receptor antagonists: Design, synthesis, structure-affinity and structure-
selectivity relationships. Bioorganic & Medicinal Chemistry 16, 9780-9789 (2008).
22. Bulicz, J. et al. Synthesis and pharmacology of pyrido[2,3-d]pyrimidinediones bearing polar
substituents as adenosine receptor antagonists. Bioorganic & Medicinal Chemistry 14,
2837-2849 (2006).
23. Drabczynska, A. et al. Synthesis and biological activity of tricyclic cycloalkylimidazo-,
pyrimido- and diazepinopurinediones. European Journal of Medicinal Chemistry 46, 3590-
3607 (2011).
24. Moorjani, M. et al. 2,6-Diaryl-4-acylaminopyrimidines as potent and selective adenosine
A2A antagonists with improved solubility and metabolic stability. Bioorganic & Medicinal
Chemistry Letters 18, 5402-5405 (2008).
25. Giovannoni, M. P. et al. Pyrazolo[1′,5′:1,6]pyrimido[4,5-d]pyridazin-4(3H)-ones as selective
human A1 adenosine receptor ligands. Bioorganic & Medicinal Chemistry 18, 7890-7899
(2010).
26. Slee, D. H. et al. 2-Amino-N-pyrimidin-4-ylacetamides as A2A Receptor Antagonists: 1.
Structure-Activity Relationships and Optimization of Heterocyclic Substituents. Journal of
Medicinal Chemistry 51, 1719-1729 (2008).
49
27. Neustadt, B. R. et al. Potent and selective adenosine A2A receptor antagonists: 1,2,4-
Triazolo[1,5-c]pyrimidines. Bioorganic & Medicinal Chemistry Letters 19, 967-971 (2009).
28. Eastwood, P. et al. Discovery of potent and selective bicyclic A2B adenosine receptor
antagonists via bioisosteric amide replacement. Bioorganic & Medicinal Chemistry Letters
20, 1634-1637 (2010).
29. Pastorin, G. et al. Synthesis and pharmacological characterization of a new series of 5,7-
disubstituted-[1,2,4]triazolo[1,5-a][1,3,5]triazine derivatives as adenosine receptor
antagonists: A preliminary inspection of ligand-receptor recognition process. Bioorganic &
Medicinal Chemistry 18, 2524-2536 (2010).
30. Shah, U. et al. Design, synthesis, and evaluation of fused heterocyclic analogs of SCH 58261
as adenosine A2A receptor antagonists. Bioorganic & Medicinal Chemistry Letters 18, 4204-
4209 (2008).
31. Gillespie, R. J. et al. Antagonists of the human A2A receptor. Part 6: Further optimization of
pyrimidine-4-carboxamides. Bioorganic & Medicinal Chemistry 17, 6590-6605 (2009).
32. Gillespie, R. J. et al. Antagonists of the Human A2A Adenosine Receptor. 4. Design, Synthesis,
and Preclinical Evaluation of 7-Aryltriazolo[4,5-d]pyrimidines. Journal of Medicinal
Chemistry 52, 33-47 (2008).
33. Slee, D. H. et al. Identification of Novel, Water-Soluble, 2-Amino-N-pyrimidin-4-yl
Acetamides as A2A Receptor Antagonists with In Vivo Efficacy. Journal of Medicinal
Chemistry 51, 400-406 (2008).
34. Cheong, S. L. et al. The Significance of 2-Furyl Ring Substitution with a 2-(para-substituted)
Aryl Group in a New Series of Pyrazolo-triazolo-pyrimidines as Potent and Highly Selective
hA3 Adenosine Receptors Antagonists: New Insights into Structure-Affinity Relationship and
Receptor-Antagonist Recognition. Journal of Medicinal Chemistry 53, 3361-3375 (2010).
35. Eastwood, P. et al. Discovery of N-(5,6-diarylpyridin-2-yl)amide derivatives as potent and
selective A2B adenosine receptor antagonists. Bioorganic & Medicinal Chemistry Letters 20,
1697-1700 (2010).
36. Mishra, C. B. et al. Novel 8-(furan-2-yl)-3-substituted thiazolo [5,4-e][1,2,4] triazolo[1,5-c]
pyrimidine-2(3H)-thione derivatives as potential adenosine A2A receptor antagonists.
Bioorganic & Medicinal Chemistry 18, 2491-2500 (2010).
37. Borrmann, T. et al. 1-Alkyl-8-(piperazine-1-sulfonyl)phenylxanthines: Development and
Characterization of Adenosine A2B Receptor Antagonists and a New Radioligand with
50
Subnanomolar Affinity and Subtype Specificity. Journal of Medicinal Chemistry 52, 3994-
4006 (2009).
38. Baraldi, P. G. et al. Novel 1,3-Dipropyl-8-(3-benzimidazol-2-yl-methoxy-1-methylpyrazol-5-
yl)xanthines as Potent and Selective A2B Adenosine Receptor Antagonists. Journal of
Medicinal Chemistry 55, 797-811 (2011).
39. Shao, Y. et al. Synthesis and SAR studies of trisubstituted purinones as potent and selective
adenosine A2A receptor antagonists. Bioorganic & Medicinal Chemistry Letters 19, 1399-
1402 (2009).
40. Kozma, E. et al. Novel fluorescent antagonist as a molecular probe in A3 adenosine receptor
binding assays using flow cytometry. Biochemical Pharmacology 83, 1552-1561 (2012).
41. McGuinness, B. F. et al. Discovery of 2-aminoimidazopyridine adenosine A2A receptor
antagonists. Bioorganic & Medicinal Chemistry Letters 20, 6845-6849 (2010).
42. Piersanti, G. et al. Synthesis of benzo[1,2-d;3,4-d’]diimidazole and 1H-pyrazolo[4,3-
b]pyridine as putative A2A receptor antagonists. Organic & Biomolecular Chemistry 5, 2567-
2571 (2007).
43. Volpini, R. et al. N6-methoxy-2-alkynyladenosine derivatives as highly potent and selective
ligands at the human A3 adenosine receptor. Journal of Medicinal Chemistry 50, 1222-1230
(2007).
44. Volpini, R. et al. Synthesis and biological evaluation of 2-alkynyl-N6-methyl-5'-N-
methylcarboxamidoadenosine derivatives as potent and highly selective agonists for the
human adenosine A3 receptor. Journal of Medicinal Chemistry 52, 7897-7900 (2009).
45. Volpini, R. et al. N6-alkyl-2-alkynyl derivatives of adenosine as potent and selective agonists
at the human adenosine A3 receptor and a starting point for searching A2B ligands Journal
of Medicinal Chemistry 45, 3271-3279 (2002).
46. Dal Ben, D. et al. Molecular modeling study on potent and selective adenosine A3 receptor
agonists. Bioorganic & Medicinal Chemistry 18, 7923-7930 (2010).
51
ANNEX 1: TABLE WITH STRUCTURE OF ANTAGONISTS WITH A2AAR AFFINITY USED AS
TRAINING SET IN DOCKING ANALYSIS
cpd Structure ref pKi cpd Structure ref pKi
1
6 9,40
13
6 8,30
2
6 9,37
14
6 7,70
3
6 9,10
15
21 7,51
4
6 8,96
16
6 7,44
5
6 8,85
17
22 7,22
6
6 8,82
18
22 7,09
7
6 8,80
19
21 7,04
8
6 8,77
20
21 7,00
9
6 8,75
21
21 6,59
10
6 8,57
22
23 6,51
11
6 8,43
23
21 6,49
12
6 8,42
24
21 6,39
52
25
6 8,42
37
21 6,33
26
23 6,21
38
24 7,57
27
21 6,16
39
25 5,13
28
23 6,07
40
26 6,00
29
21 6,03
41
27 8,02
30
23 5,82
42
27 8,74
31
22 5,70
43
28 6,75
32
6 5,70
44
29 6,42
33
23 5,62
45
30 8,72
34
23 5,49
46
30 9,15
35
22 4,63
47
30 8,15
36
6 8,96
48
31 8,19
53
49
6 8,77
60
31 8,55
50
32 8,07
61
33 8,54
51
31 8,52
62
34 6,92
52
31 9,00
63
25 6,24
53
35 7,92
64
36 8,45
54
35 7,52
65
37 7,04
55
24 9,22
66
38 6,02
56
25 7,17
67
38 6,30
57
24 7,06
68
24 9,70
58
38 6,48
69
36 11,09
59
24 8,40
70
31 7,77
54
71
39 8,22
75
39 7,22
72
27 9,30
76
27 9,52
73
24 8,70
77
40 7,29
74
25 5,50
ANNEX 2: TABLE WITH STRUCTURE OF ANTAGONISTS WITH A2AAR AFFINITY USED AS TEST
SET IN DOCKING ANALYSIS
cpd Structure ref pKi cpd Structure ref pKi
1
6 8,92
6
41 6,36
2
6 8,18
7
23 5,80
3
6 7,72
8
21 5,61
4
6 7,34
9
42 5,47
5
21 6,98
10
23 4,66