4
ARTICLE Backbone 1 H N , 13 C, and 15 N resonance assignments of the tandem RNA-binding domains of human DGCR8 Braden M. Roth Mirko Hennig Received: 9 May 2012 / Accepted: 21 June 2012 Ó Springer Science+Business Media B.V. 2012 Abstract Double-stranded RNA binding domain (dsRBD) containing proteins are critical components of the microRNA (miRNA) pathway, with key roles in small RNA biogenesis, modification, and regulation. DiGeorge Critical Region 8 (DGCR8) is a 773 amino acid, dsRBD- containing protein that was originally identified in humans as a protein encoded in the region of chromosome 22 that is deleted in patients with DiGeorge syndrome. Now, it is realized that DGCR8 complements the nuclear RNase III Drosha to initiate miRNA biogenesis by promoting effi- cient recognition and cleavage of primary miRNAs (pri- miRNA). A pair of C-terminal tandem dsRBDs separated by a flexible linker are required for pri-miRNA substrate binding and recognition. The crystal structure of the DGCR8 core region comprising residues 493–720 revealed that each dsRBD adopts the canonical abbba fold. How- ever, several residues located in important flexible regions including the b1-b2-loop implicated in canonical dsRNA recognition are absent in the crystal structure and no RNA- bound structure of DGCR8 has been reported. Here we report the 1 H N , 13 C, and 15 N backbone resonance assign- ments of the 24 kDa, 214 amino acid human DGCR8 core (residues 493–706) by heteronuclear NMR spectroscopy. Our assignments lay the foundation for a detailed solution state characterization of the dynamical and RNA-binding properties of this protein in solution. Keywords DiGeorge Critical Region 8 (DGCR8) Double stranded RNA binding domain (dsRBD) Microprocessor microRNA (miRNA) Biological context MicroRNAs are non-coding RNAs that regulate gene expression through directed downregulation of targeted messenger RNA (mRNA). MiRNA biogenesis involves two distinct RNase III processing events. Long, noncoding pri- miRNA transcripts are initially cleaved by the nuclear RNase III Drosha to release a *70nt precursor (pre)-miR- NA. Pre-miRNAs undergo a second round of processing by the cytoplasmic RNase III Dicer before incorporating into the RISC (RNA-Induced Silencing Complex) where the mature miRNA serves as a guide to repress translation or destabilize mRNAs with incomplete complementarity to the miRNA guide. RNase III-dependent miRNA biogenesis is facilitated by dsRBD containing protein cofactors. In humans, the nuclear protein DGCR8 assists Drosha in the recognition and cleavage of miRNA-containing pri-miRNA hairpins and is required for embryonic development and differentiation. DGCR8 was initially identified as the essential gene located within a 1.5-Mb chromosomal deletion (del22q11) commonly associated with DGS. More recently, DGCR8 has been shown to play a central role in pri-miRNA recognition and processing (Han et al. 2004; Landthaler et al. 2004). Drosha and DGCR8 copurify from cell extracts and together constitute the ‘‘minimal’’ Microprocessor, responsible for the initiation step of pri-miRNA biogenesis (Gregory et al. 2004). Through a largely unknown process, the Micropro- cessor recognizes and binds the miRNA-containing hairpin at its single-stranded(ss)-double stranded (ds) RNA junction B. M. Roth M. Hennig (&) Department of Biochemistry and Molecular Biology, Medical University of South Carolina, Charleston, 70 President St, DD213, PO Box 250509, South Carolina 29425, USA e-mail: [email protected] 123 Biomol NMR Assign DOI 10.1007/s12104-012-9406-x

Roth and Hennig 2012

Embed Size (px)

Citation preview

ARTICLE

Backbone 1HN, 13C, and 15N resonance assignments of the tandemRNA-binding domains of human DGCR8

Braden M. Roth • Mirko Hennig

Received: 9 May 2012 / Accepted: 21 June 2012

� Springer Science+Business Media B.V. 2012

Abstract Double-stranded RNA binding domain

(dsRBD) containing proteins are critical components of the

microRNA (miRNA) pathway, with key roles in small

RNA biogenesis, modification, and regulation. DiGeorge

Critical Region 8 (DGCR8) is a 773 amino acid, dsRBD-

containing protein that was originally identified in humans

as a protein encoded in the region of chromosome 22 that is

deleted in patients with DiGeorge syndrome. Now, it is

realized that DGCR8 complements the nuclear RNase III

Drosha to initiate miRNA biogenesis by promoting effi-

cient recognition and cleavage of primary miRNAs (pri-

miRNA). A pair of C-terminal tandem dsRBDs separated

by a flexible linker are required for pri-miRNA substrate

binding and recognition. The crystal structure of the

DGCR8 core region comprising residues 493–720 revealed

that each dsRBD adopts the canonical abbba fold. How-

ever, several residues located in important flexible regions

including the b1-b2-loop implicated in canonical dsRNA

recognition are absent in the crystal structure and no RNA-

bound structure of DGCR8 has been reported. Here we

report the 1HN, 13C, and 15N backbone resonance assign-

ments of the 24 kDa, 214 amino acid human DGCR8core

(residues 493–706) by heteronuclear NMR spectroscopy.

Our assignments lay the foundation for a detailed solution

state characterization of the dynamical and RNA-binding

properties of this protein in solution.

Keywords DiGeorge Critical Region 8 (DGCR8) �Double stranded RNA binding domain (dsRBD) �Microprocessor � microRNA (miRNA)

Biological context

MicroRNAs are non-coding RNAs that regulate gene

expression through directed downregulation of targeted

messenger RNA (mRNA). MiRNA biogenesis involves two

distinct RNase III processing events. Long, noncoding pri-

miRNA transcripts are initially cleaved by the nuclear

RNase III Drosha to release a *70nt precursor (pre)-miR-

NA. Pre-miRNAs undergo a second round of processing by

the cytoplasmic RNase III Dicer before incorporating into

the RISC (RNA-Induced Silencing Complex) where the

mature miRNA serves as a guide to repress translation or

destabilize mRNAs with incomplete complementarity to the

miRNA guide. RNase III-dependent miRNA biogenesis is

facilitated by dsRBD containing protein cofactors. In

humans, the nuclear protein DGCR8 assists Drosha in the

recognition and cleavage of miRNA-containing pri-miRNA

hairpins and is required for embryonic development and

differentiation.

DGCR8 was initially identified as the essential gene

located within a 1.5-Mb chromosomal deletion (del22q11)

commonly associated with DGS. More recently, DGCR8 has

been shown to play a central role in pri-miRNA recognition

and processing (Han et al. 2004; Landthaler et al. 2004).

Drosha and DGCR8 copurify from cell extracts and together

constitute the ‘‘minimal’’ Microprocessor, responsible for

the initiation step of pri-miRNA biogenesis (Gregory et al.

2004). Through a largely unknown process, the Micropro-

cessor recognizes and binds the miRNA-containing hairpin

at its single-stranded(ss)-double stranded (ds) RNA junction

B. M. Roth � M. Hennig (&)

Department of Biochemistry and Molecular Biology, Medical

University of South Carolina, Charleston, 70 President St,

DD213, PO Box 250509, South Carolina 29425, USA

e-mail: [email protected]

123

Biomol NMR Assign

DOI 10.1007/s12104-012-9406-x

positioning Drosha’s catalytic RNase III domains to cleave

the pri-miRNA approximately one helical turn from the ss-

dsRNA junction (Han et al. 2006).

The *175 amino acid tandem dsRBD domain (511–685)

is the signature motif of RNase III-associated accessory

proteins. Crystal structures of dsRBD in complex with two

collinearly stacking, GC-rich dsRNA substrates helped to

define canonical interaction interfaces involving the a-heli-

cal and b1-b2 loop regions of a typical abbba dsRBD.

Besides its tandem C-terminal RBDs, DGCR8 features a

WW motif overlapping with the C352-mediated heme-

binding/dimerization domain (Faller et al. 2007), and an

N-terminal nuclear localization signal. Undoubtedly, the two

C-terminal dsRBDs of DGCR8 play a key role in pri-miRNA

substrate recognition, but answering the question whether

Drosha and DGCR8 function in mutually exclusive or

cooperative fashion awaits detailed structural information on

DGCR8-pri-miRNA complexes. Sohn et al. (2007) reported

the crystal structure of the DGCR8 ‘‘core’’, (residues

493–720) comprising both dsRBDs and the flexible linker

(residues 493–720). Their analysis revealed that the dsRBDs

are arranged in a pseudo two-fold symmetry stabilized by

hydrophobic interactions and hydrogen bonding between

both dsRBDs and a C-terminal loop/helix (residues

684–698) (Fig. 2, a7). Absent from the crystal structure,

however, are residues that correspond to flexible regions,

most notably the dsRBD1-dsRBD2 linker and a canonical

RNA-binding region located in the b5–b6 loop (Fig. 2).

Although this work has confirmed the canonical abbbaarchitecture of each dsRBD, how pri-miRNA are recognized

and distinguished from the myriad of ss-dsRNA junction in

other RNA transcripts by DGCR8 and whether the dsRBDs

coordinate the binding of a single RNA substrates remains

unknown. If a single, rod-like pri-miRNA substrate would be

recognized then the tandem dsRBD1-dsRBD2 would have to

undergo a significant conformational change upon binding

suggesting that NMR dynamics investigations could help to

decipher the pri-miRNA binding mechanism.

Here we report the 1HN, 13C, and 15N backbone reso-

nance assignments and secondary structure of the 24 kDa

human DGCR8core in solution. These will serve as a

foundation for further investigation of DGCR8 in terms of

protein dynamics and DGCR8/pri-miRNA complex

interactions.

Methods and experiments

Expression and purification of deuterated 2H,15N,(13C)-

DGCR8core

We have generated a 214 amino acid (24 kDa) truncated

version of the DGCR8core consisting of residues 493–706.

DGCR8core was amplified from pFLAG/HA-DGCR8

plasmid (Addgene #10921; Landthaler et al. 2004) with the

TEV protease recognition sequence ENLYFQS incorpo-

rated in the 50 primer: (50- GAAAACCTGTACTTT-

CAGtcagtgcaagatgcacccacaaagaaag-30 and 50- tcagaccatct

tgctgctctcacggccatacatgcgc-30; uppercase represents TEV

recognition sequence). The PCR product was used to

generate a pENTR-TEV/core Gateway entry vector by

TOPO-TA cloning with Invitrogen’s pCR8/GW/TOPO

vector. pENTR-TEV/core was recombined with a cus-

tomized Gatweway p(H)GB1-GTW destination vector that

incorporated the TEV/core sequence in frame with an

N-terminal His6-GB1 protein. Positive clones were trans-

formed into E. coli BL21-CodonPlus (DE3)-RIPL compe-

tent cells (Stratagene). Deuterated cultures of E. coli

BL21-CodonPlus (DE3)-RIPL cells expressing DGCR8core

were grown from a single colony in 2 ml LB/Amp100 at

37 �C overnight. These cultures were then adapted to

growth in 500 ml of [99.9 % D2O]-15N, (13C)-enriched M9

minimal media with a succession of small-volume cultures

grown at 37 �C. Protonated glucose carbon sources were

used and expression was induced with 1 mM IPTG for 4 h

at 37 �C, then purified with the HisTrapFF affinity column

(GE Healthcare). Partially purified fusion proteins were

cleaved with TEV protease overnight at room temperature

and reapplied to the HisTrap column. The flow-through

contained highly purified protein that was subsequently

exchanged into NMR buffer.

NMR spectroscopy

Even in the absence of the previously identified dimeriza-

tion domain of DGCR8 and at modest concentration,

DGCR8core exhibits a strong tendency to dimerize in

solution. In agreement with previously published work,

size-exclusion chromatography (SEC) analysis yielded an

estimated molecular weight (MW) of 31.0 kDa for the

DGCR8core, significantly larger than the theoretical MW of

24.2 kDa (data not shown). To address the resulting line

broadening, we have expressed deuterated 15N, 13C-

DGCR8core and utilized TROSY (transverse relaxation

optimized spectroscopy)-based pulse sequences to enhance

sensitivity and resolution. NMR spectra were acquired in

500 ll of 90 % H2O/10 % D2O at 25 �C on back-

exchanged 2H,15N- as well as 2H,15N,13C-labeled

DGCR8core samples (ca. 300 lM in 20 mM Na-phosphate

buffer, pH 7.0, 150 mM KCl, and 5 mM TCEP).

Three-dimensional, clean TROSY-based (Pervushin

et al. 1997; Schulte-Herbruggen and Sorensen 2000),

constant-time (CT)-HNCA, sequential CT-HNCA, CT-

HN(CA)CB, CT-HN(COCA)CB, HNCO and HN(CA)CO

were recorded on a TCI-cryoprobe-equipped Bruker

Avance800 spectrometer to establish sequential

B. M. Roth, M. Hennig

123

connectivities based on a pairwise comparison of intra- and

inter-residue 13Ca, 13Cb and 13C’ chemical shifts. Subse-

quently, assignments were confirmed using through-space1HN and 15N correlations derived from modified 3D

HSQC-NOESY-TROSY and NOESY-TROSY experi-

ments. The NMR data were processed using nmrPipe, and

analyzed with CcpNMR Analysis software.

Assignments and data deposition

The well-dispersed 2D 1H-15N TROSY spectrum of

DGCR8core is shown in Fig. 1. In total, 800 of 845 (95 %)

backbone resonances were assigned, including 96 % of1HN protons and 15N amides, 95 % of Ca, 90 % of Cb, and

97 % of C’ resonances. Residues in flexible regions (the

dsRBD1-dsRBD2 linker and the b5-b6 loop) that were

elusive in the crystal structure were readily observed. 15N

assignments for the N-terminal residue (S493), nine pro-

lines and eight other residues (H140, N156, Q157, K175,

N176, K177, S185, and N196) have not been obtained,

presumably due to line broadening as a result of solvent

exchange or intermediate conformational exchange. 1H

chemical shifts were externally referenced to DSS, with

heteronuclear 13C and 15N chemical shifts referenced

indirectly according to the X/1H ratio in DSS. Assignments

Fig. 1 Quality of the NMR spectra obtained for DGCR8core showing an 800 MHz 1H,15N TROSY spectrum recorded at 25 �C. An enlarged

view of the most crowded region of the spectrum is show in the top-left corner. Assignments of the signals from backbone amide groups are

indicated by residue type and number; tryptophan sidechain indole resonances are marked with e

Fig. 2 Plot of the secondary chemical shifts used to derive the

secondary structure of DGCR8core. Differences of chemical shift

deviations of Ca and Cb carbons (Dd(Ca)–(Dd(Cb)) with respect to

corresponding random coil values are plotted against amino acid

residue number. Locations of the a-helices and b-strands identified by

TALOS? are indicated with boxes and labeled a1-7 and b1-7

Backbone 1HN, 13C, and 15N resonance assignments

123

have been deposited in the BioMagResBank database

(http://www.bmrb.wisc.edu) under accession number

17773.

Backbone Ca Cb and C’ assignments were used in

chemical shift index (Wishart and Sykes 1994) and TA-

LOS? (Shen et al. 2009) analyses as shown in Fig. 2; the

resulting secondary structure elements closely resemble

those of the crystal structure (Sohn et al. 2007) with minor

differences in secondary structure boundaries. The pre-

dicted secondary structure of human DGCR8core in solution

includes seven a-helices (a1: residues 512–524, a2: resi-

dues 562–576, a3: residues 593–598, a4: residues 606–614,

a5: residues 620–629, a6: residues 670–684, and a7: resi-

dues 691–697) and seven b-strands (b1: residues 503–505,

b2: residues 529–535, b3: residues 544–548, b4: residues

551–558, b5: residues 639–644, b6: residues 651–657, and

b7: residues 661–666).

Acknowledgments This work was supported by the National Sci-

ence Foundation [MCB 0845512 to M.H.]. We thank T. Tuschl for the

pFLAG/HA-DGCR8 construct and members of the Hennig laboratory

for stimulating discussions and comments on the manuscript and

acknowledge the support of the Hollings Marine Laboratory NMR

Facility for this work.

References

Faller M, Matsunaga M, Yin S, Loo JA, Guo F (2007) Heme is

involved in microRNA processing. Nat Struct Mol Biol 14(1):

23–29

Gregory RI, Yan KP, Amuthan G, Chendrimada T, Doratotaj B,

Cooch N, Shiekhattar R (2004) The Microprocessor complex

mediates the genesis of microRNAs. Nature 432(7014):235–240

Han J, Lee Y, Yeom KH, Kim YK, Jin H, Kim VN (2004) The

Drosha-DGCR8 complex in primary microRNA processing.

Genes Dev 18(24):3016–3027

Han J, Lee Y, Yeom KH, Nam JW, Heo I, Rhee JK, Sohn SY, Cho Y,

Zhang BT, Kim VN (2006) Molecular basis for the recognition

of primary microRNAs by the Drosha-DGCR8 complex. Cell

125(5):887–901

Landthaler M, Yalcin A, Tuschl T (2004) The human DiGeorge

syndrome critical region gene 8 and Its D. melanogaster

homolog are required for miRNA biogenesis. Curr Biol 14(23):

2162–2167

Pervushin K, Riek R, Wider G, Wuthrich K (1997) Attenuated T2

relaxation by mutual cancellation of dipole–dipole coupling and

chemical shift anisotropy indicates an avenue to NMR structures

of very large biological macromolecules in solution. Proc Natl

Acad Sci USA 94(23):12366–12371

Schulte-Herbruggen T, Sorensen OW (2000) Clean TROSY: com-

pensation for relaxation-induced artifacts. J Magn Reson 144(1):

123–128

Shen Y, Delaglio F, Cornilescu G, Bax A (2009) TALOS ? : a hybrid

method for predicting protein backbone torsion angles from

NMR chemical shifts. J Biomol NMR 44(4):213–223

Sohn SY, Bae WJ, Kim JJ, Yeom KH, Kim VN, Cho Y (2007) Crystal

structure of human DGCR8 core. Nat Struct Mol Biol 14(9):

847–853

Wishart DS, Sykes BD (1994) The 13C chemical-shift index: a simple

method for the identification of protein secondary structure using

13C chemical-shift data. J Biomol NMR 4(2):171–180

B. M. Roth, M. Hennig

123