15
1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung Tung and Karissa Y. Sanbonmatsu 6 Los Alamos National Laboratory, Los Alamos, NM 87545, USA 7 Correspondence to Karissa Y. Sanbonmatsu: [email protected] 8 http://dx.doi.org/10.1016/j.jmb.2013.02.030 9 Edited by A. Pyle 10 11 12 Abstract 13 Novel, profound and unexpected roles of long non-coding RNAs (lncRNAs) are emerging in critical aspects of 14 gene regulation. Thousands of lncRNAs have been recently discovered in a wide range of mammalian 15 systems, related to development, epigenetics, cancer, brain function and hereditary disease. The structural 16 biology of these lncRNAs presents a brave new RNA world, which may contain a diverse zoo of new 17 architectures and mechanisms. While structural studies of lncRNAs are in their infancy, we describe existing 18 structural data for lncRNAs, as well as crystallographic studies of other RNA machines and their implications 19 for lncRNAs. We also discuss the importance of dynamics in RNA machine mechanism. Determining 20 commonalities between lncRNA systems will help elucidate the evolution and mechanistic role of lncRNAs in 21 disease, creating a structural framework necessary to pursue lncRNA-based therapeutics. 22 © 2013 Published by Elsevier Ltd. 23 24 25 Introduction 26 RNA is primarily known as an intermediary in gene 27 expression between DNA and proteins. Over the 28 past several decades, other roles for RNA have 29 been identified, which include protein synthesis, 30 gene regulation and nucleic acid processing. How- 31 ever, these RNAs were considered as outliers in a 32 transcriptome that consists mainly of protein-coding 33 RNAs. While this may be the case for bacteria and 34 certain unicellular organisms, 1 transcriptomes in 35 higher eukaryotes are markedly different. Recent 36 deep sequencing studies in humans have shown 37 that more than two-thirds of the genome is actively 38 transcribed. 2 Since protein-coding genes constitute 39 a very small fraction of the genome, 3 non-coding 40 RNAs represent the vast majority (more than 80% in 41 mammals). 4 Recent studies estimate approximately 42 15,000 long non-coding RNAs ( lncRNAs) in 43 humans. 5 In light of their expression profiles specific 44 to cell, cell cycle, tissue, developmental stage and 45 disease, it is difficult to precisely quantify the number 46 of human lncRNAs. We note that tens of thousands 47 of lncRNAs have been profiled in 2012. 610 This 48 recent explosion of newly discovered lncRNAs 49 suggests that non-coding RNAs may be the norm 50 rather than the exception in the case of eukaryotic 51 organisms. 52 LncRNAs are defined by the following: (i) lack of 53 coding potential and (ii) transcript length (N 200 nt). 11 54 These transcripts are generally nuclear retained and 55 transcribed by RNA polymerase II, with many that are 56 spliced and polyadenylated. 1215 LncRNAs may be 57 intronic, intergenic (large intervening non-coding 58 RNAs or lincRNAs) or antisense to the protein- 59 coding genes (overlapping one or more exons). 60 While transcript lengths are normally in the range 61 100010,000 residues, the Air and kcnq1ot1 62 lncRNAs have lengths that exceed 90 kb. 1618 63 Interestingly, in 2013, a novel monsternon-coding 64 transcript, XACT (252 kb in length), was found to 65 originate from the active X chromosome in human 66 pluripotent cells. 19 Many lncRNAs play key roles in 67 signaling, 2022 development, 23 cancer, 12,2429 em- 68 bryonic stem cell pluripotency, 30 brain function, 3134 69 subcellular compartmentalization, 26,3538 chromatin 70 remodeling, 12,39,40 plant biology 41,42 and stress 71 response. 43 Cell biology studies and functional 72 aspects of lncRNAs have been discussed by Rinn 73 and Chang in a recent review article, as well as in 74 many references therein. 6 Since lncRNAs are often 75 associated with histone modification and chromatin IMF YJMBI-63988; No. of pages: 15; 4C: 3, 4, 7, 8, 10 0022-2836/$ - see front matter © 2013 Published by Elsevier Ltd. J. Mol. Biol. (2013) xx, xxxxxx Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol. Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

1

2 Rise of the RNA Machines: Exploring the Structure of3 Long Non-Coding RNAs

4

5Irina V.Q1 Novikova, Scott P. Hennelly, Chang-Shung Tung and Karissa Y. Sanbonmatsu

6 Los Alamos National Laboratory, Los Alamos, NM 87545, USA

7 Correspondence to Karissa Y. Sanbonmatsu: [email protected] http://dx.doi.org/10.1016/j.jmb.2013.02.0309 Edited by A. Pyle1011

12 Abstract

13 Novel, profound and unexpected roles of long non-coding RNAs (lncRNAs) are emerging in critical aspects of14 gene regulation. Thousands of lncRNAs have been recently discovered in a wide range of mammalian15 systems, related to development, epigenetics, cancer, brain function and hereditary disease. The structural16 biology of these lncRNAs presents a brave new RNA world, which may contain a diverse zoo of new17 architectures and mechanisms. While structural studies of lncRNAs are in their infancy, we describe existing18 structural data for lncRNAs, as well as crystallographic studies of other RNA machines and their implications19 for lncRNAs. We also discuss the importance of dynamics in RNA machine mechanism. Determining20 commonalities between lncRNA systems will help elucidate the evolution and mechanistic role of lncRNAs in21 disease, creating a structural framework necessary to pursue lncRNA-based therapeutics.22 © 2013 Published by Elsevier Ltd.

2324

25 Introduction

26 RNA is primarily known as an intermediary in gene27 expression between DNA and proteins. Over the28 past several decades, other roles for RNA have29 been identified, which include protein synthesis,30 gene regulation and nucleic acid processing. How-31 ever, these RNAs were considered as outliers in a32 transcriptome that consists mainly of protein-coding33 RNAs. While this may be the case for bacteria and34 certain unicellular organisms,1 transcriptomes in35 higher eukaryotes are markedly different. Recent36 deep sequencing studies in humans have shown37 that more than two-thirds of the genome is actively38 transcribed.2 Since protein-coding genes constitute39 a very small fraction of the genome,3 non-coding40 RNAs represent the vast majority (more than 80% in41 mammals).4 Recent studies estimate approximately42 15,000 long non-coding RNAs (lncRNAs) in43 humans.5 In light of their expression profiles specific44 to cell, cell cycle, tissue, developmental stage and45 disease, it is difficult to precisely quantify the number46 of human lncRNAs. We note that tens of thousands47 of lncRNAs have been profiled in 2012.6–10 This48 recent explosion of newly discovered lncRNAs49 suggests that non-coding RNAs may be the norm

50rather than the exception in the case of eukaryotic51organisms.52LncRNAs are defined by the following: (i) lack of53coding potential and (ii) transcript length (N200 nt).11

54These transcripts are generally nuclear retained and55transcribed by RNA polymerase II, withmany that are56spliced and polyadenylated.12–15 LncRNAs may be57intronic, intergenic (large intervening non-coding58RNAs or lincRNAs) or antisense to the protein-59coding genes (overlapping one or more exons).60While transcript lengths are normally in the range611000–10,000 residues, the Air and kcnq1ot162lncRNAs have lengths that exceed 90 kb.16–18

63Interestingly, in 2013, a novel “monster” non-coding64transcript, XACT (252 kb in length), was found to65originate from the active X chromosome in human66pluripotent cells.19 Many lncRNAs play key roles in67signaling,20–22 development,23 cancer,12,24–29 em-68bryonic stem cell pluripotency,30 brain function,31–34

69subcellular compartmentalization,26,35–38 chromatin70remodeling,12,39,40 plant biology41,42 and stress71response.43 Cell biology studies and functional72aspects of lncRNAs have been discussed by Rinn73and Chang in a recent review article, as well as in74many references therein.6 Since lncRNAs are often75associated with histone modification and chromatin

IMF YJMBI-63988; No. of pages: 15; 4C: 3, 4, 7, 8, 10

0022-2836/$ - see front matter © 2013 Published by Elsevier Ltd. J. Mol. Biol. (2013) xx, xxx–xxx

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

ivn
kys2
The names are ok
Karissa Sanbonmatsu
Please insert new reference 0 after reference 23
Page 2: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

76 remodeling, epigenetic effects represent a unifying77 theme. While the mechanism by which epigenetic78 factors find their targets is largely unknown, many79 recent studies indicate that lncRNAs may be a80 necessary component of the network that guides81 these factors to their chromatin targets. In addition to82 epigenetic effects, the origin of lncRNAs is not well83 understood. While the genomes of simple organisms84 are dominated by protein-coding genes, the advent85 of non-coding RNAs in higher organisms gave rise to86 an incredible sequence space for exploration of new87 function at the RNA level, creating sophisticated88 regulatory networks. Possible evolution mechanisms89 may include pseudogenization of protein-coding90 genes, insertion of transposons and duplication of91 genes. Investigations into the open area of the origin92 and evolution of lncRNAs have the potential to yield93 exciting results.

94 RNA machines

95 RNA molecules are able to form complex molec-96 ular machines. The ribosome is one of the most well97 known RNA machines since it has many moving98 parts and may be powered by GTP hydrolysis.44,45

99 In certain circumstances, the ribosome can function100 without GTP hydrolysis (“factor-free translation”).46

101 Factor-free translation is quite inefficient in compar-102 ison to GTP-based translation and in comparison to103 the far more efficient protein-based molecular104 motors (e.g., flagellum).47 The ribosome also pro-105 cesses information by performing a look-up table106 operation, converting the 4-letter nucleic acid alpha-107 bet into the 20-letter protein alphabet. The group I108 and group II self-splicing introns are also molecular109 machines as they use chemical energy to bring110 together distant regions of RNA.48–50 RNase P and111 telomerase RNA also catalyze chemical reactions to112 accomplish their function.51–53 The riboswitch is113 another type of RNA molecular machine.54 Ribos-114 witches are molecular switches that sense their115 environment and allow gene expression decisions to116 be made on the basis of environmental inputs such117 as ligand concentration.55–60 More complicated118 tandem riboswitches act as logical AND gates. In119 addition, complex combinations of aptamers can be120 arranged into logic circuitry (e.g., OR, NAND, NOR121 and NOT gates).61 Finally, RNA scaffolding that122 controls information flow could be analogous to an123 integrated circuit (a device that has no moving parts124 but controls highly complex information flow in125 computers). Recent studies suggest that several126 different lncRNA systems may act as scaffolding,127 controlling information flow in epigenetic systems.62

128 Structural studies are often critical in deciphering129 RNA machines and the mechanism of RNA action.130 Few structural studies of individual lncRNAs have131 been performed due to their enormous size and very132 recent discovery.20,63 Considering that the high-

133resolution structure of the bacterial ribosome134(N4500 nt) required more than two decades for its135solution, structural studies of lncRNAs are136formidable.64,65 Fundamental questions regarding137the structure of lncRNAs remain unanswered,138including:

139(1) Is lncRNA mechanism dominated by pri-140mary sequence, higher-order structure or141both?142(2) Do lncRNAs contain sub-domains?143(3) Are lncRNAs complexed with many pro-144teins or do they exist as isolated RNAs that145transiently interact with proteins?146(4) Are lncRNAs compact or extended?147

148Here, we attempt to provide a starting point for149structural studies of lncRNAs. Because tertiary150structures have not been solved to date, we review151tertiary structures of other RNA machines. We next152summarize currently known structural features of153eukaryotic lncRNAs (Fig. 1). Finally, we discuss154lncRNA structure in the context of previously studied155RNA molecular machines and corresponding struc-156ture/function relationships.

157Crystallographic studies of RNA machines in158bacteria and other unicellular organisms

159Three-dimensional structure is often integral to the160function and mechanism of a biomolecular system.161Here, we review crystallographic structures of other162RNA systems. This review is by no means compre-163hensive but highlights the diversity of mechanism164and composition of RNA structure. The group I165intron, the group II intron, the ribosome and RNase P166are the only RNAs with lengths greater than 200 nt167that have been studied at high resolution.48,66

168The group I and group II introns are self-splicing169RNAs that catalyze their own cleavage. The struc-170ture of the group I intron consists largely of an171isolated RNA (~200 nt), organized into a well-172defined architecture by co-axially stacked helices173connected by single-stranded regions. The pre-2S174state of the group I intron includes an exon and also175contains mainly co-axially stacked helices con-176nected by single-stranded regions.67 Four of the177helices are capped by tetraloops. This structure178consists largely of an isolated RNA with one small179spliceosomal protein (U1A) bound to its extremity.180The active site contains two base triples, critical for181precise positioning of magnesium ions and catalysis.182The structure of the post-catalytic state of the183group II intron, solved by Toor et al., contains184approximately 400 nt, organized into more than 10185helices, with most capped by RNA stem–loop186structures.68 The RNA contains many co-axially187stacked helices, forming a relatively compact overall

2 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

ivn
kys2
Running title should be "Rise of the RNA machines"
Page 3: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

Fig. 1. Examples of non-coding RNAs studied with structural methods. Purple, lncRNAs with existing structural information. Orange, examples of non-coding RNAswith solved crystallographic structures. “3D” denotes systems with crystallographic structures; “2D” denotes systems with experimentally derived secondary structuresof the entire RNA.

3Review

:Structure

ofRNAsand

RNAMachines

Q2

Please

citethis

articleas:N

ovikova,I.etal.,Rise

oftheRNAMachines:E

xploringthe

Structure

ofLongNon-C

odingRNAs,J.M

ol.Biol.(2013),http://dx.doi.org/10.1016/j.jm

b.2013.02.030

Page 4: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

188 structure. A large number of long-range tertiary189 interactions connect the helices, including a kissing190 loop interaction, a ribose zipper interaction and an

191intricate five-way junction with many stacked non-192helical nucleotides. Several base triples exist in the193Z-anchor region, nearby the magnesium binding

Fig. 2. Tertiary structures of previously solved RNA complexes. (a) The group II intron, solved by Chan et al., is a highlystructured isolated RNA with a compact core.48 (b) The ribosome, solved by Gao et al., is highly structured and compact.70

Pink, rRNA; purple, tRNA; cyan, ribosomal proteins; yellow, elongation factor EF-G. Different protein factors bind to thesame binding sites, regulating protein synthesis. (c) RNase P.51 RNase P is structured and contains a single proteinbinding domain. Pink, RNase P RNA; purple, tRNA; yellow, RNase P protein. (d) Cellulosome protein-based molecularmachine. The cellulosome represents a relatively disordered but highly efficient molecular machine. Red, white and pink,cellulose substrate; gray, cellulase subunit; purple, dockerin connecting protein; blue, cohesin (disordered linker protein).

4 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 5: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

194 sites associated with catalysis. Marcia and Pyle also195 uncovered a reversible conformational change in a196 study of the group II intron, which included 14197 structures, capturing several different intermediates198 of splicing consistent with biochemical studies.69 A199 structure of the group II intron prior to the first step of200 splicing48 shows a high degree of similarity in overall201 architecture to the post-splicing configuration202 (Fig. 2a). Interestingly, a significant conformational203 change occurs in the core of the intron, where a204 single-stranded region forms a kink motif, producing205 a reversal in the direction of the RNA strand at this206 location. It has also been suggested that the207 spliceosome is mechanistically similar to the group208 II intron.68

209 The largest RNA tertiary structure solved to date is210 the ribosome (Fig. 2b).70 The architecture of the211 ribosome comprises two major subunits, whose212 RNA structure is formed by a large number of213 shorter RNA helices and a small number of longer214 helices organized into a compact geometry. Approx-215 imately 50 proteins (in the case of bacteria) are216 interspersed throughout the ribosome complex,217 producing a highly compact, globular architecture.218 The globular domains of ribosomal proteins them-219 selves typically reside on the exterior of the complex220 and possess long tails extending to the interior of the221 ribosome, near the active sites in some cases. The222 70S complex is interconnected by an enormous223 number of tertiary contacts, including both RNA/RNA224 and RNA/protein interactions. The structure of the225 ribosome is uniquely designed for its function: the226 ribosomemust (i) accept only correct tRNAs carrying227 amino acids corresponding to the mRNA codon, (ii)228 perform chemistry (adding the amino acid to the229 nascent protein)71 and (iii) move the tRNAs through230 the ribosome.72 A large cavity exists between the231 small and large subunits. This cavity is used to232 transport the tRNA ligands through the ribosome.233 The six major active sites of the ribosome are RNA234 based. In addition to ribosomal proteins, other235 protein factors bind to the ribosome, facilitating the236 initiation and termination of translation, translocation237 of the ribosome along the mRNA, improved fidelity of238 tRNA selection and regulation of protein synthesis.239 While ribosomal proteins bind all over the ribosome,240 protein factors bind at a few select sites, associating241 and dissociating with the ribosome at various stages.242 These include the GTP-associated center and the243 three tRNA binding sites (the aminoacyl site, the244 peptidyl site and the exit site). Elongation factors EF-245 Tu and EF-G each bind to GTP-associated center246 during different stages of the elongation.247 Several different RNA action mechanisms allow248 the ribosome to accomplish its function. Locally, a249 conformational selection mechanism may act at the250 decoding center (the aminoacyl site on the small251 subunit) during certain stages of elongation. During252 translocation of the ribosome along the mRNA, many

253large-scale conformational fluctuations occur simul-254taneously and at different timescales. Protein binding255or GTP hydrolysis events act to synchronize the256fluctuations, shifting the equilibrium to the next basin257in the energy landscape and allowing the ribosome to258progress through the elongation cycle.45,73

259RNase P (~330–400 nt) can be considered a260canonical RNA that is compact, highly structured261and binds a single protein (Fig. 2c).51 RNase P is an262RNA-based multiple-turnover enzyme that catalyzes2635′ end maturation of tRNA and other RNAs. While264the complex is dominated by RNA, a small protein265component (approximately 10% of total mass)266increases the affinity of tRNA to RNase P. The267overall architecture of the RNA is similar to the self-268splicing introns, consisting mainly of co-axially269stacked helices, connected by various tertiary270interactions.74

271The structure of the intact telomerase RNA, TER272(~450 nt in humans), has yet to be solved; however,273this system represents an interesting RNA in the274context of lncRNAs.52 The template for telomeric275repeat extension resides within the TER. The276telomerase RNA (TER) binds to telomerase reverse277transcriptase (TERT), facilitating nucleotide addition278to telomeric regions. Crystal structures of portions of279the telomerase RNA have been solved. These280structures show a high degree of organization with281several tertiary interaction motifs, including a well-282studied and functionally important pseudoknot near283the RNA template sequence. The structural integrity284of the RNA is dependent to some degree on the285interactions of TERT and accessory proteins.75 In286yeast, the precise location of the telomerase subunit287(est1p's binding domain on this RNA) is not the key288to its function and can be positioned within the RNA289at different locations while maintaining function.53

290These important experiments suggest that the291telomerase RNA may act as flexible scaffolding for292protein binding. Another telomerase-associated293RNA is the telomere-repeat-containing RNA294(TERRA). This RNA is transcribed from positions295near the telomeric ends and includes sequence from296the telomeric repeats. This RNA has been found to297associate with chromatin. TERRA appears to local-298ize near the telomeres. Its association is negatively299regulated by effectors of the nonsense-mediated300RNA decay pathway.76 The presence of increased301concentrations of TERRA may inhibit telomerase.77

302LncRNAmechanisms based on primary structure

303In several cases, mechanistic information has304been gained by studying the primary structure of305the lncRNA. Here, the role of lncRNA is mainly to306provide sequence specificity to a process. Such307mechanisms are well known in other RNA systems308such as RNAi (RNA interference), where base309pairing between short regulatory RNAs [siRNA

5Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 6: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

310 (small interfering RNA) or miRNA (micro RNA)] and311 mRNA plays a critical role in gene regulation.78

312 RISC (RNA-induced silencing complex) complexes313 use these short RNAs as templates to locate and314 silence their mRNA targets.78–84 The protein-based315 RISC complex provides the structural architecture316 and is responsible for either directly cleaving the317 target or down-regulating translation.85

318 A first example is the 1/2-sbsRNA, which facilitates319 mRNA decay by base pairing with the 3′-untrans-320 lated region (STAU1-mediated mRNA decay).13

321 Once bound, the lncRNA/mRNA complex is recog-322 nized by the double-stranded RNA binding protein,323 STAU1, triggering mRNA decay. Another example is324 the DHFR lncRNA, which is transcribed from a minor325 promoter of the dihydrofolate reductase gene and326 plays a key role in the epigenetic mechanism of327 promoter-specific repression of transcription.86

328 Here, the lncRNA forms a complex with the major329 promoter and also interacts with the transcription330 factor IIB, disrupting the preinitiation complex.331 Additional evidence exists in other systems for the332 cotranscriptional recruitment of chromatin modifying333 complexes utilizing base pairing between the na-334 scent RNA and its antisense transcript.87 Antisense335 lncRNAs have been also shown to regulate splicing336 by interacting with mRNAs. The long non-coding337 Zeb2 natural antisense transcript is antisense to a 5′338 splice site of the Zeb2 mRNA. Binding of Zeb2339 inhibits splicing of mRNA and, as a result, preserves340 the internal ribosome entry site, necessary for341 efficient translation.88 Linc-MD1, associated with342 muscle differentiation, is an example of an lncRNA343 that serves as a decoy for miRNAs. This lncRNA344 regulates miRNA action by providing alternative345 binding sites, effectively titrating the miR-133 and346 miR-135 away from their targets.89 These binding347 sites tend to be relatively short (i.e., much shorter348 than the total length of the lncRNA). The much larger349 remainder of the lncRNA may play an additional350 structural role in the lncRNA's mechanism of action351 yet to be determined.

352 Structural probing studies of lncRNAs

353 Secondary structure often plays a critical role in354 RNA mechanism, underpinning overall tertiary ar-355 chitecture by defining helices, bulges, stem–loops,356 internal loops, junctions and sub-domains. Second-357 ary structure alone can provide the basis of function358 when (i) the secondary structure defines a platform359 for unique protein recognition or (ii) a single360 sequence produces two competing secondary struc-361 tures, as in the case of the riboswitch. The riboswitch362 represents a quintessential secondary-structure-363 basedRNAmechanism.55–60 Here, a single sequence364 of RNA has two different secondary structures,365 which compete with each other to determine the366 outcome of gene expression. Often, ligand concen-

367tration shifts the equilibrium between secondary368structures, resulting in the formation or destruction369of a transcriptional terminator helix. In riboswitch370studies, the action mechanism (i.e., that the RNA is a371ligand-based molecular switch) is determined by372studying the secondary structure, without the need373for tertiary structure studies. Breaker et al., who have374discovered most of the known riboswitches, used375chemical probing experiments (in-line probing) to376determine the secondary structure of the riboswitch377in the presence or absence of the ligand.56,57,90,91

378Their technique has been validated in many crystal-379lographic studies.54,59,92,93 While many computa-380tional techniques have been developed to predict381secondary structure, predicting the secondary struc-382ture of long RNA sequences remains a significant383challenge.94–98 One promising strategy uses ma-384chine learning approaches.95–97 Our experimentally385determined secondary structures are convenient386benchmarks for new predictive algorithms.387Our group has produced the first experimentally388derived secondary structure of an intact lncRNA, the389steroid receptor RNA activator (SRA).20 This390lncRNA co-activates a variety of sex hormone391receptors (ER, AR, TR, GR, RAR) and has been392shown to directly interact with several proteins (ER,393SHARP, SLIRP, DAX-1, SF-1, TR, Pus1p).99–101

394SRA has been found to associate with CTCF and is395thought to play a scaffolding role in the transcription396complex.102 SRA is also strongly associated with397breast cancer and may be useful as an early onset398tumorigenesis marker. As many lncRNAs are similar399in size to ribosomal subunits, we followed the400footsteps of Noller, Woese and Gutell by (i)401performing extensive chemical probing to produce402a secondary structure of the full lncRNA (~870 nt)403and (ii) validating with covariance analysis across404multiple sequences (Fig. 3).103,104

405Our experiments uncovered an intricate and highly406structured two-dimensional architecture of the407lncRNA, organized into four major sub-domains. To408determine the secondary structure, we employed409four methods of structural probing: selective 2′-410hydroxyl acylation analyzed by primer extension411(SHAPE), in-line probing, DMS probing and RNase412VI digestion. The four methods yielded complemen-413tary information about the secondary structure of the414intact lncRNA and were highly consistent with each415other. Overall, the lncRNA comprises 25 helical416segments (H1–H25), 16 terminal loops, 15 internal417loops and 5 junction regions (Fig. 3), consistent with418the 16S rRNA (~1540 nt), which contains 45 helices,41931 terminal loops, 26 internal loops and 18 junction420regions. In the SRA system, helices H1–H7 com-421prise domain I. Helices H10–H14 comprise domain422II. Helices H8, H9 and H15–H21 comprise domain III,423while helices H22–H25 comprise domain IV. While424domains I–III are highly conserved, representing the425core region of SRA, domain IV is evolutionarily

6 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

ivn
ivn
Page 7: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

426 divergent and highly variable in sequence across427 mammals. Therefore, our initial expectation was that428 this region would have little structure because there429 was no obvious sequence conservation across430 species. However, we found that this domain does431 comprise a number of helical components, even

432though the helical density is slightly lower compared433to the rest of the sequence. This is consistent with434the recently solved crystallographic structure of the43580S yeast ribosome, whose expansion segments436are highly variable in sequence yet highly structured437in the crystallographic study.105

Fig. 3. First experimentally derived secondary structure of an intact higher eukaryote lncRNA, to our knowledge. Thehuman SRA has a length of 870 nt, organized into 4 sub-domains and 25 helices. Four biochemical techniques were usedto obtain the secondary structure (SHAPE, in-line, DMS and RNase V1). The vast majority of helices were validated withcovariance analysis based on multiple sequence alignment across vertebrates. Yellow, domain I; blue, domain II; purple,domain III; green, domain IV. Inset: secondary structure annotated with SHAPE probing results (SHAPE). Orange/red,high reactivity nucleotides; yellow, medium reactivity; gray/black, low/no mobility.

7Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 8: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

438 We also found that the SRA RNA secondary439 structure contains many highly conserved, purine-440 rich elements, primarily located in single-stranded441 regions of RNA. The same preference toward single-442 stranded regions is also found in riboswitches and443 ribosomes, attributed to the base-stacking propen-444 sity of purines. For example, in the eukaryotic445 ribosome, expansion segment 6 (ES6) contains446 purine-rich single-stranded regions.105 While, at447 first glance, this secondary structure might suggest448 a significant amount of disorder, the crystallographic449 structure reveals a high degree of organization, with450 many of the single-stranded nucleotides participat-451 ing in non-Watson-Crick base pairs (Fig. 4). Using a452 multiple sequence alignment across 45 species for453 the SRA gene, we were able to calculate the degree454 of covariation of base pairs between species.455 Helices H2, H3, H4, H6, H7, H8, H9, H12, H13,456 H14, H15, H19, H20 and H21 possess at least one457 covariant base pair.458 Generally, the longer the RNA sequence, the more459 alternative folding choices are present. Probing460 experiments on SRA were sufficient in most cases

461to select the correct regional fold, but for some RNA462sections, more information was required. In these463regions, we opted to use an additional experimental464technique, Shotgun Secondary Structure determi-465nation, where sub-fragments of the full RNA are466probed to determine modularly folded sub-domains467of SRA.20 This technique enabled us to determine468much of the secondary structures of domain II and469domain III. We note that several interesting studies470of viral secondary structures have been performed471using SHAPE probing.106–109 To obtain the second-472ary structure fold, we incorporated SHAPE reactivity473data with structure predictions based on thermody-474namic parameters. Covariance analysis and frag-475ment-based probing strategies such as the shotgun476methodology can be used to help validate these477structures.478In addition to studies of single intact lncRNAs,479Chang et al. are pioneering genome-wide studies of480RNA secondary structure. Performing RNase diges-481tion as a function of temperature, they have482produced a technique capable of measuring RNA483folding energies on a genome-wide scale.63 The

C

U

AC

G A

U

A

U

U

U U

AA A

UG

U

GU

C

AA

A

G

U

U

U

G

A

A

A

A

UG

C

AAAAU

UUAU

A

G

A

U

U

AA

AG

C G

CG

CAG U

GCU

UGU A G

C A

GA

U

G

U

G

C

A

UA

U

A

U

C

G

C

GU

A

U C

U

A

A

G

G

GCUU

UUU

U

UU

U

CGU

U G G

U

GA

A

G A

(a)

(b)

Fig. 4. Expansion segment ES6of the yeast ribosome small subunit.(a) Secondary structure of ES6displays a large internal loop(22 nt) and a large stem–loop (13-nt loop). (b) Despite the lack ofcanonical base pairs, both theinternal loop and the stem–loopare highly structured in the recentX-ray structure of the yeast ribo-some solved by Ben-Shem et al.105

Many of the purines in the internalloop form noncanonical base pairs.

8 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 9: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

484 experiments reveal key features near the start codon485 in mRNAs. The study also suggests that lncRNAs, on486 average, appear to be structured, with a level of487 structural content in between the structural content of488 the ribosome and the structural content of a typical489 mRNA. Lucks et al. have developed a method to490 perform SHAPE probing with the Illumina platform,491 which may also be scalable to genome-wide492 studies.110 New SHAPE reagents for in vivo RNA493 probing, with better solubility and increased half-lives,494 have recently been introduced by Spitale et al.111

495 Interesting structural elements involved in non-496 canonical 3′ end processing mechanisms have497 been determined for other lncRNA.26,27,112 Matura-498 tion of the NEAT1_v2 (MEN beta) and MALAT1499 transcripts relies on the formation of a cloverleaf500 (four-way junction) secondary structure. It is a highly501 conserved element, which mimics pre-tRNA sub-502 strate and recruits the RNase P complex. Subse-503 quently, RNase P cleaves upfront of this element,504 generating the mature ends of NEAT1_v2 and505 MALAT1 transcripts. The cleaved 3′ end fragments506 undergo an additional processing by RNase Z to507 yield shorter tRNA-like transcripts, named508 mascRNA (generated from MALAT1) and menRNA509 (generated from NEAT1_v2).510 The human accelerated region RNA, HAR1,511 associated with neocortex development, also con-512 tains a cloverleaf element, as evidenced by in vitro513 structural probing studies.113 Interestingly, signifi-514 cant sequence and structure divergence is observed515 between human and chimpanzee (18 mutations),516 attributed to brain evolution.114,115 In chimpanzee,517 structure probing of this region revealed an extended518 hairpin structure.113 It has been suggested that the519 chimpanzee RNA may form a cloverleaf element in520 the presence of an additional protein.521 Another element observed in lncRNAs is the522 double stem–loop, often associated with chromatin523 remodeling. This is a particularly interesting example524 because many lncRNAs have been shown to play525 important roles in chromatin remodeling, a focus of526 many transcriptome-wide lncRNA studies.15,116,117

527 In embryonic stem cells, Zhao et al. and Surface et al.528 discovered N9000 lncRNAs that interact with the529 polycomb repressive complex (PRC2).118,119 Bio-530 chemical analysis of PRC2-interacting lncRNAs531 showed that binding occurs through EZH2, a532 component of PRC2. In another study, a CLIP-seq533 investigation of RNAs associated with the SFRS1534 splicing factor uncovered N6000 spliced non-coding535 RNAs. Overall, chromatin remodeling lncRNAs536 appear to act in trans via association with chromatin537 modifying enzymes.116 It has been suggested, for538 the XIST system, that some of these interactionsmay539 occur via double stem–loop RNA elements.120,121

540 Double stem–loop RNA motifs have also been541 implicated in a PRC2-binding region of the HOTAIR542 lncRNA.12 HOTAIR (2.2 kb) regulates HoxD genes

543by recruiting the PRC2 and lysine-specific demethy-544lase (LSD), which each modify histones at targeted545loci. Deletions of portions of the HOTAIR RNA show546that PRC2 binds to a 300-nt region in the 5′ end of547HOTAIR and LSD binds to a 646-nt region at the 3′548end. The intervening sequence may spatially orga-549nize the two interaction sites. This region may also550contain motifs necessary for targeting. The exact551structures of the LSD1 binding motifs have not been552determined. Another system, the growth-arrest553specific non-coding transcript (gas5), contains a554hairpin element. This element is responsible for555regulation of the glucocorticoid receptor (GR) via a556decoy mechanism, mimicking the DNA hormone-557responsive element.21

558Possibilities for lncRNA three-dimensional559structure

560In the case of lncRNAs, many interesting ques-561tions are yet to be answered. For example, should562we expect to encounter structural motifs already met563in known RNA systems? What are the structural564differences at the RNA level across various organ-565isms? In light of the rapid turnover of lncRNAs, do we566expect the evolution of unique lineage-specific567structural elements?568We currently lack information on the tertiary569structure of lncRNAs. In addition, it is not known if570lncRNAs exist in ribonucleoprotein complexes571(RNPs) or predominantly as isolated RNAs. With572regard to tertiary structure, the ribosome is an573interesting system for comparison since the ribosome574is the only RNA system N1 kb whose crystallographic575structure has been solved. In terms of the composition576of a typical lncRNA complex, the total number of577unique lncRNAs and unique proteins is a useful578constraint. As mentioned above, the number of579human lncRNAs has been estimated to be580~15,000,5 while the number of protein-coding genes581is ~21,000.122 While an lncRNA-based ribosome-like582RNP complex could exist, it is unlikely that most583lncRNAs are ribosome-like RNPs, given the limited584number of unique protein-coding genes relative to the585number of unique lncRNA transcripts. Ruling out586ribosome-like RNA complexes, the following possibil-587ities remain for the composition of lncRNAcomplexes:588(1) RNP complexes with many repeats of a few589proteins, (2) RNP complexes with only a few proteins590or (3) isolated RNAs that transiently bind proteins as591needed for function. For scenario (1), ~10 protein592copies per 1 kb of lncRNA would be required to593produce complexeswith a similar protein-to-RNA ratio594as the ribosome (e.g., 20 protein copies for HOTAIR).595In scenarios (2) and (3), lncRNA complexes would596be similar in composition to RNase P, telomerase597RNA or the group I/II introns. An “RNase-P-like”598lncRNA complex would contain a highly structured599and compact RNA core with a single protein binding

9Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

ivn
Page 10: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

600 site (Fig. 5a). Alternatively, the RNA could be601 structurally decentralized without a compact core,602 containing several distinct protein binding sites603 (Fig. 5c). The lncRNA may also act as a flexible604 structural scaffold, as suggested for the telomerase605 RNA (Fig. 5b). In loose terms, such a flexible RNA606 could be analogous to the cellulosome, where many607 cellulase subunits are connected by long disordered608 linker proteins (Fig. 2d). The cellulase subunits do609 not act coherently but allow the cellulosome to610 efficiently process cellulose. Finally, the lncRNA611 could exist predominantly as an isolated, highly612 structured RNA, similar to the self-splicing introns.

613Here, the RNA may transiently bind proteins that614each perform a specific function.

615Supramolecular lncRNA complexes

616The large size of certain lncRNA systems has617produced new kinds of RNA complexes on a scale618not previously studied in the context of structural619biology. In X chromosome inactivation, the human620XIST is responsible for gene silencing on the inactive621X chromosome (Xi). At 17 kb, XIST is one of the622largest lncRNAs. A large number of XIST copies are623transcribed to physically coat the X chromosome.

(a) (b)

(c)

Fig. 5. Homology models representing potential paradigms for lncRNA tertiary structure. (a) Compact core. The lncRNA(pink) is highly structured and has a protein (yellow) binding site that may bind different proteins. (b) Loosely organizedprotein binding domain with relatively unstructured RNA. (c) Decentralized. The lncRNA possesses several protein(yellow) binding sites but no compact core.

10 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 11: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

624 FISH experiments suggest that XIST RNAs not only625 interact with chromatin but also crosstalk to produce626 a very large structural architecture on the scale of the627 entire X chromosome.123–125 The kcnq1ot1 lncRNA628 (108 kb) is also thought to coat chromosomes.16,18

629 Other lncRNAs participate in subcellular struc-630 ture formation, including NEAT1 (paraspeckle631 formation)27,37,38,126 and MALAT1/NEAT2 (nuclear632 speckle formation).28,36,33,127,128 NEAT1 non-coding633 RNA has two isoforms: a short NEAT1_V1 (3.7 kb)634 and a long NEAT1_V2 (22.7 kb). While the shorter635 transcript aids in the paraspeckle assembly, a longer636 transcript NEAT1_V2 acts as an essential scaffold,637 creating forming a structural network with sausage-638 like morphology.129,130 To date, we identify up to 35639 paraspeckle-associated proteins, which range from640 the splicing factors, the 3′ end processing enzymes641 and the disease-related proteins.129

642 From structure to mechanism: lncRNA dynamics

643 The dynamics of lncRNAs must be studied to fully644 understand their mechanisms. Timescales, in partic-645 ular, have been shown to play a critical role in646 understanding the mechanism of other molecular647 machines. In the ribosome system, rapid kinetics648 studies placedmany useful constraints on the order of649 events in tRNA selection and translocation prior to the650 solution of X-ray structures of the ribosome.131,132

651 With structural data in hand, single-molecule studies652 have identified more sub-steps of these processes,653 improved our understanding of transitions between654 states and produced a new framework for655 mechanism.73

656 As lncRNAs have only recently emerged as a657 class of RNAs, very little is known about the relevant658 timescales of lncRNA function. One recently discov-659 ered lncRNA, DBE-T, is an illustrative example with660 respect to processes that can occur in lncRNA661 systems.40 Here, the DBE-T lncRNA is an important662 part of the epigenetic switch associated with663 facioscapulohumeral muscular dystrophy. DBE-T is664 cis-acting RNA and tethers epigenetic factors to the665 D4Z4 binding element (DBE). As a result, histone666 methylation occurs. Here, timescales related to667 transcription, folding, protein binding to the RNA,668 protein binding to the chromatin and histone meth-669 ylation should be studied to determine the rate-670 limiting step.

671 Summary

672 LncRNAs have emerged as a new class of RNAs,673 playing important roles in development, stem cells,674 cancer, brain disease and epigenetic mechanism.675 LncRNA mechanism may be based on sequence,676 secondary structure, tertiary structure or a new677 combination of these mechanisms. In light of the678 high degree of diversity among lncRNAs, it is

679possible that the full mechanistic arsenal of previ-680ously studied RNAs may be employed in various681lncRNA systems. Because we are in the early stages682of lncRNA research, it is not clear if majority of these683systems provide scaffolding for gene regulation684complexes or engage in more active roles, such as685catalysis, molecular switching or information proces-686sing. With the identification of many thousands of687lncRNAs in recent years, it is clear that new688structural studies will play a key role in demystifying689these strange, new RNA machines.

690Acknowledgements

691The work was performed under the auspices of the692US Department of Energy via Los Alamos National693Laboratory, Laboratory Directed Research and694Development.

695Received 20 December 2012;696Received in revised form 21 February 2013;697Accepted 25 February 2013698

699Keywords:700long non-coding RNA;701lncRNA;702lincRNA;703lncRNA structure;704epigenetics705

706Abbreviations used:707lncRNA, long non-coding RNA; SRA, steroid receptor708RNA activator; SHAPE, selective 2′-hydroxyl acylation709analyzed by primer extension; LSD, lysine-specific de-710methylase; RNPribonucleoprotein complex.

711References

7121. Taft, R. J., Pheasant, M. & Mattick, J. S. (2007). The713relationship between non-protein-coding DNA and714eukaryotic complexity. BioEssays, 29, 288–299.7152. Djebali, S., Davis, C. A., Merkel, A., Dobin, A.,716Lassmann, T., Mortazavi, A. et al. (2012). Landscape717of transcription in human cells.Nature, 489, 101–108.7183. Mehler, M. F. & Mattick, J. S. (2007). Noncoding719RNAs and RNA editing in brain development,720functional diversification, and neurological disease.721Physiol. Rev. 87, 799–823.7224. Kapranov, P., Cheng, J., Dike, S., Nix, D. A.,723Duttagupta, R., Willingham, A. T. et al. (2007). RNA724maps reveal new RNA classes and a possible725function for pervasive transcription. Science, 316,7261484–1488.7275. Derrien, T., Johnson, R., Bussotti, G., Tanzer, A.,728Djebali, S., Tilgner, H. et al. (2012). The GENCODE729v7 catalog of human long noncoding RNAs: analysis730of their gene structure, evolution, and expression.731Genome Res. 22, 1775–1789.

11Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

ivn
ivn
Page 12: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

732 6. Rinn, J. L. & Chang, H. Y. (2012). Genome regulation733 by long noncoding RNAs. Annu. Rev. Biochem. 81,734 145–166.735 7. Ziats, M. N. & Rennert, O. M. (2012). Aberrant736 expression of long noncoding RNAs in autistic brain.737 J. Mol. Neurosci. 49, 589–593.738 8. Brunner, A. L., Beck, A. H., Edris, B., Sweeney, R. T.,739 Zhu, S. X., Li, R. et al. (2012). Transcriptional profiling740 of lncRNAs and novel transcribed regions across a741 diverse panel of archived human cancers. Genome742 Biol. 13, R75.743 9. Sati, S., Ghosh, S., Jain, V., Scaria, V. & Sengupta,744 S. (2012). Genome-wide analysis reveals distinct745 patterns of epigenetic features in long non-coding746 RNA loci. Nucleic Acids Res. 40, 10018–10031.747 10. Zhang, X., Sun, S., Pu, J. K., Tsang, A. C., Lee, D.,748 Man, V. O. et al. (2012). Long non-coding RNA749 expression profiles predict clinical phenotypes in750 glioma. Neurobiol. Dis. 48, 1–8.751 11. Banfai, B., Jia, H., Khatun, J., Wood, E., Risk, B.,752 Gundling, W. E., Jr et al. (2012). Long noncoding753 RNAs are rarely translated in two human cell lines.754 Genome Res. 22, 1646–1657.755 12. Gupta, R. A., Shah, N., Wang, K. C., Kim, J.,756 Horlings, H. M., Wong, D. J. et al. (2010). Long757 non-coding RNA HOTAIR reprograms chromatin758 state to promote cancer metastasis. Nature, 464,759 1071–1076.760 13. Gong, C. & Maquat, L. E. (2011). lncRNAs transacti-761 vate STAU1-mediated mRNA decay by duplexing762 with 3′UTRs via Alu elements.Nature, 470, 284–288.763 14. Ponting, C. P., Oliver, P. L. & Reik, W. (2009).764 Evolution and functions of long noncoding RNAs.765 Cell, 136, 629–641.766 15. Guttman,M., Amit, I., Garber,M., French, C., Lin,M. F.,767 Feldser, D. et al. (2009). Chromatin signature reveals768 over a thousand highly conserved large non-coding769 RNAs in mammals. Nature, 458, 223–227.770 16. Korostowski, L., Sedlak, N. & Engel, N. (2012). The771 Kcnq1ot1 long non-coding RNA affects chromatin772 conformation and expression of Kcnq1, but does not773 regulate its imprinting in the developing heart. PLoS774 Genet. 8, e1002956.775 17. Sleutels, F., Zwart, R. & Barlow, D. P. (2002). The776 non-coding Air RNA is required for silencing autoso-777 mal imprinted genes. Nature, 415, 810–813.778 18. Pandey, R. R., Mondal, T., Mohammad, F., Enroth,779 S., Redrup, L., Komorowski, J. et al. (2008).780 Kcnq1ot1 antisense noncoding RNA mediates line-781 age-specific transcriptional silencing through chro-782 matin-level regulation. Mol. Cell, 32, 232–246.783 19. Vallot, C., Huret, C., Lesecque, Y., Resch, A.,784 Oudrhiri, N., Bennaceur-Griscelli, A. et al. (2013).785 XACT, a long noncoding transcript coating the active786 X chromosome in human pluripotent cells. Nat.787 Genet. 45, 239–241.788 20. Novikova, I. V., Hennelly, S. P. & Sanbonmatsu, K. Y.789 (2012). Structural architecture of the human long790 non-coding RNA, steroid receptor RNA activator.791 Nucleic Acids Res. 40, 5034–5051.792 21. Kino, T., Hurt, D. E., Ichijo, T., Nader, N. & Chrousos,793 G. P. (2010). Noncoding RNA gas5 is a growth794 arrest- and starvation-associated repressor of the795 glucocorticoid receptor. Sci. Signaling, 3, ra8.

79622. Lin, D., Pestova, T. V., Hellen, C. U. & Tiedge, H.797(2008). Translational control by a small RNA:798dendritic BC1 RNA targets the eukaryotic initiation799factor 4A helicase mechanism. Mol. Cell. Biol. 28,8003008–3019.80123. Pauli, A., Rinn, J. L. & Schier, A. F. (2011). Non-802coding RNAs as regulators of embryogenesis. Nat.803Rev., Genet. 12, 136–149.80424. Huarte, M. & Rinn, J. L. (2010). Large non-coding805RNAs: missing links in cancer? Hum. Mol. Genet. 19,806R152–R161.80725. Tsai, M. C., Spitale, R. C. & Chang, H. Y. (2011).808Long intergenic noncoding RNAs: new links in809cancer progression. Cancer Res. 71, 3–7.81026. Wilusz, J. E., Freier, S. M. & Spector, D. L. (2008). 3′811end processing of a long nuclear-retained noncoding812RNA yields a tRNA-like cytoplasmic RNA. Cell, 135,813919–932.81427. Sunwoo, H., Dinger, M. E., Wilusz, J. E., Amaral,815P. P., Mattick, J. S. & Spector, D. L. (2009). MEN816epsilon/beta nuclear-retained non-coding RNAs are817up-regulated upon muscle differentiation and are818essential components of paraspeckles. Genome819Res. 19, 347–359.82028. Tano, K. & Akimitsu, N. (2012). Long non-coding821RNAs in cancer progression. Front. Genet. 3, 219.82229. Tani, H., Nakamura, Y., Ijiri, K. & Akimitsu, N. (2010).823Stability of MALAT-1, a nuclear long non-coding RNA824in mammalian cells, varies in various cancer cells.825Drug Discoveries Ther. 4, 235–239.82630. Dinger, M. E., Amaral, P. P., Mercer, T. R., Pang,827K. C., Bruce, S. J., Gardiner, B. B. et al. (2008).828Long noncoding RNAs in mouse embryonic stem829cell pluripotency and differentiation. Genome Res.83018, 1433–1445.83131. Sone, M., Hayashi, T., Tarui, H., Agata, K., Takeichi,832M. & Nakagawa, S. (2007). The mRNA-like noncod-833ing RNA Gomafu constitutes a novel nuclear domain834in a subset of neurons. J. Cell Sci. 120, 2498–2506.83532. Mercer, T. R., Dinger, M. E., Sunkin, S. M., Mehler,836M. F. & Mattick, J. S. (2008). Specific expression of837long noncoding RNAs in the mouse brain. Proc. Natl.838Acad. Sci. USA, 105, 716–721.83933. Bernard, D., Prasanth, K. V., Tripathi, V., Colasse, S.,840Nakamura, T., Xuan, Z. et al. (2010). A long nuclear-841retained non-coding RNA regulates synaptogenesis842by modulating gene expression. EMBO J. 29,8433082–3093.84434. Dharap, A., Nakka, V. P. & Vemuganti, R. (2012).845Effect of focal ischemia on long noncoding RNAs.846Stroke, 43, 2800–2802.84735. Clemson, C. M., Hutchinson, J. N., Sara, S. A.,848Ensminger, A. W., Fox, A. H., Chess, A. &849Lawrence, J. B. (2009). An architectural role for a850nuclear noncoding RNA: NEAT1 RNA is essential851for the structure of paraspeckles. Mol. Cell, 33,852717–726.85336. Tano, K., Mizuno, R., Okada, T., Rakwal, R., Shibato,854J., Masuo, Y. et al. (2010). MALAT-1 enhances cell855motility of lung adenocarcinoma cells by influencing856the expression of motility-related genes. FEBS Lett.857584, 4575–4580.85837. Mao, Y. S., Sunwoo, H., Zhang, B. & Spector, D. L.859(2011). Direct visualization of the co-transcriptional

12 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Karissa Sanbonmatsu
New reference 0 (after 23):Klattenhoff CA, Scheuermann JC, Surface LE, Bradley RK, Fields PA, Steinhauser ML, Ding H, Butty VL, Torrey L, Haas S, Abo R, Tabebordbar M, Lee RT, Burge CB, Boyer LA (2013). Braveheart, a long noncoding RNA required for cardiovascular lineage commitment. Cell 152, 570-583.
Page 13: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

860 assembly of a nuclear body by noncoding RNAs.Nat.861 Cell Biol. 13, 95–101.862 38. Ip, J. Y. & Nakagawa, S. (2011). Long non-coding863 RNAs in nuclear bodies. Dev., Growth Differ. [Epub864 ahead of print].865 39. Ietswaart, R., Wu, Z. & Dean, C. (2012). Flowering866 time control: another window to the connection867 between antisense RNA and chromatin. Trends868 Genet. 28, 445–453.869 40. Cabianca, D. S., Casa, V., Bodega, B., Xynos, A.,870 Ginelli, E., Tanaka, Y. & Gabellini, D. (2012). A long871 ncRNA links copy number variation to a polycomb/-872 trithorax epigenetic switch in FSHD muscular dys-873 trophy. Cell, 149, 819–831.874 41. Swiezewski, S., Liu, F., Magusin, A. & Dean, C.875 (2009). Cold-induced silencing by long antisense876 transcripts of an Arabidopsis Polycomb target.877 Nature, 462, 799–802.878 42. Song, J., Angel, A., Howard, M. & Dean, C. (2012).879 Vernalization—a cold-induced epigenetic switch. J.880 Cell Sci. 125, 3723–3731.881 43. Mizutani, R., Wakamatsu, A., Tanaka, N., Yoshida,882 H., Tochigi, N., Suzuki, Y. et al. (2012). Identification883 and characterization of novel genotoxic stress-884 inducible nuclear long noncoding RNAs in mamma-885 lian cells. PLoS One, 7, e34949.886 44. Frank, J., Jr & Gonzalez, R. L. (2010). Structure and887 dynamics of a processive Brownian motor: the trans-888 lating ribosome. Annu. Rev. Biochem. 79, 381–412.889 45. Sanbonmatsu, K. Y. (2012). Computational studies890 of molecular machines: the ribosome. Curr. Opin.891 Struct. Biol. 22, 168–174.892 46. Gavrilova, L.P., Kostiashkina,O. E., Koteliansky, V. E.,893 Rutkevitch, N. M. & Spirin, A. S. (1976). Factor-free894 (“non-enzymic”) and factor-dependent systems of895 translation of polyuridylic acid by Escherichia coli896 ribosomes. J. Mol. Biol. 101, 537–552.897 47. Kitao, A., Yonekura, K., Maki-Yonekura, S., Samatey,898 F. A., Imada, K., Namba, K. & Go, N. (2006). Switch899 interactions control energy frustration and multiple900 flagellar filament structures. Proc. Natl. Acad. Sci.901 USA, 103, 4894–4899.902 48. Chan, R. T., Robart, A. R., Rajashankar, K. R., Pyle,903 A. M. & Toor, N. (2012). Crystal structure of a group II904 intron in the pre-catalytic state. Nat. Struct. Mol. Biol.905 19, 555–557.906 49. Pyle, A. M. (2010). The tertiary structure of group II907 introns: implications for biological function and908 evolution.Crit. Rev. Biochem. Mol. Biol. 45, 215–232.909 50. Toor, N., Keating, K. S. & Pyle, A. M. (2009).910 Structural insights into RNA splicing. Curr. Opin.911 Struct. Biol. 19, 260–266.912 51. Reiter, N. J., Osterman, A., Torres-Larios, A.,913 Swinger, K. K., Pan, T. & Mondragon, A. (2010).914 Structure of a bacterial ribonuclease P holoenzyme915 in complex with tRNA. Nature, 468, 784–789.916 52. Mitchell, M., Gillis, A., Futahashi, M., Fujiwara, H. &917 Skordalakes, E. (2010). Structural basis for telome-918 rase catalytic subunit TERT binding to RNA template919 and telomeric DNA. Nat. Struct. Mol. Biol. 17,920 513–518.921 53. Zappulla, D. C. & Cech, T. R. (2004). Yeast922 telomerase RNA: a flexible scaffold for protein sub-923 units.Proc. Natl. Acad. Sci. USA, 101, 10024–10029.

92454. Batey, R. T., Gilbert, S. D. & Montange, R. K. (2004).925Structure of a natural guanine-responsive riboswitch926complexed with the metabolite hypoxanthine. Na-927ture, 432, 411–415.92855. Montange, R. K. & Batey, R. T. (2008). Riboswitches:929emerging themes in RNA structure and function.930Annu. Rev. Biophys. 37, 117–133.93156. Winkler, W., Nahvi, A. & Breaker, R. R. (2002).932Thiamine derivatives bind messenger RNAs directly933to regulate bacterial gene expression. Nature, 419,934952–956.93557. Tucker, B. J. & Breaker, R. R. (2005). Riboswitches936as versatile gene control elements. Curr. Opin.937Struct. Biol. 15, 342–348.93858. Hennelly, S. P. & Sanbonmatsu, K. Y. (2011).939Tertiary contacts control switching of the SAM-I940riboswitch. Nucleic Acids Res. 39, 2416–2431.94159. Stoddard, C. D., Montange, R. K., Hennelly, S. P.,942Rambo, R. P., Sanbonmatsu, K. Y. & Batey, R. T.943(2010). Free state conformational sampling of the944SAM-I riboswitch aptamer domain. Structure, 18,945787–797.94660. Hayes, R. L., Noel, J. K., Mohanty, U., Whitford, P. C.,947Hennelly, S. P., Onuchic, J. N. & Sanbonmatsu,948K. Y. (2012). Magnesium fluctuations modulate RNA949dynamics in the SAM-I riboswitch. J. Am. Chem. Soc.950134, 12043–12053.95161. Penchovsky, R. & Breaker, R. R. (2005). Computa-952tional design and experimental validation of oligonu-953cleotide-sensing allosteric ribozymes. Nat.954Biotechnol. 23, 1424–1433.95562. Guttman, M. & Rinn, J. L. (2012). Modular regulatory956principles of large non-coding RNAs. Nature, 482,957339–346.95863. Wan, Y., Qu, K., Ouyang, Z., Kertesz, M., Li, J.,959Tibshirani, R. et al. (2012). Genome-wide measure-960ment of RNA folding energies. Mol. Cell, 48,961169–181.96264. Appelt, K., Dijk, J., Reinhardt, R., Sanhuesa, S.,963White, S. W., Wilson, K. S. & Yonath, A. (1981). The964crystallization of ribosomal proteins from the 50S965subunit of the Escherichia coli and Bacillus stear-966othermophilus ribosome. J. Biol. Chem. 256,96711787–11790.96865. Schuwirth, B. S., Borovinskaya, M. A., Hau, C. W.,969Zhang, W., Vila-Sanjurjo, A., Holton, J. M. & Cate,970J. H. (2005). Structures of the bacterial ribosome at9713.5 Ǻ resolution. Science, 310, 827–834.97266. Pyle, A. M., Fedorova, O. & Waldsich, C. (2007).973Folding of group II introns: a model system for large,974multidomain RNAs? Trends Biochem. Sci. 32,975138–145.97667. Adams, P. L., Stahley, M. R., Kosek, A. B., Wang, J.977& Strobel, S. A. (2004). Crystal structure of a self-978splicing group I intron with both exons. Nature, 430,97945–50.98068. Toor, N., Keating, K. S., Taylor, S. D. & Pyle, A. M.981(2008). Crystal structure of a self-spliced group II982intron. Science, 320, 77–82.98369. Marcia, M. & Pyle, A. M. (2012). Visualizing group II984intron catalysis through the stages of splicing. Cell,985151, 497–507.98670. Gao, Y. G., Selmer,M., Dunham,C.M.,Weixlbaumer,987A., Kelley, A. C. & Ramakrishnan, V. (2009). The

13Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 14: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

988 structure of the ribosome with elongation factor G989 trapped in the posttranslocational state. Science,990 326, 694–699.991 71. Rodnina, M. V., Beringer, M. & Wintermeyer, W.992 (2006). Mechanism of peptide bond formation on the993 ribosome. Q. Rev. Biophys. 39, 203–225.994 72. Whitford, P. C., Ahmed, A., Yu, Y., Hennelly, S. P.,995 Tama, F., Spahn, C. M. et al. (2011). Excited states of996 ribosome translocation revealed through integrative997 molecular modeling. Proc. Natl Acad. Sci. USA, 108,998 18943–18948.999 73. Blanchard, S. C. (2009). Single-molecule observa-1000 tions of ribosome function. Curr. Opin. Struct. Biol.1001 19, 103–109.1002 74. Reiter, N. J., Chan, C. W. & Mondragon, A. (2011).1003 Emerging structural themes in large RNA molecules.1004 Curr. Opin. Struct. Biol. 21, 319–326.1005 75. Cole, D. I., Legassie, J. D., Bonifacio, L. N., Sekaran,1006 V. G., Ding, F., Dokholyan, N. V. & Jarstfer, M. B.1007 (2012). Newmodels of tetrahymena telomerase RNA1008 from experimentally derived constraints and model-1009 ing. J. Am. Chem. Soc. 134, 20070–20080.1010 76. Azzalin, C. M., Reichenbach, P., Khoriauli, L.,1011 Giulotto, E. & Lingner, J. (2007). Telomeric repeat1012 containing RNA and RNA surveillance factors at1013 mammalian chromosome ends. Science, 318,1014 798–801.1015 77. Schoeftner, S. & Blasco, M. A. (2008). Developmen-1016 tally regulated transcription of mammalian telomeres1017 by DNA-dependent RNA polymerase II. Nat. Cell1018 Biol. 10, 228–236.1019 78. Lau, N., Lim, L., Weinstein, E. & Bartel, D. (2001). An1020 abundant class of tiny RNAs with probable regulatory1021 roles in Caenorhabditis elegans. Science, 294,1022 858–862.1023 79. Reinhart, B., Weinstein, E., Rhoades, M., Bartel, B. &1024 Bartel, D. (2002). MicroRNAs in plants. Genes Dev.1025 16, 1616–1626.1026 80. Lewis, B., Shih, I., Jones-Rhoades, M., Bartel, D. &1027 Burge, C. (2003). Prediction of mammalian micro-1028 RNA targets. Cell, 115, 787–798.1029 81. Tang, G., Reinhart, B., Bartel, D. & Zamore, P.1030 (2003). A biochemical framework for RNA silencing1031 in plants. Genes Dev. 17, 49–63.1032 82. Zamore, P. (2002). Ancient pathways programmed1033 by small RNAs. Science, 296, 1265–1269.1034 83. Zamore, P. D. & Haley, B. (2005). Ribo-gnome: the1035 big world of small RNAs. Science, 309, 1519–1524.1036 84. Ghildiyal, M. & Zamore, P. D. (2009). Small silencing1037 RNAs: an expanding universe. Nat. Rev., Genet. 10,1038 94–108.1039 85. Tomari, Y. & Zamore, P. D. (2005). Perspective:1040 machines for RNAi. Genes Dev. 19, 517–529.1041 86. Martianov, I., Ramadass, A., Serra Barros, A., Chow,1042 N. & Akoulitchev, A. (2007). Repression of the1043 human dihydrofolate reductase gene by a non-1044 coding interfering transcript. Nature, 445, 666–670.1045 87. Bertani, S., Sauer, S., Bolotin, E. & Sauer, F. (2011).1046 The noncoding RNA mistral activates Hoxa6 and1047 Hoxa7 expression and stem cell differentiation by1048 recruiting MLL1 to chromatin. Mol. Cell, 43,1049 1040–1046.1050 88. Beltran, M., Puig, I., Pena, C., Garcia, J. M., Alvarez,1051 A. B., Pena, R. et al. (2008). A natural antisense

1052transcript regulates Zeb2/Sip1 gene expression1053during Snail1-induced epithelial–mesenchymal tran-1054sition. Genes Dev. 22, 756–769.105589. Cesana, M., Cacchiarelli, D., Legnini, I., Santini, T.,1056Sthandier, O., Chinappi, M. et al. (2011). A long1057noncoding RNA controls muscle differentiation by1058functioning as a competing endogenous RNA. Cell,1059147, 358–369.106090. Winkler,W. C., Cohen-Chalamish, S. &Breaker, R. R.1061(2002). An mRNA structure that controls gene1062expression by binding FMN. Proc. Natl Acad. Sci.1063USA, 99, 15908–15913.106491. Winkler, W. C., Nahvi, A., Sudarsan, N., Barrick, J. E.1065& Breaker, R. R. (2003). An mRNA structure that1066controls gene expression by binding S-adenosyl-1067methionine. Nat. Struct. Biol. 10, 701–707.106892. Montange, R. K. & Batey, R. T. (2006). Structure of1069the S-adenosylmethionine riboswitch regulatory1070mRNA element. Nature, 441, 1172–1175.107193. Gilbert, S. D., Rambo, R. P., Van Tyne, D. & Batey,1072R. T. (2008). Structure of the SAM-II riboswitch1073bound to S-adenosylmethionine. Nat. Struct. Mol.1074Biol. 15, 177–182.107594. Deigan, K. E., Li, T.W., Mathews, D. H. &Weeks, K. M.1076(2009). Accurate SHAPE-directed RNA structure de-1077termination. Proc. Natl Acad. Sci. USA, 106, 97–102.107895. Hamada, M., Sato, K. & Asai, K. (2010). Prediction of1079RNA secondary structure by maximizing pseudo-1080expected accuracy. BMC Bioinformatics, 11, 586.108196. Hamada,M., Sato, K. & Asai, K. (2011). Improving the1082accuracy of predicting secondary structure for aligned1083RNA sequences. Nucleic Acids Res. 39, 393–402.108497. Rivas, E., Lang, R. & Eddy, S. R. (2012). A range of1085complex probabilistic models for RNA secondary1086structure prediction that includes the nearest-neigh-1087bor model and more. RNA, 18, 193–212.108898. Bernhart, S. H., Hofacker, I. L., Will, S., Gruber, A. R.1089& Stadler, P. F. (2008). RNAalifold: improved1090consensus structure prediction for RNA alignments.1091BMC Bioinformatics, 9, 474.109299. Ghosh, S. K., Patton, J. R. & Spanjaard, R. A. (2012).1093A small RNA derived from RNA coactivator SRA1094blocks steroid receptor signaling via inhibition of1095Pus1p-mediated pseudouridylation of SRA: evi-1096dence of a novel RNA binding domain in the N-1097terminus of steroid receptors. Biochemistry, 51,10988163–8172.1099100. Xu, B., Yang,W. H.,Gerin, I., Hu, C. D., Hammer, G.D.1100& Koenig, R. J. (2009). Dax-1 and steroid receptor1101RNA activator (SRA) function as transcriptional1102coactivators for steroidogenic factor 1 in steroido-1103genesis. Mol. Cell. Biol. 29, 1719–1734.1104101. Colley, S. M. & Leedman, P. J. (2009). SRA and its1105binding partners: an expanding role for RNA-1106binding coregulators in nuclear receptor-mediated1107gene regulation. Crit. Rev. Biochem. Mol. Biol. 44,110825–33.1109102. Yao, H., Brick, K., Evrard, Y., Xiao, T., Camerini-1110Otero, R. D. & Felsenfeld, G. (2010). Mediation of1111CTCF transcriptional insulation by DEAD-box RNA-1112binding protein p68 and steroid receptor RNA1113activator SRA. Genes Dev. 24, 2543–2555.1114103. Woese, C. R., Magrum, L. J., Gupta, R., Siegel, R. B.,1115Stahl, D. A., Kop, J. et al. (1980). Secondary structure

14 Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

Page 15: Rise of the RNA Machines: Exploring the Structure …1 2 Rise of the RNA Machines: Exploring the Structure of 3 Long Non-Coding RNAs 4 5 Irina V. Q1 Novikova, Scott P. Hennelly, Chang-Shung

1116 model for bacterial 16S ribosomal RNA: phylogenetic,1117 enzymatic and chemical evidence.Nucleic Acids Res.1118 8, 2275–2293.1119 104. Noller, H. F. & Woese, C. R. (1981). Secondary1120 structure of 16S ribosomal RNA. Science, 212,1121 403–411.1122 105. Ben-Shem, A., Garreau de Loubresse, N., Melnikov,1123 S., Jenner, L., Yusupova, G. & Yusupov, M. (2011).1124 The structure of the eukaryotic ribosome at 3.0 Ǻ1125 resolution. Science, 334, 1524–1529.1126 106. Legiewicz, M., Badorrek, C. S., Turner, K. B., Fabris,1127 D., Hamm, T. E., Rekosh, D. et al. (2008).1128 Resistance to RevM10 inhibition reflects a confor-1129 mational switch in the HIV-1 Rev response element.1130 Proc. Natl Acad. Sci. USA, 105, 14365–14370.1131 107. Legiewicz, M., Zolotukhin, A. S., Pilkington, G. R.,1132 Purzycka, K. J., Mitchell, M., Uranishi, H. et al.1133 (2010). The RNA transport element of the murine1134 musD retrotransposon requires long-range intramo-1135 lecular interactions for function. J. Biol. Chem. 285,1136 42097–42104.1137 108. Purzycka, K. J., Legiewicz,M.,Matsuda,E.,Eizentstat,1138 L. D., Lusvarghi, S., Saha, A. et al. (2013). Exploring1139 Ty1 retrotransposon RNA structure within virus-like1140 particles. Nucleic Acids Res. 41, 463–473.1141 109. Watts, J. M., Dang, K. K., Gorelick, R. J., Leonard,1142 C. W., Bess, J. W., Jr, Swanstrom, R. et al. (2009).1143 Architecture and secondary structure of an entire1144 HIV-1 RNA genome. Nature, 460, 711–716.1145 110. Lucks, J. B., Mortimer, S. A., Trapnell, C., Luo, S.,1146 Aviran, S., Schroth, G. P. et al. (2011). Multiplexed1147 RNA structure characterization with selective 2′-1148 hydroxyl acylation analyzed by primer extension1149 sequencing (SHAPE-Seq). Proc. Natl Acad. Sci.1150 USA, 108, 11063–11068.1151 111. Spitale, R. C., Crisalli, P., Flynn, R. A., Torre, E. A.,1152 Kool, E. T. & Chang, H. Y. (2013). RNA SHAPE1153 analysis in living cells. Nat. Chem. Biol. 9, 18–20.1154 112. Wilusz, J. E. & Spector, D. L. (2010). An unexpected1155 ending: noncanonical 3′ end processing mecha-1156 nisms. RNA, 16, 259–266.1157 113. Beniaminov, A., Westhof, E. & Krol, A. (2008).1158 Distinctive structures between chimpanzee and1159 human in a brain noncoding RNA. RNA, 14,1160 1270–1275.1161 114. Pollard, K. S., Salama, S. R., Lambert, N., Lambot,1162 M. A., Coppens, S., Pedersen, J. S. et al. (2006). An1163 RNA gene expressed during cortical development1164 evolved rapidly in humans. Nature, 443, 167–172.1165 115. Pollard, K. S., Salama, S. R., King, B., Kern, A. D.,1166 Dreszer, T., Katzman, S. et al. (2006). Forces1167 shaping the fastest evolving regions in the human1168 genome. PLoS Genet. 2, e168.1169 116. Guttman, M., Donaghey, J., Carey, B. W., Garber,1170 M., Grenier, J. K., Munson, G. et al. (2011). lincRNAs1171 act in the circuitry controlling pluripotency and1172 differentiation. Nature, 477, 295–300.1173 117. Sanford, J. R., Wang, X., Mort, M., Vanduyn, N.,1174 Cooper, D. N., Mooney, S. D. et al. (2009). Splicing1175 factor SFRS1 recognizes a functionally diverse1176 landscape of RNA transcripts. Genome Res. 19,1177 381–394.

1178118. Zhao, J., Ohsumi, T. K., Kung, J. T., Ogawa, Y.,1179Grau, D. J., Sarma, K. et al. (2010). Genome-wide1180identification of polycomb-associated RNAs by RIP-1181seq. Mol. Cell, 40, 939–953.1182119. Surface, L. E., Thornton, S. R. & Boyer, L. A. (2010).1183Polycomb group proteins set the stage for early1184lineage commitment. Cell Stem Cell, 7, 288–298.1185120. Zhao, J., Sun, B. K., Erwin, J. A., Song, J. J. & Lee,1186J. T. (2008). Polycomb proteins targeted by a short1187repeat RNA to the mouse X chromosome. Science,1188322, 750–756.1189121. Wutz, A., Rasmussen, T. P. & Jaenisch, R. (2002).1190Chromosomal silencing and localization are mediat-1191ed by different domains of Xist RNA. Nat. Genet. 30,1192167–174.1193122. Bernstein, B. E., Birney, E., Dunham, I., Green, E. D.,1194Gunter, C. & Snyder, M. (2012). An integrated1195encyclopedia of DNA elements in the human1196genome. Nature, 489, 57–74.1197123. Jeon, Y., Sarma, K. & Lee, J. T. (2012). New and1198Xisting regulatory mechanisms of X chromosome1199inactivation. Curr. Opin. Genet. Dev. 22, 62–71.1200124. Lee, J. T. (2011). Gracefully ageing at 50, X-1201chromosome inactivation becomes a paradigm for1202RNA and chromatin control. Nat. Rev., Mol. Cell Biol.120312, 815–826.1204125. Kim, D. H., Jeon, Y., Anguera, M. C. & Lee, J. T.1205(2011). X-chromosome epigenetic reprogramming in1206pluripotent stem cells via noncoding genes. Semin.1207Cell Dev. Biol. 22, 336–342.1208126. Sasaki, Y. T., Ideue, T., Sano, M., Mituyama, T. &1209Hirose, T. (2009). MENepsilon/beta noncoding1210RNAs are essential for structural integrity of nuclear1211paraspeckles. Proc. Natl Acad. Sci. USA, 106,12122525–2530.1213127. Gutschner, T., Hammerle, M., Eissmann, M., Hsu, J.,1214Kim, Y., Hung, G. et al. (2012). The non-coding RNA1215MALAT1 is a critical regulator of the metastasis1216phenotype of lung cancer cells. Cancer Res. 73,12171180–1189.1218128. Miyagawa, R., Tano, K., Mizuno, R., Nakamura, Y.,1219Ijiri, K., Rakwal, R. et al. (2012). Identification of cis-1220and trans-acting factors involved in the localization of1221MALAT-1 noncoding RNA to nuclear speckles. RNA,122218, 738–751.1223129. Naganuma, T., Nakagawa, S., Tanigawa, A., Sasaki,1224Y. F., Goshima, N. & Hirose, T. (2012). Alternative 3′-1225end processing of long noncoding RNA initiates1226construction of nuclear paraspeckles. EMBO J. 31,12274020–4034.1228130. Souquere, S., Beauclair, G., Harper, F., Fox, A. &1229Pierron, G. (2010). Highly ordered spatial organiza-1230tion of the structural long noncoding NEAT1 RNAs1231within paraspeckle nuclear bodies. Mol. Biol. Cell,123221, 4020–4027.1233131. Rodnina, M. V. & Wintermeyer, W. (2001). Fidelity of1234aminoacyl-tRNA selection on the ribosome: kinetic1235and structural mechanisms. Annu. Rev. Biochem.123670, 415–435.1237132. Rodnina, M. V. &Wintermeyer, W. (2001). Ribosome1238fidelity: tRNA discrimination, proofreading and in-1239duced fit. Trends Biochem. Sci. 26, 124–130.

1240

15Review: Structure of RNAs and RNA MachinesQ2

Please cite this article as: Novikova, I. et al., Rise of the RNA Machines: Exploring the Structure of Long Non-Coding RNAs, J. Mol.Biol. (2013), http://dx.doi.org/10.1016/j.jmb.2013.02.030

kys2
new reference 1: Wilusz, J. E., Jnbaptiste, C. K., Lu, L. Y., Kuhn, C.-D., Joshua-Tor, L., and Sharp, P. A. A triple helix stabilizes the 3' ends of long noncoding RNAs that lack poly(A) tails (2012). Genes Dev., 26, 2392-2407. new reference 2: Brown, J. A., Valenstein, M. L., Yario, T. A., Tycowski, K. T., and Steitz, J. A. (2012). Formation of triple-helical structures by the 3'-end sequences of MALAT1 and MEN beta noncoding RNAs. Proc. Natl. Acad. Sci. USA, 109, 19202-19207.