Many of the RNA molecules in bacteria and virtually all RNA molecules in eukaryotes are processed to some degree after synthesis. Processing can include addition or deletion of nucleotide sequences as well as chemical modification of RNA nucleotides. All of these events can be used to control the posttranscriptional fate of the RNA in the cell. As a result, many mature RNAs are not exact copies of the DNA genes from which they were transcribed. Some of the most interesting molecular events in RNA metabolism occur during posttranscriptional processing. Intriguingly, several of the enzymes that catalyze these reactions have active sites composed of RNA rather than protein. The discovery of these catalytic RNAs, or ribozymes, has brought a revolution in thinking about RNA function and about the origin of life, as we will discuss in Section 26.4.
A newly synthesized RNA molecule is called a primary or precursor transcript. Perhaps the most extensive processing of primary transcripts occurs in eukaryotic precursor mRNAs (pre-mRNAs) and in the tRNAs of both bacteria and eukaryotes. However, many ncRNAs are also processed.
The precursor transcript for a eukaryotic mRNA typically contains sequences encompassing one gene, although the sequences encoding the polypeptide may not be contiguous. Noncoding tracts that break up the coding region of the transcript are called introns, and the coding segments are called exons (see the discussion of introns and exons in DNA in Chapter 24). In a process called RNA splicing, the introns are removed from the pre-mRNA, and the exons are spliced together to form a continuous sequence that specifies a functional polypeptide. Virtually all human genes contain introns, the average being eight introns per gene. Eukaryotic mRNAs are also modified at each end. A modified nucleotide structure called a cap is added at the end. The end is cleaved, and 80 to 250 A residues are added to create a poly(A) “tail.” The sometimes elaborate protein complexes that carry out capping, splicing, and polyadenylation mRNA-processing reactions do not operate independently. They are organized in association with each other and with the phosphorylated CTD of Pol II; each complex affects the function of the others, as outlined in Figure 26-12.
Proteins involved in mRNA transport to the cytoplasm are also associated with the mRNA in the nucleus, and the processing of the transcript is coupled to its transport. In effect, a eukaryotic mRNA, as it is synthesized, is ensconced in an elaborate and dynamic supramolecular messenger ribonucleoprotein (mRNP) complex comprising dozens of proteins. The composition of the mRNP changes as the transcript is processed, transported to the cytoplasm, and delivered to the ribosome for translation. Associated proteins can dramatically modulate the cellular destination, function, and fate of an mRNA.
In addition to splicing and and end modification, individual purine and pyrimidine nucleotides within primary transcripts can undergo chemical modification. Many eukaryotic mRNAs contain modified nucleotides that affect their interactions with RNA-binding proteins and regulate gene expression; however, RNA modification has been best characterized in primary tRNA transcript processing. Many bases and sugars in tRNAs are modified in both bacteria and eukaryotes, including with unusual bases not found in other nucleic acids (see Fig. 26-22). Many ncRNAs also undergo elaborate processing, often involving the removal of segments from one or both ends.
The ultimate fate of any RNA is its complete and regulated degradation. The rate of turnover of RNAs plays a critical role in determining their steady-state levels and the rate at which cells can shut down expression of a gene when its product is no longer needed. During the development of multicellular organisms, for example, certain proteins must be expressed at one stage only, and the mRNA encoding such a protein must be made and destroyed at the appropriate times.
Most eukaryotic mRNAs have a cap, a residue of 7-methylguanosine linked to the -terminal residue of the mRNA through an unusual ,-triphosphate linkage (Fig. 26-13). The cap helps protect mRNA from ribonucleases. It also binds to specific cap-binding complexes of proteins and participates in binding of the mRNA to the ribosome to initiate translation (Chapter 27).
The cap is formed by condensation of a molecule of GTP with the triphosphate at the end of the transcript. The guanine is subsequently methylated at N-7, and additional methyl groups are often added at the hydroxyls of the first and second nucleotides adjacent to the cap (Fig. 26-13a). The methyl groups are derived from S-adenosylmethionine. All these reactions (Fig. 26-12b) occur very early in transcription, after the first 20 to 30 nucleotides of the transcript have been added. All four of the enzymes in the cap-synthesizing complex, and through them the end of the transcript itself, are associated with the RNA polymerase II CTD (Fig. 26-13c) until the cap is synthesized. The capped end is then released from the cap-synthesizing complex and bound by the nuclear cap-binding complex, which facilitates both splicing and nuclear export of the RNA.
The cap does not provide permanent protection of the transcript. Eukaryotes also contain cellular decapping enzymes, which are important for RNA regulation. Cap removal allows RNAs to be degraded by exonucleases that hydrolyze the RNA in the direction. Some viruses have also evolved elaborate mechanisms for removing the cap from host mRNAs. The influenza virus needs no specialized enzymes for the synthesis of caps on its viral RNAs; instead, it borrows these structures from host-cell transcripts in a process termed “cap-snatching.” A capped host transcript is bound by the viral RNA polymerase and cleaved by an endonuclease. The influenza RNA polymerase can then use the resulting capped oligonucleotide to prime viral RNA synthesis.
In bacteria, the mRNA used for translation is generally a direct copy of the DNA gene sequence, continuing along the DNA template without interruption until the information needed to specify the polypeptide is complete. However, the notion that all genes are continuous was disproved in 1977 when Phillip Sharp and Richard Roberts independently discovered that many genes for polypeptides in eukaryotes are interrupted by noncoding sequences (introns).
The vast majority of genes in vertebrates contain introns; among the few exceptions are those that encode histones. The occurrence of introns in other eukaryotes varies. Many genes of the yeast Saccharomyces cerevisiae lack introns, but introns are more common in some other yeast species. Introns are also found in a few bacterial and archaeal genes. Introns in DNA are transcribed along with the rest of the gene by RNA polymerases. The introns in the primary RNA transcript are then spliced, and the exons are joined to form a mature, functional RNA. In eukaryotic mRNAs, most exons are less than 1,000 nucleotides long, with many in the 100 to 200 nucleotide size range, encoding stretches of 30 to 60 amino acids within a longer polypeptide. Introns vary in size from 50 to more than 700,000 nucleotides, with a median length of about 1,800. Genes of higher eukaryotes, including humans, typically have much more DNA devoted to introns than to exons. For example, the human dystrophin gene encodes a pre-mRNA more than 2 million nucleotides long. However, the final mRNA is 14,000 nucleotides long, indicating that more than 99% of the transcribed RNA is found within introns and removed by splicing. Deficiencies in dystrophin expression can lead to muscular dystrophies. The ~20,000 genes of the human genome include more than 200,000 introns.
There are four classes of introns (Table 26-3). The first two, the group I and group II introns, differ in the details of their splicing mechanisms but share one surprising characteristic: they are self-splicing — no proteins are needed to carry out catalysis. The introns found in the nuclear-encoded genes of eukaryotes comprise the third class. These pre-mRNA introns are removed by a large RNP called the spliceosome. Although the spliceosome requires dozens of proteins for its function, its active site includes RNA. The final class of introns requires protein enzymes for their removal. These introns are found in some tRNAs as well as certain mRNAs, such as that encoding the Xbox binding protein 1, Xbp1. Protein-mediated splicing of the XBP1 transcript regulates the cellular response to unfolded proteins that occurs under conditions of endoplasmic reticulum stress in human cells. The mechanisms of tRNA and XBP1 mRNA splicing are similar.
Mechanism | Components | Features | Cellular locations |
---|---|---|---|
Group I Intron |
Catalytic RNA |
Self-splicing using a guanine-derived cofactor |
Found in nuclear, mitochondrial, and chloroplast genes that encode mRNAs, rRNAs, or tRNAs. Can be found in bacteria. |
Group II Intron |
Catalytic RNA; maturase and reverse transcriptase proteins |
Self-splicing using a nucleophile within the intron to form a lariat |
Primarily found in mitochondrial and chloroplast genes of fungi, algae, and plants. Can be found in bacteria. |
Spliceosome |
Catalytic snRNAs; dozens of protein splicing factors |
Requires a large RNP for processing using a nucleophile within the intron to form a lariat |
Found in nuclear genes of eukaryotes. Capable of alternative splicing to create multiple products from a given transcript. |
Protein-catalyzed |
Protein enzymes |
Uses a splicing endonuclease and ligase |
Found in tRNAs and a few mRNAs. |
Group I introns are found in some nuclear, mitochondrial, and chloroplast genes that code for rRNAs, mRNAs, and tRNAs. Group II introns are generally found in the primary transcripts of mitochondrial or chloroplast mRNAs in fungi, algae, and plants. Group I and group II introns are also among the rare examples of introns in bacteria. The splicing mechanisms in both groups involve two transesterification reaction steps (Fig. 26-14), in which a ribose - or -hydroxyl group makes a nucleophilic attack on a phosphorus, and a new phosphodiester bond is formed at the expense of the old, maintaining the balance of energy. These reactions are very similar to the DNA breaking and rejoining reactions promoted by topoisomerases (see Fig. 24-18) and site-specific recombinases (see Fig. 25-37).
The group I splicing reaction requires a guanine nucleoside or nucleotide cofactor, but the cofactor is not used as a source of energy; instead, the -hydroxyl group of guanosine is used as a nucleophile in the first step of the splicing pathway. In group II splicing reactions, the nucleophile is the -hydroxyl group of an A residue within the intron (Fig. 26-14a). A branched lariat structure is formed as an intermediate (Fig. 26-14b). In both group I introns and group II introns, the hydroxyl of the exon that is displaced in the first step then acts as a nucleophile in a similar reaction at the end of the intron. The result is precise excision of the intron and ligation of the exons.
Self-splicing of introns was first revealed in 1982 in studies of the splicing mechanism of the group I rRNA intron from the ciliated protozoan Tetrahymena thermophila, conducted by Thomas Cech and colleagues. These workers transcribed isolated Tetrahymena DNA (including the intron) in vitro, using purified bacterial RNA polymerase. The resulting RNA spliced itself accurately without any protein enzymes from Tetrahymena. The discovery that RNAs could have catalytic functions was a milestone in our understanding of biological systems and a major step forward in the understanding of how life probably evolved. Catalytic RNAs like group I and group II introns share many features with protein-based enzymes, including folding into well-defined secondary and tertiary structures (Fig. 26-15). Catalytic RNAs and their significance in evolution are described in greater detail in Section 26.4.
In eukaryotes, most introns undergo splicing by the same lariat-forming mechanism as the group II introns. However, the intron splicing takes place within a spliceosome, a large complex made up of multiple specialized RNP complexes called small nuclear ribonucleoproteins (snRNPs, pronounced snurps) and dozens of non-snRNP proteins. Each snRNP contains one of a class of eukaryotic RNAs, 100 to 200 nucleotides long, known as small nuclear RNAs (snRNAs). Five snRNAs (U1, U2, U4, U5, U6) involved in splicing reactions are generally found in abundance in eukaryotic nuclei. The U3 snRNP is also found in the nucleus but is involved in ribosome assembly and is not part of the spliceosome.
The role of snRNPs in the splicing reaction was discovered by Joan Steitz in a remarkable example of “bedside-to-bench” science. Using antibodies isolated from patients with autoimmune diseases, members of the Steitz laboratory were able to purify the spliceosome’s snRNP components and identify the associated snRNAs. Based on complementarity between the end of the U1 snRNA and the splice site of nuclear pre-mRNA introns (Fig. 26-16), Steitz proposed that snRNPs participate in the splicing reaction. Subsequently, it was discovered that patients suffering from the autoimmune disease lupus can generate antibodies against protein components of their own spliceosomes.
In yeast, the various snRNPs include about 80 different proteins, most of which have close homologs in all other eukaryotes. In humans, these conserved protein components are augmented by more than 200 additional proteins, which mostly participate in regulating the splicing reaction. Spliceosomes are thus among the most complex supramolecular machines in any eukaryotic cell. The RNA components of a spliceosome are the catalysts of the various splicing steps. The overall complex can be considered a highly flexible ribonucleoprotein enzyme that can adapt to the great diversity in size and sequence of nuclear pre-mRNAs.
Spliceosomal introns generally have the dinucleotide sequence GU at the end and AG at the end, marking the sites where splicing occurs. The U1 snRNA contains a sequence complementary to the splice site (Fig. 26-16b), and the U1 snRNP binds to this region by forming an RNA duplex with the pre-mRNA. A U2 snRNP binds to the end, also by base-pairing, and identifies the A residue that becomes the nucleophile used during the first transesterification reaction (Fig. 26-14). Addition of a complex of the U4, U5, and U6 snRNPs, called the tri-snRNP, leads to formation of the spliceosome (Fig. 26-16c).
Key parts of the splicing active site found in the U6 snRNA are initially sequestered by base-pairing to parts of U4 snRNA to prevent aberrant cleavage of nontarget phosphodiester bonds. In a process called activation, the U6 and U4 snRNAs must be unwound and separated to expose the active site needed for the first step in splicing. Unwinding of U4 and U6 as well as many other steps in splicing require ATP hydrolysis by a set of eight different ATPases that are part of the splicing machinery.
Spliceosomes are single turnover enzymes, meaning that each spliceosome can remove only one intron from a single transcript. As a result, spliceosomes undergo a complex cycle of assembly, activation, catalysis, product release, and recycling of the snRNP components each time an intron is removed (Fig. 26-16c, steps through ).
The chemical events of splicing — splice site cleavage by formation of an intron lariat followed by exon ligation — are identical in mechanism to that of group II introns, despite the former requiring dozens of proteins for activity and the latter being a self-splicing ribozyme. The similarities in chemistry as well as conservation between essential RNA components of each enzyme suggests that group II introns and spliceosomes are evolutionarily related to one another. Comparison of x-ray crystal structures of group II introns and cryo-EM structures of spliceosomes provide strong support for this hypothesis. Despite being surrounded by a large protein scaffold, the catalytic center of the spliceosome is composed of RNA and arranged in a nearly identical manner to that of group II introns (Fig. 26-17). Thus, the spliceosome uses a ribozyme core to carry out pre-mRNA splicing. As we shall see, some group II introns also contain domains that are themselves translated as mRNAs and encode proteins that bear striking similarity to those in the spliceosome, strengthening this evolutionary connection.
About 1% of human introns are spliced by a less common type of spliceosome, called the minor spliceosome, in which the U1, U2, U4, and U6 snRNPs are replaced by the U11, U12, U4atac, and U6atac snRNPs. Whereas U1- and U2-containing spliceosomes remove introns with ()GU and AG() terminal sequences, the minor spliceosomes remove a rare class of introns that have ()AU and AC() terminal sequences to mark the splice sites. Introns removed by either the major or the minor spliceosome most often remain in the nucleus and are degraded.
Some components of the splicing apparatus are tethered to the CTD of RNA polymerase II, indicating that splicing, like other RNA processing reactions, is tightly coordinated with transcription. Most splicing in humans occurs cotranscriptionally, meaning that splicing occurs while Pol II is still transcribing the gene. For this to occur correctly, the rates of transcription, capping, splicing, and end formation must be carefully regulated. Splicing of a pre-mRNA in the nucleus can also have profound effects on the function of the mRNA in the cytoplasm. Lynne Maquat and Melissa Moore discovered that the human spliceosome leaves behind a set of proteins on each spliced mRNA near the junction between two exons. This exon junction complex is retained on the mRNA as it is exported to the cytoplasm, where it can regulate the extent to which an mRNA can be translated over its lifetime before degradation.
A fourth and final class of introns, found in certain tRNAs and a few mRNAs such as XBP1, is distinguished from other intron types in that the splicing reaction requires endonucleases and ligases made of protein and does not involve catalytic RNAs. The splicing endonuclease cleaves the phosphodiester bonds at both ends of the intron, and the two exons are joined by a mechanism similar to the DNA ligase reaction (see Fig. 25-15).
At their end, most eukaryotic mRNAs undergoing translation in the cell cytoplasm have a string of A residues, about 30 residues in yeast and 50 to 100 in animals, called the poly(A) tail. This tail serves as a binding site for one or more specific proteins. The poly(A) tail and its associated proteins have a variety of roles in coordinating transcription and translation, and may help protect mRNA from enzymatic destruction. Many bacterial mRNAs also acquire poly(A) tails, but these tails stimulate decay of mRNA rather than protecting it from degradation.
The poly(A) tail is added in a multistep process. The transcript is extended beyond the site where the poly(A) tail is to be added, then is cleaved at the poly(A) addition site by an endonuclease component of a large enzyme complex, again associated with the CTD of RNA polymerase II (Fig. 26-18). The mRNA site where cleavage occurs is marked by two sequence elements: the highly conserved sequence ()AAUAAA(), 10 to 30 nucleotides on the side (upstream) of the cleavage site, and a less well-defined sequence rich in G and U residues, 20 to 40 nucleotides downstream of the cleavage site. Cleavage generates the free -hydroxyl group that defines the end of the mRNA, to which A residues are immediately added by polyadenylate polymerase, which catalyzes the reaction
where to 250. This enzyme does not require a template but does require the cleaved mRNA as a primer. These longer poly(A) tails are added in the nucleus, and then shortened significantly after the mRNA is transported to the cytoplasm.
The overall processing of a typical eukaryotic mRNA is summarized in Figure 26-19. In some cases, the polypeptide-coding region of the mRNA is also modified by RNA “editing” (see Section 27.1 for details). This editing includes processes that add or delete bases in the coding regions of primary transcripts or that change the sequence (such as by enzymatic deamination of a C residue to create a U residue). A particularly dramatic example occurs in trypanosomes, which are parasitic protists: large regions of an mRNA are synthesized without any uridylate, and the U residues are inserted later by RNA editing.
One of the paradoxes of modern genomics is that the apparent complexity of organisms does not correlate with the number of protein-coding genes, or even the amount of genomic DNA. Some eukaryotic mRNA transcripts can be processed in more than one way to produce different mRNAs and thus different polypeptides. Much of the variability in processing is the result of alternative splicing, in which a particular exon may or may not be incorporated into the mature mRNA transcript. Alternative splicing occurs in a relatively small number of transcripts in yeast, but in more than 95% of human genes. Changes in alternative splicing can have profound impact on the development of an organism (Box 26-2). Alternative splicing of a single transcription factor in the staple grain quinoa differentiates palatable sweet varieties from those too bitter to ingest without processing. In Drosophila, sex is determined by alternative splicing of the sex lethal (Sxl) transcript based on the number of X chromosomes present in the cell.
Figure 26-20a illustrates how alternative splicing patterns can produce more than one protein from a common pre-mRNA. The pre-mRNA contains molecular signals for all the alternative processing pathways, and the pathway favored in a given cell or metabolic situation is determined by processing factors, RNA-binding proteins that promote one particular path. For example, splicing regulatory proteins or heterogeneous ribonuclear proteins (hnRNPs) may bind these signals and promote or inhibit spliceosome assembly at that site. There are many additional patterns of alternative splicing.
Part a shows a gray horizontal bar of D N A that is divided into regions. From left to right, these regions are numbered 1 through 5. Arrows indicate that the border between regions 1 and 2 is a 5 prime splice site, that the borders between regions 2 and 3 and between regions 3 and 4 are both 3 prime splice sites, and that the border between regions 4 and 5 is a poly (A) site. An arrow pointing downward is labeled transcription, 5 prime capping, splicing, cleavage, and polyadenylation. This arrows splits. The left-hand arrow has an arrow branching off to show the loss of green rectangle 2. This yields a mature m R N A that is horizontal bar with a narrow vertical red rectangle at the left side attached from left to right to green regions 1, 3, and 4. The right-hand arrow has an arrow branching off to show the loss of green rectangle 2 and adjacent green rectangle 3. This yields a mature m R N A that is horizontal bar with a narrow vertical red rectangle at the left side attached from left to right to green regions 1 and 4. Part b shows a gray horizontal bar of D N A that is divided into regions. From left to right, these regions are numbered 1 through 3. Two poly (A) sites are indicated by arrows. These arrows indicate that the border between regions 1 and 2 is A subscript 1 end subscript and the border between regions 2 and 3 is A subscript 2 end subscript. An arrow pointing downward is labeled on the left as cleavage, 5 prime capping, cleavage, and polyadenylation at A subscript 1 end subscript and on the right as cleavage, 5 prime capping, cleavage, and polyadenylation at A subscript 2 end subscript. The arrow branches to the left and right. The left-hand product is a mature m R N A that is a horizontal bar with a narrow vertical red rectangle on the left attached to green region 1 bonded to A A A (A) subscript italicized n end italics end subscript. The right-hand product is similar but contains green regions 1 and 2.
Complex transcripts can also have more than one site where poly(A) tails can form (Fig. 26-20b). If there are two or more sites for cleavage and polyadenylation, use of the one closest to the end will remove more of the primary transcript sequence. This mechanism, called poly(A) site choice, generates diversity in the variable domains of immunoglobulin heavy chains (see Fig. 25-42).
Both alternative splicing and poly(A) site choice come into play in the expression of many genes. For example, a single RNA transcript is processed using both mechanisms to produce two different hormones: the calcium-regulating hormone calcitonin in rat thyroid and calcitonin-gene-related peptide (CGRP) in rat brain (Fig. 26-21). Together, alternative splicing and poly(A) site choice greatly increase the variety of proteins generated from the genomes of higher eukaryotes.
A horizontal bar labeled D N A has a narrow red vertical bar on the left. From left to right, it has blue region 1, a gray region, narrow blue region 2, a gray region, narrow blue region 3, a gray region, purple region 4 labeled calcitonin, a gray region dark blue region 5 labeled C G R P, blue region 6, and a bright blue region at the right end. Arrows indicate that the boundary between regions 1 and the gray region to its right is a 5 prime splice site, the boundary between the same gray region and blue region 2 is a 3 prime splice site, the boundary between purple region 4 and the gray region to its right is a poly (A) site, and the boundary between region 6 and the bright blue region is a poly (A) site. Arrows point down on the left and right accompanied by text reading, transcription, 5 prime capping, splicing, cleavage, and polyadenylation. The left-hand arrow is labeled thyroid. The product is a mature m R N A with a narrow vertical red bar at the left, followed by green regions 1, 2, and 3, then purple region 4, then a narrow vertical orange bar, then A A A (A) subscript italicized n end italics end subscript. An arrow labeled translation points down to show a strand that is green on the left and purple on the right. An arrow labeled protease action points down. This yields a purple strand labeled calcitonin. The right-hand arrow is labeled brain. The product is a mature m R N A with a narrow vertical red bar at the left, followed by green regions 1, 2, and 3, then dark blue region 5, then green region 6, then a narrow vertical orange bar, then A A A (A) subscript italicized n end italics end subscript. An arrow labeled translation points down to show a strand that is green on the left, dark blue in the middle, and green on the right. An arrow labeled protease action points down. This yields a dark blue strand labeled CGRP.
Posttranscriptional processing is not limited to mRNA. Ribosomal RNAs of bacterial, archaeal, and eukaryotic cells are made from longer precursors called pre-ribosomal RNAs, or pre-rRNAs. Transfer RNAs are similarly derived from longer precursors. These RNAs may also contain a variety of modified nucleosides; some examples are shown in Figure 26-22.
4-thiouridine (S superscript 4 end superscript U): A six-membered ring has N substituted for C at position 1 at the bottom vertex bonded to a box labeled ribose below, C 2 at the lower left vertex double bonded to O, N substituted for C at position 3 at the upper left vertex and bonded to H, C 4 at the top vertex double bonded to S in a light red box, and a double bond between C 5 and C 6 at the right side. Inosine: A six-membered ring has N substituted for C at position 1 at the upper left vertex and bonded to H, N substituted for C at position 3 at the bottom vertex, a double bond at the lower left side between positions 2 and 3, C 6 at the top vertex double bonded to O, and a double bond between positions 4 and 5 on the right side shared with the left side of a five-membered ring to the right. The five-membered ring has N substituted for C at position 7 at the upper right vertex, a double bond between positions 7 and 8 at the upper right side, and N substituted for C at position 9 at the bottom vertex and further bonded to a box labeled ribose. 1-methylguanosine (m superscript 1 end superscript G): A six-membered ring has N substituted for C at position 1 at the upper left vertex and bonded to C H 3 in a light red box, N substituted for C at position 3 at the bottom vertex, C 2 at the lower left vertex bonded to N H 2, a double bond at the lower left side between positions 2 and 3, C 6 at the top vertex double bonded to O, and a double bond between positions 4 and 5 on the right side shared with the left side of a five-membered ring to the right. The five-membered ring has N substituted for C at position 7 at the upper right vertex, a double bond between positions 7 and 8 at the upper right side, and N substituted for C at position 9 at the bottom vertex and further bonded to a box labeled ribose. Italicized N end italics superscript 6 end superscript-isopentenyladenosine (i superscript 6 end superscript A): A six-membered ring has N substituted for C at position 1 at the upper left vertex, N substituted for C at position 3 at the bottom vertex, a double bond at the lower left side between positions 2 and 3, a double bond at the upper left side between positions 1 and 6, C 6 at the top vertex bonded to N H further bonded to a red highlighted substituent of C H 2 C H double bonded to C bonded to C H 3 to the upper and lower right, and a double bond between positions 4 and 5 on the right side shared with the left side of a five-membered ring to the right. The five-membered ring has N substituted for C at position 7 at the upper right vertex, a double bond between positions 7 and 8 at the upper right side, and N substituted for C at position 9 at the bottom vertex and further bonded to a box labeled ribose. Ribothymidine (T): A six-membered ring has N substituted for C at position 1 at the bottom vertex bonded to a box labeled ribose below, C 2 at the lower left vertex double bonded to O, N substituted for C at position 3 at the upper left vertex and bonded to H, C 4 at the top vertex double bonded to O, C 4 bonded to C H 3, and a double bond between C 5 and C 6 at the right side. Pseudouridine (psi): A six-membered ring has N in a red box substituted for C at position 1 at the bottom vertex and bonded to H, C 2 at the lower left vertex double bonded to O, N substituted for C at position at 3 the upper left vertex and bonded to H, C 4 at the top vertex double bonded to O, C 4 at the upper right vertex bonded to a red box labeled ribose, and a double bond between C 5 and C 6 at the right side. Dihydrouridine (D): A six-membered ring has N substituted for C at position 1 at the bottom vertex bonded to a box labeled ribose below, C 2 at the lower left vertex double bonded to O, N substituted for C at position 3 at the upper left vertex and bonded to H, C 4 at the top vertex double bonded to O, C 5 bonded to H in a light red box and H not in a box, and C 6 bonded to H in a light red box and H not in a box.
In bacteria, 16S, 23S, and 5S rRNAs (and some tRNAs, although most tRNAs are encoded elsewhere) arise from a single 30S RNA precursor of about 6,500 nucleotides. RNA at both ends of the 30S precursor and segments between the rRNAs are removed during processing (Fig. 26-23). The 16S and 23S rRNAs contain modified nucleosides. In E. coli, the 11 modifications in the 16S rRNA include a pseudouridine and 10 nucleosides methylated on the base or the -hydroxyl group or both. The 23S rRNA has 10 pseudouridines, 1 dihydrouridine, and 12 methylated nucleosides. In bacteria, each modification is generally catalyzed by a distinct enzyme. Methylation reactions use S-adenosylmethionine as cofactor. No cofactor is required for pseudouridine formation.
A bar labeled pre-r R N A transcript (30 S) has the following components from left to right: a short yellow region, a long green 16 S region, a short yellow region, a short green t R N A (4 S) region, a short yellow region, a long green 23 S region, a short yellow region, a green 5 S region, and a short yellow region. An arrow points downward labeled 1 and accompanied by text reading, methylation and pseudouridine or dihydrouridine formation. This yield a similar structure with short blue, red, and black vertical lines extending downward from the 16 S and 23 S regions as follows. 16 S region: blue vertical line just before the midpoint, then 10 red lines across the right side. 23 S region: red line at the boundary with the yellow region to the left, then blue line next to black line, then a mix of 11 red lines and 9 blue lines across the rest of the length of the region. Arrows labeled 1, 2, and 3 point downward as follows: arrow 1 points near the boundary between the left-hand yellow region and the 16 S region, arrow 1 points near the boundary between the 16 S region and the yellow region to its right, arrow 2 points near the boundary between the same yellow region and the t R N A (4 S) region, arrow 1 points near the yellow region just before its boundary with the 23 S region, arrow 1 points near the boundary of the 23 S region with the yellow region to its right, arrow 3 points near the boundary between the same yellow region and the 5 S region, and arrow 3 points near the boundary between the 5 S region and the yellow region to its right. An arrow pointing downward is labeled cleavage. This yields intermediates four intermediates as follows: A bar with narrow yellow bars on either side of a 17 S green region; t R N A with a green left half and yellow right half; a bar with narrow yellow bars on either side of a green 25 S region, and a bar with narrow yellow bars on either side of a green 5 S region. Arrows point down from each of these intermediates to show the mature R N A s formed. An arrow points downward from the piece containing the 17 S region and is labeled 3 and blue highlighted nucleases. This yields 16 S r R N A with no yellow bars on the ends. An arrow points down from the piece with t R N A to yield a green piece labeled t R N A. An arrow points downward from the piece containing the 25 S region and is labeled 3 and blue highlighted nucleases. This yields green 23 S r R N A with no yellow bars. An arrow points down from the piece containing the 5 S region to yield a green piece labeled 5 S r R N A with no yellow bars.
The genome of E. coli encodes seven pre-rRNA molecules. All of these genes have essentially identical rRNA-coding regions, but they differ in the segments between these regions. The segment between the 16S and 23S rRNA genes generally encodes one or two tRNAs, with different tRNAs produced from different pre-rRNA transcripts. Coding sequences for tRNAs are also found on the side of the 5S rRNA in some precursor transcripts.
The situation in eukaryotes is even more complicated (see Fig. 27-17). The entire process is initiated in the nucleolus, in large complexes that assemble on the rRNA precursor as it is synthesized by Pol I. There is a tight coupling between rRNA transcription, rRNA maturation, and ribosome assembly in the nucleolus. Each complex includes the ribonucleases that cleave the rRNA precursor, the enzymes that modify particular bases, large numbers of ncRNAs called small nucleolar RNAs, or snoRNAs, that guide nucleoside modification and some cleavage reactions, and ribosomal proteins. In yeast, the entire process involves the pre-rRNA, more than 170 nonribosomal proteins, snoRNAs for each nucleoside modification (about 70, because some snoRNAs guide two types of modification), and the 78 ribosomal proteins. Humans have an even greater number of modified nucleosides, about 200, and a greater number of associated snoRNAs. The composition of the complexes may change as the ribosomes are assembled, and many of the intermediate complexes rival the ribosome itself in complexity. The 5S rRNA of most eukaryotes is made as a completely separate transcript by a different polymerase (Pol III).
The most common nucleoside modifications in eukaryotic rRNAs are conversion of uridine to pseudouridine and adoMet-dependent nucleoside methylation (often at -hydroxyl groups). These reactions often rely on snoRNA-protein complexes, or snoRNPs, each consisting of a snoRNA and four or five proteins, including the enzyme that carries out the modification. There are two classes of snoRNPs, both defined by key conserved sequence elements referred to as lettered boxes. The box H/ACA snoRNPs function in pseudouridylylation, and box C/D snoRNPs in -O-methylations. The snoRNAs are 60 to 300 nucleotides long. Each snoRNA includes a 10 to 21 nucleotide sequence that is perfectly complementary to some site on an rRNA and serves to identify the modification site (Fig. 26-24). The conserved sequence elements in the remainder of the snoRNA fold into structures that are bound by the snoRNP proteins.
A light green horizontal strand begins with a small red piece that joins a green piece that bends upward. This piece is paired with a strand to its right to form a vertical structure. The two strands separate in the middle, then rejoin up to a loop at the top. The right-hand strand of this structure bends horizontally to connect to green box H, which contains A N A N N A, then continues horizontally before bending vertically to form a similar structure. The right-hand side of this second structure bends horizontally to the right at its bottom to join green A C A, which contains A C A and then joins to N N N that ends at O H (3 prime). To the right, 3 prime then three dots connect to a dark green strand labeled r R N A that runs into the opening in the middle of the left-hand vertical structure. It forms a loop with N next to psi in a pink box at the top, then curves down and exits at the lower right before entering the similar loop to the right to form a similar structure with N and psi in a pink box at the top before leaving to the lower right to end with three dots to 5 prime. The light green strands are labeled s n o R N A.
Most cells have 40 to 50 distinct tRNAs, and eukaryotic cells have multiple copies of many of the tRNA genes. Transfer RNAs are derived from longer RNA precursors by enzymatic removal of nucleotides from the and ends (Fig. 26-25). In eukaryotes, introns are present in a few tRNA transcripts and must be excised. Where two or more different tRNAs are contained in a single primary transcript, they are separated by enzymatic cleavage. The endonuclease RNase P, found in all organisms, removes RNA at the end of tRNAs. This enzyme contains both protein and RNA. The RNA component is essential for activity, and in bacterial cells it can carry out its processing function with precision even without the protein component. RNase P is another example of a catalytic RNA, as described in more detail below. The end of tRNAs is processed by one or more nucleases, including the exonuclease RNase D.
A primary transcript is shown with a 5 prime end to the left at the start of a horizontal yellow segment reading p G U U A U C A G U U A A U U G A. The strand turns green and bends vertically at a location labeled R N ase P cut. This vertical segment is paired with a segment to the right. From top to bottom, it is C U C U C G G. These are all paired with complementary bases to the right except for U, which has a dot between it and G to the right. At the bottom of this paired region, the strand bends to the left with U A before reaching a horizontal region paired with a region below that contains bases G C C from right to left. The strand continues to form a loop with A A G U U G G U U U A A before reaching the bases that pair to the upper half of the loop, G G and C. The strand bends downward at G and then forms a vertical paired region. From top to bottom, the left-hand bases are C A A G A. The strand bends to the left with C, then back to the right with U just above a dot between the left- and right-hand strands between the region just below this U and G to the right. A vertical region of paired bases follows, with G U A from top to bottom. The loop bends out with A, then turns yellow and forms a loop of U U U A C C A C before reaching the right side of the vertical paired section. This contains U A C before reaching the G next to the dot. The yellow piece curves outward with A A, then turns green and curves back inward with A before reaching the next paired piece. From bottom to top, this is U C U U G. The green strand bends right with A G A, then up to U, then diagonally up to C then to G that begins a paired horizontal sequence that continues with G G C G before forming a loop of U U C G A C U. Next, the top part of the paired sequence of this right-hand loop begins from right to left as C G C C C, then the strand curves upward to the paired sequence of C C G, then G with a dot between it and U on the left, then G A G. It then reaches an unpaired A, then turns yellow at a point labeled R N ase D cut before ending with U U ending with O H 3 prime. An arrow pointing to the right reads, base modification, 5 prime cleavage, 3 prime cleavage, C C A addition. This yields an intermediate in which the horizontal yellow piece at the upper left has been removed and the two yellow Us at the 3 prime end have been removed. Some of the bases have been changed. The structure now begins with a green p C labeled 5 prime at the top of the top vertical segment paired with a segment to the right. From top to bottom including the first base, it is p C U C U C G G. These are all paired with complementary bases to the right except for U, which has a dot between it and G to the right. At the bottom of this paired region, the strand bends to the left with U red highlighted m G before reaching a horizontal region paired with a region below that contains bases G C C from right to left. The strand continues to form a loop with A A G red highlighted D red highlighted D red highlighted m G G red lighted D red highlighted D red highlighted D A A before reaching the bases that pair to the upper half of the loop, G G and C. The strand bends downward at red highlighted m G and then forms a vertical paired region. From top to bottom, the left-hand bases are C A A G A. The strand bends to the left with C, then back to the right with U just above a dot between the left- and right-hand strands between the region just below this U and G to the right. A vertical region of paired bases follows, with G red highlighted psi A from top to bottom. The loop bends out with A, then turns yellow and forms a loop of U U U A C C A C before reaching the right side of the vertical paired section. This contains U A C before reaching the G next to the dot. The yellow piece curves outward with A A, then turns green and curves back inward with A before reaching the next paired piece. From bottom to top, this is red highlighted psi C U U G. The green strand bends right with A G A, then up to red highlighted D, then diagonally up to red highlighted m C then to G that begins a paired horizontal sequence that continues with G G C G before forming a loop of red highlighted T red highlighted psi C G red highlighted m A C U. Next, the top part of the paired sequence of this right-hand loop begins from right to left as C G C C C, then the strand curves upward to the paired sequence of C C G, then G with a dot between it and U on the left, then G A G. It then reaches an unpaired A, then turns red to end with C C A ending with O H 3 prime. An arrow labeled splicing points to the right. This yields mature t R N A superscript T y r end superscript. This is a similar molecule in which the yellow portion has been removed. This yields a structure that begins with a green p C labeled 5 prime at the top of the top vertical segment paired with a segment to the right. From top to bottom including the first base, it is p C U C U C G G. These are all paired with complementary bases to the right except for U, which has a dot between it and G to the right. At the bottom of this paired region, the strand bends to the left with U red highlighted m G before reaching a horizontal region paired with a region below that contains bases G C C from right to left. The strand continues to form a loop with A A G red highlighted D red highlighted D red highlighted m G G red lighted D red highlighted D red highlighted D A A before reaching the bases that pair to the upper half of the loop, G G and C. The strand bends downward at red highlighted m G and then forms a vertical paired region. From top to bottom, the left-hand bases are C A A G A. The strand bends to the left with C, then forms a loop with U G red highlighted psi A A A. A vertical region of paired bases follows, highlighted psi C U U G. The piece bends out with A G A, then up with red highlighted D, then back in with red highlighted m C, then reaches a horizontal paired piece of G G G C G before forming a loop of red highlighted T red highlighted psi C G red highlighted m A C U. Next, the top part of the paired sequence of this right-hand loop begins from right to left as C G C C C, then the strand curves upward to the paired sequence of C C G, then G with a dot between it and U on the left, then G A G. It then reaches an unpaired A, then turns red to end with C C A ending with O H 3 prime.
Transfer RNA precursors may undergo further posttranscriptional processing. The -terminal trinucleotide CCA() to which an amino acid is attached during protein synthesis (Chapter 27) is absent from some bacterial tRNA precursors and all eukaryotic tRNA precursors and is added during processing (Fig. 26-25). This addition is carried out by tRNA nucleotidyltransferase, an unusual enzyme that binds the three ribonucleoside triphosphate precursors in separate active sites and catalyzes formation of the phosphodiester bonds to produce the CCA() sequence. The creation of this defined sequence of nucleotides is therefore not dependent on a DNA or RNA template — the template is the binding site of the enzyme.
The final type of tRNA processing is the modification of some bases by methylation, deamination, or reduction (Fig. 26-22). These modifications can change how the tRNA interacts with cellular proteins and even how the tRNA is used by the ribosome during translation. In the case of pseudouridine, the base (uracil) is removed and reattached to the sugar through C-5. Some of these modified bases occur at characteristic positions in all tRNAs (Fig. 26-25).
The number of known classes of special-function noncoding RNAs (ncRNAs) is expanding rapidly, as is the variety of functions known to be associated with them. Many of these ncRNAs also undergo processing.
The snRNAs and snoRNAs not only facilitate RNA processing reactions but also are themselves synthesized as larger precursors and then processed. Many snoRNAs are encoded within the introns of other genes. As the introns are spliced from the pre-mRNA, proteins bind to the snoRNA sequences and ribonucleases remove the extra RNA at the and ends to form the snoRNP. The snRNAs destined for spliceosomes are synthesized as pre-snRNAs, and ribonucleases remove the extra RNA at each end. Particular nucleosides in snRNAs are also subject to 11 types of modification, with -O-methylation and conversion of uridine to pseudouridine predominating.
MicroRNAs (miRNAs) are a special class of noncoding RNAs involved in gene regulation. The miRNAs are about 22 nucleotides long, complementary in sequence to particular regions of mRNAs. Found in plants and in animals, from worms to mammals, they promote mRNA degradation and suppress translation to fine-tune gene expression. About 1,500 human genes encode miRNAs, and one or more of these miRNAs affect the expression of most protein-coding genes.
The miRNAs are synthesized from much larger precursors, in several steps (Fig. 26-26). The primary transcripts for miRNAs (pri-miRNAs) vary greatly in size; some are encoded in the introns of other genes and are coexpressed with these host genes. Processing of pri-miRNA is mediated by two endoribonucleases in the RNase III family, Drosha and Dicer. First, in the nucleus, the pri-miRNA is reduced to a 70 to 80 nucleotide precursor miRNA (pre-miRNA) by a protein complex including Drosha and another protein, DGCR8. The pre-miRNA is then exported to the cytoplasm in a complex with two proteins, exportin-5 and the Ran GTPase (see Fig. 27-42). In the cytoplasm, Ran hydrolyzes the GTP, then exportin-5 and the pre-miRNA are released. The pre-miRNA is then acted on by Dicer to produce the nearly mature miRNA paired with a short RNA complement. The complement is removed by an RNA helicase, and the mature miRNA is incorporated into protein complexes, such as the RNA-induced silencing complex (RISC), which then bind a target mRNA. If the complementarity between miRNA and its target is nearly perfect, the target mRNA is cleaved. If the complementarity is only partial, the complex blocks translation of the target mRNA. The roles of miRNAs and RISC in gene regulation are detailed in Chapter 28.
Two blue horizontal lines represent double stranded D N A in the nucleus of a cell. There is a gray region in the center. Beneath this gray region, there is a green arrow pointing to the right along the length of the gray region. An arrow points down labeled blue highlighted R N A polymerase Roman numeral 2 complex and transcription. This yields p r i-m i R N A (s), which are shown as four vertical structures with loops. A strand begins with a small horizontal red piece, then turns green before bending upward to form the first vertical structure. It has a vertical region with five paired sets of bases, then forms an oval loop, then has a vertical region with three paired sets of bases, then forms another loop. The left-hand side of this loop is yellow and begins a yellow strand on the left side. Another vertical region above has three paired sets of bases and is yellow on the left and green on the right. This is followed by another loop with a yellow left side and a green right side. Above, there is another vertical region of four sets of paired bases that is yellow on the left and green on the right. Above this, the left-hand strand turns green and there is an entirely green top loop. At the lower right of this vertical structure, it has a horizontal green piece that reaches two wavy vertical lines that show a break, then there are three more identical vertical structures before the right end of the right-hand structure becomes horizontal and ends at A A A (A) subscript italicized n end italics end subscript. An arrow points down to show a complex that contains a single one of these vertical structures in the center with a small red piece at its horizontal left end and a string of A A A A A at its horizontal right end. A pink bent oval labeled Drosha covers the three central loops and a dark purple oval labeled D G C R 8 beneath covers the bottom center of the structure with the horizontal pieces of the strand extending out to the left and right. An arrow labeled cleavage points down and is joined by an arrow from the left that forms the top part of a cycle of arrows. This left-hand arrow begins in the cytoplasm at an orange oval bonded to G D P below with a rectangular cutout in its top into which a light orange structure fits that has a rectangular base and then a curved portion that extends out to the upper left and right. This is at the seven o’clock position of the circle of arrows. An arrow points up from this structure into the nucleus accompanied by a curved arrow showing the loss of G D P and a curved line showing the addition of G T P. This joins the arrow labeled cleavage to yield a structure that includes a vertical piece of pre- m i R N A. The bottom part of the vertical structure is gone, so it has only two central loops and one top loop. The left side ends at the end of the yellow piece at the bottom of the bottom loop, which formerly had a green loop beneath, and the right side has a short vertical green strand that extends beneath the bottom loop. This structure has its bottom loop in a dark orange oval bonded to G T P below and its middle loop in the light orange green piece above. The complex is labeled exportin-5 and R a n. An arrow points down across the nuclear envelop and is labeled export. An arrow branches off to show the loss of P subscript i end subscript. A second arrow branches off to show exportin-5 and R a n bound to G D P at the seven o’clock position of the circle of arrows, ready to repeat the cycle. The third arrow points down to the pre- m I R N A within a light blue oval wider and flatter at the top than at the bottom labeled dicer. Only the top of the top green circle is visible above. Arrows point to the place where the top of the yellow strand meets the base of the top green circle and to the middle of the right-hand side of the top vertical paired region. An arrow labeled cleavage points down and to the left. An arrow labeled blue highlighted R N A helicase points down and to the left. This yields a structure that is horizontal with the two halves partially separated. The top strand is yellow and the bottom strand is green. There is a paired horizontal piece on the right, then a circle, then two horizontal pieces that have been pulled apart, then a circle that had been at the bottom before that has been pulled farther apart. An arrow labeled R I S C loading points down to a purple oval containing a yellow strand with very short green lines extending down labeled mature m i R N A. An arrow points down labeled target selection and splits in half. The top half points to a structure labeled near-perfect complementarity. This contains a yellow m R N A with a loop near the right side bonded to a dark green strand below with a red tip outside of the purple oval to the left and a right end that joins to A A A (A) subscript italicized n end italics). An arrow points from this to show a green strand with a red left end and a short yellow strand with A A A (A) subscript italicized n end italics end subscript at its right side. An arrow pointing downward is labeled m R N A cleavage and this yields many small green and yellow fragments. The second arrow from target selection points to a product labeled partial complementarity. This is similar to near-perfect complementarity except that there are two loops in the yellow strand that do not pair to the green strand below. An arrow points down to show the same structure above and arrow pointing down to text reading, translational expression. All data are approximate.
The expression of genes is regulated at many levels. A crucial factor governing a gene’s expression is the cellular concentration of its associated mRNA. The concentration of any molecule depends on two factors: its rate of synthesis and its rate of degradation. When synthesis and degradation of an mRNA are balanced, the concentration of the mRNA remains in a steady state. A change in either rate will lead to net accumulation or depletion of the mRNA. Degradative pathways ensure that mRNAs do not build up in the cell and direct the synthesis of unnecessary proteins.
The rates of degradation vary greatly for mRNAs from different eukaryotic genes. For a gene product that is needed only briefly, the half-life of its mRNA may be only minutes or even seconds. Gene products needed constantly by the cell may have mRNAs that are stable over many cell generations. The average half-life of the mRNAs of a vertebrate cell is about 3 hours, with the pool of each type of mRNA turning over about 10 times per cell generation. The half-life of bacterial mRNAs is much shorter — only about 1.5 min — perhaps because of regulatory requirements.
Messenger RNA is degraded by ribonucleases present in all cells. In E. coli, mRNAs typically contain triphosphates remaining from the initiation of transcription. These groups protect the mRNA from degradation. As a result, mRNA decay begins with one or several cuts by an endoribonuclease, followed by degradation by exoribonucleases (Fig. 26-27). The initial cut by the endonuclease generates an RNA fragment with a monophosphate end, which serves to tether the endonuclease to the transcript and ensure its rapid destruction. Some bacteria (Bacillus subtilis, for example), have exonucleases that also recognize the monophosphate end and can degrade RNA fragments in the direction.
The left end of a horizontal green strand begins with 5 prime p p p beneath a gray box that reads, 5 prime end is protected by p p p. An arrow points up to a position on the center left of the green strand. At the right end, the green strand runs up vertically to form a loop and then runs back down to the 3 prime end. Text in a gray box reads, 3 prime end is protected by a hairpin. An arrow points down accompanied by text reading, blue highlighted R N ase E end highlighted (italicized E. coli end italics) and blue highlighted R N ase Y end highlighted (italicized B. subtilis end italics). An accompanying gray box contains text reading, Endonucleases create unprotected ends for exonuclease action. The green strand now has a break at the location previously indicated by the arrow. The left-hand piece has p p p at its left end. The right-hand piece has p at its left end. Two arrows point downward accompanied by text in a gray box reading, fragment is degraded by 3 prime to 5 prime exo nucleases. The left-hand arrow is labeled blue highlighted R N ase E end highlighted (italicized E. coli end italics) and yields a green horizontal strand with a stem loop at the right end with a space to its left labeled R N ase E cut before a dashed green line that runs left for a short distance, then curves down to end at p below above a light blue sphere. An accompanying gray box contains text that reads, R N ase E is tethered to the transcript to promote rapid endonucleolytic degradation. The right-hand arrow is labeled blue highlighted R N ase J end highlighted (italicized B. subtilis end italics). It yields a green horizontal piece with a stem loop at its right end and a left end that has three short green dashes before ending at p. An accompanying gray box contains text that reads, R N ase J degrades transcript in 5 prime to 3 prime direction.
Polynucleotide phosphorylase (PNPase) is a common exoribonuclease responsible for the degradation of many mRNAs in bacteria, chloroplasts, and mitochondria. It catalyzes the reversible phosphorolysis (rather than hydrolysis) of the mRNA chain using orthophosphate as the nucleophile. The PNPase reaction is readily reversible and the enzyme can also add nucleotides to the ends of bacterial mRNAs. Decay of mRNAs containing complex end structures, such as the hairpins responsible for -independent transcription termination (see Fig. 26-7), can involve multiple rounds of lengthening and shortening of the mRNA by PNPase until it is finally consumed. This unusual nontemplated RNA polymerization activity of PNPase proved to be critical for production of mRNA polymers used for deciphering the genetic code (Chapter 27).
As we have previously seen with transcription and RNA processing, the analogous processes for RNA degradation in eukaryotes are much more complex than their bacterial counterparts. Eukaryotes have multiple pathways for mRNA decay, and the pathway used can depend on the mRNA location, its structure, its association with ribosomes, and other factors. However, in most cases, decapping the end and shortening the poly(A) tail are critical steps for allowing exonucleases to access the mRNA.
All eukaryotes also have large exoribonucleases called exosomes, which are responsible for the degradation for nearly all types of RNA. Exosomes are multisubunit complexes containing about 10 proteins. Specialized exosomes exist in the nucleus, cytoplasm, and nucleolus. The core of the exosome is a barrel-like structure through which RNA is threaded (Fig. 26-28). Even though this core is structurally similar to bacterial PNPase, RNA is not degraded within the barrel. Instead, the barrel serves as an adapter that efficiently channels the RNA to associated enzymes with exonuclease and endonuclease activity.
Part a shows a purple R N A helicase with a rounded portion at the top and two vertical pieces to the left with pink A T P ase running horizontally between them with a piece that extends up between the two vertical pieces of the purple R N A helicase. A yellow rounded exosome core is below with a gray nuclease visible extending below the yellow exosome core and with a diagonal gray piece labeled nuclease to the right. An R N A substrate is shown as a vertical green molecule that begins near the top of the R N A helicase and runs downward into A T P ase. A dashed line runs almost vertically down through the rest of the A T P ase and exosome core, where it forms a more standard R N A structure as it continues down into the nuclease, where it bends horizontally. Part b shows a purple rounded rectangular structure with a portion that bends up diagonally to the right at the top. This is labeled R N A helicase. Beneath, a rounded pink structure is labeled A T P ase. A taller structure below labeled exosome core is yellow and roughly cylindrical with a wider top than bottom. A blue rounded piece with a smaller rounded top to the upper right with a circle in is it labeled nuclease. A vertical cylinder extends from near the top of R N A helicase down into A T P ase, narrowing toward the bottom. It joins an more irregularly-shaped piece in the exosome core that is rounded at the top, then bends to the right and then upward into the nuclease. A green R N A substrate is shown vertically above and running through this passageway through the parts of the structure.