The study of posttranscriptional processing of RNA molecules led to one of the most exciting discoveries in biochemistry — the existence of RNA enzymes or ribozymes. The best-characterized ribozymes are the self-splicing group I introns, RNase P, and the hammerhead ribozyme (discussed below). Most of the activities of these ribozymes are based on two fundamental reactions: transesterification (Fig. 26-14) and phosphodiester bond hydrolysis (cleavage). The substrate for a ribozyme is often an RNA molecule, and it may even be part of the ribozyme itself. When its substrate is RNA, the RNA catalyst can make use of base-pairing interactions to align the substrate for the reaction.
Like protein enzymes, ribozymes vary greatly in size. A self-splicing group I intron may have more than 400 nucleotides. In comparison, the hammerhead ribozyme consists of two RNA strands with a total of just 41 nucleotides. Also as with protein enzymes, the three-dimensional structure of ribozymes is important for function. Ribozymes, like protein enzymes, are inactivated by heating above their melting temperature or by the addition of denaturing agents or complementary oligonucleotides, which disrupt normal base-pairing patterns. Ribozymes can also be inactivated if essential nucleotides are changed. The secondary structure of a self-splicing group I intron from the 26S rRNA precursor of Tetrahymena is shown in detail in Figure 26-37. This secondary structure highlights the large number of base-pairing and other noncovalent interactions that must occur for the ribozyme to adopt a catalytic structure. Just as amino acid mutations can change activities of protein enzymes, nucleotide mutations can alter the noncovalent interactions required for ribozyme folding and catalysis.
Ribozymes share several properties with enzymes besides accelerating the reaction rate, including kinetic behavior and specificity. Binding of the guanosine cofactor to the Tetrahymena group I rRNA intron is saturable () and can be competitively inhibited by -deoxyguanosine. The intron is very precise in its excision reaction, largely due to a segment called the internal guide sequence that can base-pair with exon sequences near the splice site (Fig. 26-37). This pairing promotes the alignment of specific bonds to be cleaved and rejoined.
Because the intron itself is chemically altered during the splicing reaction — its ends are cleaved — it may seem to lack one key enzymatic property: the ability to catalyze multiple reactions. Closer inspection has shown that after excision, the 414 nucleotide intron from Tetrahymena rRNA can, in vitro, act as a true enzyme (but in vivo it is quickly degraded). A series of intramolecular cyclization and cleavage reactions in the excised intron leads to the loss of 19 nucleotides from its end. The remaining 395 nucleotide, linear RNA — referred to as L-19 IVS (intervening sequence) — promotes nucleotidyl transfer reactions in which some oligonucleotides are lengthened at the expense of others (Fig. 26-38). The best substrates are oligonucleotides, such as a synthetic oligomer, that can base-pair with the same guanylate-rich internal guide sequence that held the exon in place for self-splicing.
The enzymatic activity of the L-19 IVS ribozyme results from a cycle of transesterification reactions mechanistically similar to self-splicing. Each ribozyme molecule can process about 100 substrate molecules per hour and is not altered in the reaction — that is, the intron acts as a catalyst. It follows Michaelis-Menten kinetics, is specific for RNA oligonucleotide substrates, and can be competitively inhibited. The (specificity constant) is , lower than that of many protein enzymes, but the ribozyme accelerates hydrolysis by a factor of relative to the uncatalyzed reaction. It makes use of substrate orientation, covalent catalysis, and metal-ion catalysis — all strategies shared with protein enzymes.
E. coli RNase P has both an RNA component (the M1 RNA, with 377 nucleotides) and a protein component (). In 1983, Sidney Altman and Norman Pace and their coworkers discovered that under some conditions, the M1 RNA alone is capable of catalysis, cleaving tRNA precursors at the correct position. The protein component apparently serves to stabilize the RNA or facilitate its function in vivo. The RNase P ribozyme recognizes the three-dimensional shape of its pre-tRNA substrate, along with the CCA sequence, and thus can cleave the leaders from diverse tRNAs (Fig. 26-25).
The known catalytic repertoire of ribozymes continues to expand. Some virusoids, small RNAs associated with plant RNA viruses, include a structure that promotes a self-cleavage reaction; the small hammerhead ribozyme illustrated in Figure 26-39 is in this class, catalyzing the hydrolysis of an internal phosphodiester bond. There are at least nine structural classes of ribozymes that engage in self-cleavage; all use general acid and base catalysis (Fig. 6-8) to promote the attack of a -hydroxyl group on an adjacent phosphodiester bond. Despite being surrounded by proteins, the splicing reaction that occurs in a spliceosome relies on a catalytic center formed by the U2, U5, and U6 snRNAs and intron (see Figs 26-16 and 26-17). And, as we shall see in Chapter 27, an RNA component of ribosomes catalyzes the synthesis of proteins. Exploring catalytic RNAs has provided new insights into catalytic function in general and has important implications for our understanding of the origin and evolution of life on this planet.
The extraordinary complexity and order that distinguish living from inanimate systems are key manifestations of fundamental life processes. Maintaining the living state requires that selected chemical transformations occur very rapidly — especially those that use environmental energy sources and synthesize elaborate or specialized cellular macromolecules. Life depends on powerful and selective catalysts — enzymes — and on informational systems capable of both securely storing the blueprint for these enzymes and accurately reproducing the blueprint for generation after generation. Chromosomes encode the blueprint not for the cell but for the enzymes that construct and maintain the cell. The parallel demands for information and catalysis present a classic conundrum: what came first, the information needed to specify structure or the enzymes needed to maintain and transmit the information?
How might a self-replicating polymer come to be? How might it maintain itself in an environment where the precursors for polymer synthesis are scarce? How could evolution progress from such a polymer to the modern DNA-protein world? These difficult questions can be addressed by careful experimentation, providing clues about how life on Earth began and evolved.
The unveiling of the structural and functional complexity of RNA led Carl Woese, Francis Crick, and Leslie Orgel to propose in the 1960s that this macromolecule might serve as both information carrier and catalyst. Since that time, at least six lines of evidence have given increasing substance to their RNA world hypothesis.
The probable origin of purine and pyrimidine bases is suggested by experiments designed to test hypotheses about prebiotic chemistry (pp. 31–32). Beginning with simple molecules thought to be present in the early atmosphere (, , , ), electrical discharges mimicking lightning generate, first, more reactive molecules such as HCN and aldehydes, then an array of amino acids and organic acids (see Fig. 1-33). When molecules such as HCN become abundant, purine and pyrimidine bases are synthesized in detectable amounts. Remarkably, a concentrated solution of ammonium cyanide, refluxed for a few days, generates adenine in yields of up to 0.5% (Fig. 26-40). Adenine may well have been the first and most abundant nucleotide constituent to appear on Earth. Intriguingly, most enzyme cofactors contain adenosine as part of their structure, although it plays no direct role in the cofactor function (see Fig. 8-41). This may suggest an evolutionary relationship. Based on the simple synthesis of adenine from cyanide, adenine may simply have been abundant and available.
In an “RNA world,” RNAs, not proteins, act as catalysts. Perhaps more than anything else, the discovery of ribozymes gave life to the RNA world hypothesis and led to widespread speculation that an RNA world might have been important in the transition from prebiotic chemistry to life (see Fig. 1-35). The parent of all life on this planet, in the sense that it could reproduce itself across the generations from the origin of life to the present, might have been a self-replicating RNA, or a polymer with equivalent chemical characteristics.
A self-replicating polymer would quickly use up available supplies of precursors provided by the relatively slow processes of prebiotic chemistry. Thus, from an early stage in evolution, metabolic pathways would be required to generate precursors efficiently, with the synthesis of precursors presumably catalyzed by ribozymes. The extant ribozymes found in nature have a limited repertoire of catalytic functions, and of the ribozymes that may once have existed, no trace is left. To explore the RNA world hypothesis more deeply, we need to know whether RNA has the potential to catalyze the many different reactions needed in a primitive system of metabolic pathways.
The search for RNAs with new catalytic functions has been aided by the development of a method that rapidly searches pools of random polymers of RNA and extracts those with particular activities; known as SELEX, this is nothing less than accelerated evolution in a test tube (Box 26-4). It has been used to generate RNA molecules that bind to amino acids, organic dyes, nucleotides, cyanocobalamin, and other molecules. Researchers have isolated ribozymes that catalyze ester and amide bond formation, reactions, metallation of (addition of metal ions to) porphyrins, and carbon–carbon bond formation. The evolution of enzymatic cofactors with nucleotide “handles” that facilitate their binding to ribozymes might have further expanded the repertoire of chemical processes available to primitive metabolic systems.
As we shall see in Chapter 27, some natural RNA molecules, components of ribosomes, catalyze the formation of peptide bonds, offering a glimpse of how the RNA world might have been transformed by the greater catalytic potential of proteins. The evolution of a capacity to synthesize proteins would have been a major event in the RNA world, allowing the generation of polymers that could greatly stabilize complex RNA structures. However, the onset of peptide synthesis would also have hastened the demise of the RNA world. Proteins simply have more catalytic potential. The information-carrying role of RNA may have passed to DNA because DNA is chemically more stable. RNA replicase and reverse transcriptase may be modern versions of enzymes that once played important roles in making the transition to the modern DNA-based system.
The known functions of RNA continue to multiply with each decade. Retroviruses, other RNA viruses, and retrotransposons inhabit a semi-independent universe, maintaining a parasitic existence within the biosphere. For evolutionary biologists, these almost-living entities provide a window on key steps in the evolution of life. Transposons may have been an early innovation in an RNA world. With the appearance of the first, inefficient self-replicators, transposition could have been an important alternative to replication as a strategy for successful reproduction and survival. Early parasitic RNAs would simply hop into a self-replicating molecule via catalyzed transesterification, then passively undergo replication. Natural selection would have driven transposition to become site-specific, targeting sequences that did not interfere with the catalytic activities of the host RNA. Replicators and RNA transposons could have existed in a primitive symbiotic relationship, each contributing to the evolution of the other. Modern introns, retroviruses, and transposons may all be vestiges of a “piggyback” strategy pursued by early parasitic RNAs. These elements continue to make major contributions to the evolution of their hosts.
The RNA world hypothesis requires a nucleotide polymer to reproduce itself. Can a ribozyme bring about its own synthesis in a template-directed manner? Researchers are getting closer to finding such a ribozyme or ribozyme system. For example, Gerald Joyce and colleagues, in 2009, reported on the first set of two ribozymes that could cross-catalyze each other’s formation (Fig. 26-41). One ribozyme, E, catalyzes the joining of two oligonucleotides ( and ) to create a second, complementary ribozyme called . could then catalyze the joining of two other oligonucleotides (A and B) to form another molecule of E. In this system, the formation of E and was templated, and the amounts grew exponentially as long as substrates were available and proteins were absent. The system evolved so that more-efficient enzymes appeared in the population. A more general RNA-polymerase-like ribozyme was described in 2011 by Philipp Holliger and colleagues.
In the center of part a, a molecule is shown that has a blue top strand labeled E and an orange bottom strand labeled E prime. The top strand has a long vertical protrusion upward on the left, the right strand has a long vertical protrusion downward on the right, and there are two small loops in the main strand. An arrow points counterclockwise to the three o’clock position accompanied by a reverse arrow indicating a reversible reaction. This yields E prime only. An arrow points counterclockwise to a structure at the ten o’clock position accompanied by a reverse arrow indicating a reversible reaction. The forward reaction is accompanied by a curved line showing the addition of a structure resembling E divided into two parts with a green piece labeled A on the left and a separate yellow piece labeled B on the right. The reverse reaction is accompanied by a branched arrow showing the loss of the same structure. This reaction yields E prime beneath A and B. An arrow points from this to the starting structure of blue E above orange E prime. A short clockwise arrow points from this structure to E by itself at the three o’clock position of a second cycle of arrows below. A reverse arrow is also present. Another clockwise arrow points from this position to the nine o’clock position, accompanied by a short line showing the addition of a structure resembling E prime divided into two halves, which are purple B prime on the left and red A prime on the right. The reverse reaction has a branched arrow showing the loss of these structures. This yields a structure with E above B prime to the left and A prime to the right. An arrow points from this back to the starting structure. All data are approximate. Part b shows a structure consisting of dark green A on the left, light green B on the upper right, and orange E prime below. There is a gap between U bonded to O H with a red pair of electrons on O at the end of the dark green A strand and p p p G at the beginning of the light green B strand. A red arrow points from the unpaired electrons to p p p G. The full structure is as follows. The top strand shows a green 5 prime strand labeled A that begins with unpaired G, then G above a dot, then paired U U C A U G U before bending upward with G bonded to C above bonded to U hydrogen bonded to A to the right below C hydrogen bonded to G to the right, then G with a dot to the right also bonded to A hydrogen bonded to U to the right and to U above with a dot to the right beneath U bonded to A to the right that begins a vertical string of hydrogen bonded nucleotides that continues as G U U A C before forming an unpaired top loop of G U A A and then returning down the right side with G U A A C A G with a dot to the left U, then a short loop to U to the right, then U with a dot to the left, then G bonded to C to the left, then A bonded to U to the left, then a slight loop of A U G before a new vertical strand begins of bonded G G U then U with a dot beneath, then a slight loop of G A A, then G with a dot beneath, then A A both hydrogen bonded below, then U with dot beneath bonded to O H above with a pair of red electrons on O. Light green B is to the right and begins with ppp G bonded to A below bonded to G with a dot beneath then to A C C all hydrogen bonded below with the final C bonded to G with a dot beneath bonded to C A A C U U all hydrogen bonded below before ending at unpaired A at the 3 prime end. From left to right, the orange bottom strand is 3 prime C U U A U with a dot above bonded A A G U A C G bonded to C C A G with a dot above, then a loop of A G bonded to U with a dot above, then U U both hydrogen bonded above, then G with a dot above, then a loop of A A G, then U with a dot above, then bonded U G G, then a bend vertically to G U A then bonded A G then U with a dot to the right, then U slightly to the left, then U bonded to the right, then G with a dot to the right, then A C A A G A U U C all bonded to the right before a loop of A A U G before the upward complementary strand of G A A U C U U G U U with a dot to the left then A bonded to the left and bonded to G above with e wdot to the left, then C and U bonded to the left, then C bonded to G above followed by U with a dot above connected to a bonded chain of G U U G A A ending in unbonded G G at the 5 prime end.
Although the RNA world remains a hypothesis, with many gaps yet to be explained, experimental evidence supports a growing list of its key elements. Further experimentation should increase our understanding. Important clues to the puzzle will be found in the workings of fundamental chemistry, in living cells, and perhaps on other planets.