1.2 Chemical Foundations in Chapter 1 The Foundations of Biochemistry

1.2 Chemical Foundations

Biochemistry aims to explain biological form and function in chemical terms. During the first half of the twentieth century, parallel biochemical investigations of glucose breakdown in yeast and in animal muscle cells revealed remarkable chemical similarities between these two apparently very different cell types; for example, the breakdown of glucose in yeast and in muscle cells involved the same 10 chemical intermediates and the same 10 enzymes. Subsequent studies of many other biochemical processes in many different organisms have confirmed the generality of this observation, neatly summarized in 1954 by the biochemist Jacques Monod: “What is true of E. coli is true of the elephant.” The current understanding that all organisms share a common evolutionary origin is based in part on this observed universality of chemical intermediates and transformations, often termed “biochemical unity.”

Fewer than 30 of the more than 90 naturally occurring chemical elements are known to be essential to organisms. Most of the elements in living matter have a relatively low atomic number; only three have an atomic number above that of selenium, 34 (Fig. 1-11). The four most abundant elements in living organisms, in terms of percentage of total number of atoms, are hydrogen, oxygen, nitrogen, and carbon, which together make up more than 99% of the mass of most cells. They are the lightest elements capable of efficiently forming one, two, three, and four bonds, respectively; in general, the lightest elements form the strongest bonds. The trace elements represent a miniscule fraction of the weight of the human body, but all are essential to life, usually because they are essential to the function of specific proteins, including many enzymes. The oxygen-transporting capacity of the hemoglobin molecule, for example, is absolutely dependent on four iron ions that make up only 0.3% of the molecule’s mass.

A periodic table is color-coded to distinguish bulk elements from trace elements. — FIGURE 1-11 Elements essential to animal life and health. Bulk elements (shaded light red) are structural components of cells and tissues and are required in the diet in gram quantities daily. For trace elements (shaded yellow), the requirements are much smaller: for humans, a few milligrams per day of Fe, Cu, and Zn, and even less of the others. The elemental requirements for plants and microorganisms are similar to those shown here; the ways in which they acquire these elements vary.

Each square of the table contains a number and a chemical abbreviation. From left to right, the rows are as follows. Row 1: 1 H column 1 bulk element, 2 H lowercase E in column 18. Row 2 has elements in columns 1, 2, and 13 through 18 only as follows: 3 L lowercase I, 4 B lowercase E, 5 B, 6 C bulk element, 7 N bulk element, 8 O bulk element, 9 F, 10 N lowercase E; Row 3 has elements in columns 1, 2, and 13 through 18 only as follows: 11 N lowercase A bulk element, 12 M lowercase G trace element; 13 A lowercase L, 14 S lowercase I, 15 P bulk element, 16 S bulk element, 16 C lowercase L bulk element, 18 A lowercase R. Row 4: 19 K bulk element, 20 C lowercase A bulk element, 21 S lowercase C, 22 T lowercase I, 23 V trace element, 24 C lowercase R trace element, 25 M lowercase N trace element, 26 F lowercase E trace element, 27 C lowercase O trace element, 28 N lowercase I trace element, 29 C lowercase U trace element, 30 Z lowercase N trace element, 31 G lowercase A, 32 G lowercase E, 33 A lowercase S, 34 S lowercase E trace element, 35 B lowercase R, 36 K lowercase R. Row 5: 37 R lowercase B, 38 S lowercase R, 39 Y, 40 Z lowercase R, 41 N lowercase B, 42 M lowercase O trace element, 43 T lowercase C, 44 R lowercase U, 45 R lowercase H, 46 P lowercase D, 47 A lowercase G, 48 C lowercase D, 49 I lowercase N , 50 S lowercase N, 51 S lowercase B, 52 T lowercase E, 53 I trace element, 54 X lowercase E; Row 5: 55 C lowercase S, 56 B lowercase A, blank square labeled lanthanides, 72 H lowercase F, 73 T lowercase A, 74 W trace element, 75 R lowercase E, 76 O lowercase S, 77 I lowercase R, 78 P lowercase T, 79 A lowercase U, 80 H lowercase G, 81 T lowercase L, 82 P lowercase B, 83 B lowercase I, 84 P lowercase O, 85 A lowercase T, 86 R lowercase N; Row 6: 87 F lowercase R, 88 R lowercase A, blank square labeled, actinides.

Biomolecules Are Compounds of Carbon with a Variety of Functional Groups

The chemistry of living organisms is organized around carbon, which accounts for more than half of the dry weight of cells. Carbon can form single bonds with hydrogen atoms and can form both single bonds and double bonds with oxygen and nitrogen atoms (Fig. 1-12). Of greatest significance in biology is the ability of carbon atoms to form very stable single bonds with up to four other carbon atoms. Two carbon atoms also can share two (or three) electron pairs, thus forming double (or triple) bonds.

A diagram shows eight examples of the formation of carbon bonds using substrates and products shown as Lewis structures and products shown as structural formulas. — FIGURE 1-12 Versatility of carbon bonding. Carbon can form covalent single, double, and triple bonds (all bonds in red), particularly with other carbon atoms. Triple bonds are rare in biomolecules.

Two columns each contain diagrams of four reactions. For each reaction, Lewis diagrams are given first. Two atoms are added together to form a bond indicated using colored dots that come together in one or more bonds in the product. C atoms are initially always shown with one dot on each side. Column 1, row 1: C with a pink right-hand dot is added to H with a pink dot on the left to form C bonded to H. C is shown with bonds extending from each side and a pink bond to H on the right. Column 1, row 2: C with a pink right-hand dot is added to O with one dot above, two dots to the right and below, and a single pink dot on the left to form C bonded to O. C is shown with bonds extending from each side and a pink bond to O on the right, from which another bond extends to the right. Column 1, row 3: C with pink dots to the right and below is added to O with one pink dot above, two dots to the right and below, and a single pink dot on the left to form C double bonded to O (with two vertical pairs of pink dots between them). C is shown with two bonds extending diagonally to the upper and lower left and a pink double bond to O on the right. Column 1, row 4: C with a pink right-hand dot is added to N with a pink dot on the left, single dots above and below, and two dots to the right to produce C bonded to N. This is shown as C with bonds extending from all sides and a pink bond to N, from which two angled bonds extend to the right. Column 2, row 1: C with a pink right-hand dot and a pink dot below is added to N with a pink dot to the left and below, one dot above, and two dots to the right to produce C double bonded to N (with two vertical pairs of pink dots between them). This is shown as C with two bonds extending at angles to the left and with a pink double bond to N that has a single bond extending to the right. Column 2, row 2: C with a pink right-hand dot is added to C with a pink dot on the left to produces C bonded to C. This is shown as 2 C atoms each with bonds extending from all sides and a pink bond connecting them. Column 2, row 3: C with a pink dot to the right and below is added to C with a pink dot to the left and below to produce C double bonded to C (with two vertical pairs of pink dots between them). This is shown as 2 C atoms each with two bonds extending from one side at opposite angles and a pink double bond between them. Column 2, row 4: C with a pink dot to the left, below, and to the right is bonded with an identical C to produce C with a single bond to the left triple bonded to C with a single bond to the right. Three vertical pairs of pink dots are between the C atoms. This is shown as 2 C atoms each with a single bond extending away and three pink bonds connecting them.

The four single bonds that can be formed by a carbon atom project from the nucleus to the four apices of a tetrahedron (Fig. 1-13), with an angle of about $109.5 °$ $109.5 degree$ between any two bonds and an average bond length of 0.154 nm. There is free rotation around each single bond, unless very large or highly charged groups are attached to both carbon atoms, in which case rotation may be restricted. A double bond is shorter (about 0.134 nm) and rigid, and it allows only limited rotation about its axis.

Three molecular diagrams, a, b, and c, show the geometry of carbon bonding by showing bonds around a single atom in part a, two bonded carbon atoms in part b, and doubled bonded carbons in part c. — FIGURE 1-13 Geometry of carbon bonding. (a) Carbon atoms have a characteristic tetrahedral arrangement of their four single bonds. (b) Carbon–carbon single bonds have freedom of rotation, as shown for the compound ethane ${(CH}_{3} — {CH}_{3}) .$ $left-parenthesis CH Subscript 3 Baseline em-dash CH Subscript 3 Baseline right-parenthesis period$ (c) Double bonds are shorter and do not allow free rotation. The two doubly bonded carbons and the atoms designated A, B, X, and Y all lie in the same rigid plane.

FIGURE 1-13 Geometry of carbon bonding. (a) Carbon atoms have a characteristic tetrahedral arrangement of their four single bonds. (b) Carbon–carbon single bonds have freedom of rotation, as shown for the compound ethane ${(CH}_{3} — {CH}_{3}) .$ $left-parenthesis CH Subscript 3 Baseline em-dash CH Subscript 3 Baseline right-parenthesis period$ (c) Double bonds are shorter and do not allow free rotation. The two doubly bonded carbons and the atoms designated A, B, X, and Y all lie in the same rigid plane.

Part a, at the left, shows a sphere labeled, C with four evenly placed cylinders extending from it to produce a tetrahedral shape of which two sides are visible. The bond angles are 109.5 degrees. Part b, in the center, shows two C atoms similar to the one in part b, except that they share one cylinder to form a bond connecting them. A clockwise arrow indicates that one atom can rotate relative to the other along their common axis. Part c shows two C atoms and atoms “A”, “B”, “X”, and “Y” all embedded in a flat rectangular plane. The two C atoms are connected by two tubes, one looping above the plane and the other looping below the plane. Atoms “A” and “B” are each connected by a single cylinder to the left-hand carbon and extend to its left with a 120 degree angle between them. Atoms “X” and “Y” are each connected by a single cylinder to the right-hand carbon and extend to its right. There is a 120 degree angle between them.

Covalently linked carbon atoms in biomolecules can form linear chains, branched chains, and cyclic structures. It seems likely that the bonding versatility of carbon, with itself and with other elements, was a major factor in the selection of carbon compounds for the molecular machinery of cells during the origin and evolution of living organisms. No other chemical element can form molecules of such widely different sizes, shapes, and composition.

Most biomolecules can be regarded as derivatives of hydrocarbons, with hydrogen atoms replaced by a variety of functional groups that confer specific chemical properties on the molecule, forming various families of organic compounds. Typical of these are alcohols, which have one or more hydroxyl groups; amines, with amino groups; aldehydes and ketones, with carbonyl groups; and carboxylic acids, with carboxyl groups (Fig. 1-14). Many biomolecules are polyfunctional, containing two or more types of functional groups (Fig. 1-15), each with its own chemical characteristics and reactions. The chemical “personality” of a compound is determined by the chemistry of its functional groups and their disposition in three-dimensional space.

A table shows 24 common functional groups in biomolecules with highlighting to represent the atom characterizing the group as C, O, N, S, or P. — FIGURE 1-14 Some common functional groups of biomolecules. Functional groups are screened with a color typically used to represent the element that characterizes the group: gray for C, red for O, blue for N, yellow for S, and orange for P. In this figure and throughout the book, we use R to represent “any substituent.” It may be as simple as a hydrogen atom, but typically it is a carbon-containing group. When two or more substituents are shown in a molecule, we designate them $R^{1}, R^{2},$ $upper R Superscript 1 Baseline comma upper R squared comma$ and so forth.

FIGURE 1-14 Some common functional groups of biomolecules. Functional groups are screened with a color typically used to represent the element that characterizes the group: gray for C, red for O, blue for N, yellow for S, and orange for P. In this figure and throughout the book, we use R to represent “any substituent.” It may be as simple as a hydrogen atom, but typically it is a carbon-containing group. When two or more substituents are shown in a molecule, we designate them $R^{1}, R^{2},$ $upper R Superscript 1 Baseline comma upper R squared comma$ and so forth.

Three columns contain labeled chemical structures for various functional groups. Each functional group is highlighted and bonded to one or more R groups that are not highlighted. The structure is followed by the highlight color. First column, top to bottom: Methyl: C evenly bonded to 3 H atoms and R; gray. Ethyl: C bonded to 2 H and R and further bonded to C H 3; gray. Phenyl: A six-membered carbon ring with alternating single and double bonds with 5 carbons bonded to H and one carbon bonded to R; gray. Carbonyl (aldehyde): C single bonded to H and R on either side and double-bonded to O below; red. Carbonyl (ketone): C bonded to R 1 on one side, bonded to R 2 on the other side, and double-bonded to O below; red. Carboxyl: C bonded to R on the left, bonded to O minus on the right, and double-bonded to O below; red. Hydroxyl (alcohol): O bonded to R on one side and H on the other; red. Enol: C bonded to O H above, bonded to R to the left, and double bonded to C to the right that is further bonded to 2 H; red. Second column, top to bottom: Ether: O bonded to R 1 on the left and R 2 on the right; red. Ester: C bonded to R 1 on the left, double bonded to O below, and bonded to O that is further bonded to R 2 on the right; red. Acetyl: C bonded to O R on the left, double bonded to O below, and bonded to C H 3 on the right; red. Anhydride (two carboxylic acids): C bonded to R 1 on the left, double bonded to O below, and bonded to O that is further bonded to C that is bonded to R 2 on the right and double bonded to O below; red. Amino (protonated): N plus bonded to R on the left and H on the other three sides; blue. Amido: N bonded to R on the left, double bonded to O below, and bonded to N H 2 on the right; blue. Imine: C bonded to R 1 on the left, bonded to R 2 on the right, and double bonded to N H above; blue. N-substituted imine (Schiff base): same as the imine, except that N is bonded to R 3 instead of H; blue. Third column, top to bottom: Guanidinium: C is bonded to N H that is further bonded to R on the left, bonded to N H 2 on the right, and double bonded to N plus that is further bonded to 2 H below; blue. Imidazole: A five-membered ring has a double bond at its top and lower right sides, C bonded to H at the bottom vertex, N H substituted for C at the bottom left vertex, N with two pink electrons substituted for C at the lower right vertex, and R bonded to C at the upper left vertex; blue. Sulfhydryl: S is bonded to R and H; yellow. Disulfide: S is bonded to R 1 on one side and to S that is further bonded to R 2 on the other; yellow. Thioester: C is bonded to R 1 on the left, bonded to S that is further bonded to R 2 on the right, and double bonded to O below; yellow. Phosphoryl: P is bonded to O bonded to R on the left, bonded to O minus above, bonded to O H on the right, and double bonded to O below; orange. Phosphoanhydride: P is bonded to O minus above, bonded to O further bonded to R 1 on the left, double bonded to O below, and bonded to O on the right that is further bonded to P that is bonded to O minus above, bonded to O bonded to R 2 on the right, and double bonded to O below; orange. Mixed anhydride (carboxylic acid and phosphoric acid; also called acyl phosphate): C is bonded to R on the left, double bonded to O below, and bonded to O on the right that is further bonded to P that is bonded to O minus above, bonded to O H on the right, and double bonded to O below; orange.

A structural formula and a space-filling model show functional groups in a biomolecule, acetyl-coenzyme A. — FIGURE 1-15 Several common functional groups in a single biomolecule. Acetyl-coenzyme A (often abbreviated as acetyl-CoA) is a carrier of acetyl groups in some enzymatic reactions. Its functional groups are screened in the structural formula. In the space-filling model, N is blue, C is black, P is orange, O is red, and H is white. The yellow atom at the left is the sulfur of the critical thioester bond between the acetyl moiety and coenzyme A. [Acetyl-CoA structure data from PDB ID 1DM3, Y. Modis and R. K. Wierenga, *J. Mol. Biol*. 297:1171, 2000.]

The space-filling model of acetyl Co A is shown beneath its structural formula. This model is linear but kinks back toward the left at its right end. From left to right, the structure begins with C H 3 that is bonded to C of a thioester group that also includes double bonded O below and S to the right. S is further bonded to C H 2, which is bonded to C H 2, which is bonded to N H of an amido group. This N is bonded to C that is double bonded to O below, forming the amido group, and then is further bonded to C H 2 that is bonded to C H 2 that is further bonded to the N H of another amido group. This N is bonded to C that is double bonded to O below, forming the amido group, and then is further bonded to C that is bonded to H above, O H below (a hydroxyl group), and C on the right that is bonded to C H 3 above, C H 3 below, and C H 2 to the right that is further bonded to O that is further bonded to P of a phosphoanhydride. The phosphoanhydride consists of the P bonded to O minus above, double bonded to O below, and bonded to O that is further bonded to P bonded to O minus above, double bonded to O below, and bonded to O outside of the amido group. The O is bonded to C H 2 that bonds to C H at the left side vertex of a five-membered ring. The ring has O at the top vertex. C H at the bottom left vertex is further bonded to O that is bonded to a phosphoryl group of P bonded to O minus, bonded to O H, and double bonded to O. The C H at the bottom right vertex is bonded to O H. The C H at the right side vertex is bonded to N at the bottom vertex of an imidazole-like five-membered ring that is fused with a six-membered ring to its right. The imidazole-like ring has N substituted for C at its bottom and top vertices, C H at its left side vertex, and double bonds at the top left vertex and the right side vertex shared with the six-membered ring. The rest of the six-membered ring is not part of the imidazole-like group. It has N substituted for C at the top right and bottom vertices, C H at the right side vertex, double bonds at the top and bottom right vertices, and an amino group, N H 2, bonded to C at the top vertex.

Cells Contain a Universal Set of Small Molecules

Dissolved in the aqueous phase (cytosol) of all cells is a collection of perhaps several thousand different small organic molecules $(M_{r} ~ 100 to ~ 500),$ $left-parenthesis upper M Subscript r Baseline tilde 100 to tilde 500 right-parenthesis comma$ with intracellular concentrations ranging from nanomolar to > 10 mm (see Fig. 13-31). (See Box 1-1 for an explanation of the various ways of referring to molecular weight.) These are the central metabolites in the major pathways occurring in nearly every cell — the metabolites and pathways that have been conserved throughout the course of evolution. This collection of molecules includes the common amino acids, nucleotides, sugars and their phosphorylated derivatives, and mono-, di-, and tricarboxylic acids. The molecules may be polar or charged and most are water-soluble. They are trapped in the cell because the plasma membrane is impermeable to them, although specific membrane transporters can catalyze the movement of some molecules into and out of the cell or between compartments in eukaryotic cells. The universal occurrence of the same set of compounds in living cells reflects the evolutionary conservation of metabolic pathways that developed in the earliest cells.

Box 1-1

Molecular Weight, Molecular Mass, and Their Correct Units

There are two common (and equivalent) ways to describe molecular mass; both are used in this text. The first is molecular weight, or relative molecular mass, denoted $M_{r} .$ $upper M Subscript r Baseline period$ The molecular weight of a substance is defined as the ratio of the mass of a molecule of that substance to one-twelfth the mass of an atom of carbon-12 $(^{12} C) .$ $left-parenthesis Superscript 12 Baseline upper C right-parenthesis period$ Since $M_{r}$ $upper M Subscript r$ is a ratio, it is dimensionless — it has no associated units. The second is molecular mass, denoted m. This is simply the mass of one molecule, or the molar mass divided by Avogadro’s number. The molecular mass, m, is expressed in daltons (abbreviated Da). One dalton is equivalent to one-twelfth the mass of an atom of carbon-12; a kilodalton (kDa) is 1,000 daltons; a megadalton (MDa) is 1 million daltons.

Consider, for example, a molecule with a mass 1,000 times that of water. We can say of this molecule either $M_{r} = 18,000$ $upper M Subscript r Baseline equals 18,000$ or m = 18,000 daltons. We can also describe it as an “18 kDa molecule.” However, the expression $M_{r} = 18,000 daltons$ $upper M Subscript r Baseline equals 18,000 daltons$ is incorrect.

Another convenient unit for describing the mass of a single atom or molecule is the atomic mass unit (formerly amu, now commonly denoted u). One atomic mass unit (1 u) is defined as one-twelfth the mass of an atom of carbon-12. Since the experimentally measured mass of an atom of carbon-12 is $1.9926 \times 10^{-}^{23} g,$ $1.9926 times 10 Superscript minus Superscript 23 Baseline g comma$ $1 u = 1.6606 \times 10^{-}^{24} g .$ $1 u equals 1.6606 times 10 Superscript minus Superscript 24 Baseline g period$ The atomic mass unit is convenient for describing the mass of a peak observed by mass spectrometry (see Chapter 3, p. 93).

There are other small biomolecules, specific to certain types of cells or organisms. For example, vascular plants contain, in addition to the universal set, small molecules called secondary metabolites, which play roles specific to plant life. These metabolites include compounds that give plants their characteristic scents and colors, and compounds such as morphine, quinine, nicotine, and caffeine that are valued for their physiological effects on humans but have other purposes in plants.

The entire collection of small molecules in a given cell under a specific set of conditions has been called the metabolome, in parallel with the term “genome.” Metabolomics is the systematic characterization of the metabolome under very specific conditions (such as following administration of a drug, or a biological signal such as insulin).

Macromolecules Are the Major Constituents of Cells

Many biological molecules are macromolecules, polymers with molecular weights above ~5,000 that are assembled from relatively simple precursors (Fig. 1-16). Shorter polymers are called oligomers (Greek oligos, “few”). Proteins, nucleic acids, and polysaccharides are macromolecules composed of monomers with molecular weights of 500 or less. Synthesis of macromolecules is a major energy-consuming activity of cells. Macromolecules themselves may be further assembled into supramolecular complexes, forming functional units such as ribosomes. Table 1-1 shows the major classes of biomolecules in an E. coli cell.

A four-part figure, a, b, c, d, shows important components of organic compounds, with part a showing some of the amino acids of proteins, part b showing some components of nucleic acids, part c showing some components of lipids, and part d showing the parent sugar, Greek letter alpha D-glucose. — FIGURE 1-16 The organic compounds from which most cellular materials are constructed: the ABCs of biochemistry. Shown here are (a) 4 of the 20 amino acids from which all proteins are built (the side chains are shaded light red); (b) 3 of the 5 nitrogenous bases, the two 5-carbon sugars, and the phosphate ion from which all nucleic acids are built; (c) 4 components of membrane lipids (including phosphate); and (d) d-glucose, the simple sugar from which most carbohydrates are derived.

Part a, at the top left, shows some of the amino acids of proteins. Each of these has a central carbon bonded to N H 3 plus on the left, C O O minus above, H on the right, and a highlighted R group below. The R groups are as follows. Alanine: C H 3; Serine: C H 2 O H; Aspartate: C H 2 further bonded to C O O minus; Histidine; C H 2 further bonded to C at the upper left vertex of a five-membered ring with N H substituted for C at the top vertex, N H plus substituted for C at the bottom right vertex, C H at the right side and bottom left vertices, and double bonds at the left side and lower right side. Part b, at the bottom left, shows some components of nucleic acids. These include nitrogenous bases, phosphate, and five-carbon sugars. The nitrogenous bases are uracil, adenine, and guanine. Uracil is a six-membered ring with C double bonded to O at the top vertex, C H at the top and bottom right vertices, N H substituted for C at the bottom vertex, C double bonded to O at the lower right vertex, N H substituted for C at the upper right vertex, and a double bond on the right side. Adenine has a six-membered ring fused to a five-membered ring. The six-membered ring has C bonded to N H 2 at the top vertex, N substituted for C at the top left and bottom vertices, C bonded to H at the lower right vertex, double bonds at the top left and bottom left vertices, and a double bond at the right side vertex shared with the five-membered ring. The five-membered ring has N substituted for C at the top right vertex, N H substituted for C at the bottom right vertex, C H at the right side vertex, and a double bond at the top right side. Guanine has a similar structure to adenine, except that the six-membered ring has C double bonded to O at the top vertex, N H at the top left vertex, C bonded to N H 2 at the bottom left side vertex, and no double bond at the upper right side. Phosphate has P bonded to 2 O minus, bonded to O H, and double bonded to O. The five-carbon sugars are alpha-D-ribose and 2-deoxy-alpha-D-ribose. Alpha-D-ribose is a five-membered ring with O at the top vertex and the following structure: C 1, C 2, and C 3 have H above the ring and O H below the right; C 4 has H O C H 2 above the ring and H below the ring. 2-deoxy-alpha-D-ribose has a similar structure, except that C 2 has H above and below the ring. Part c, at the right of the figure, shows some components of lipids including palmitate, glycerol, and choline. Palmitate is a 16-carbon chain with C H 3 at one end, C H 2 for 14 carbons, and ending with C O O minus. Glycerol is a three-carbon chain with C 1 bonded to 2 H and O H, C 2 bonded to H and O H, and C 3 bonded to 2 H and O H. Choline has N plus bonded to 3 C H 3 and a chain consisting of C H 2 bonded to C H 2 O H. Part d at the bottom right is the parent sugar, which is alpha-D-glucose. It consists of a six-membered ring with O at the top right vertex; C 1, C 2, and C 4 bonded to H above and O H below, C 3 bonded to O H above and H below; and C 5 bonded to C H 2 O H above and H below.

TABLE 1-1 Molecular Components of an *E. coli* Cell
	Percentage of total weight of cell	Approximate number of different molecular species
Water	70	1
Proteins	15	3,000
Nucleic acids
DNA	1	1–4
RNA	6	>3,000
Polysaccharides	3	20
Lipids	2	50^a
Monomeric subunits and intermediates	2	2,600
Inorganic ions	1	20
Source: A. C. Guo et al., Nucleic Acids Res. 41:D625, 2013. ^aIf all permutations and combinations of fatty acid substituents are considered, this number is much larger.

Proteins, long polymers of amino acids, constitute the largest mass fraction (besides water) of a cell. Some proteins have catalytic activity and function as enzymes; others serve as structural elements, signal receptors, or transporters that carry specific substances into or out of cells. Proteins are perhaps the most versatile of all biomolecules; a catalog of their many functions would be very long. The sum of all the proteins functioning in a given cell is the cell’s proteome, and proteomics is the systematic characterization of this protein complement under a specific set of conditions. The nucleic acids, DNA and RNA, are polymers of nucleotides. They store and transmit genetic information, and some RNA molecules have structural and catalytic roles in supramolecular complexes. The genome is the entire sequence of a cell’s DNA (or in the case of RNA viruses, its RNA), and genomics is the characterization of the structure, function, evolution, and mapping of genomes.

The polysaccharides, polymers of simple sugars such as glucose, have three major functions: as energy-rich fuel stores, as rigid structural components of cell walls (in plants and bacteria), and as extracellular recognition elements that bind to proteins on other cells. Shorter polymers of sugars (oligosaccharides) attached to proteins or lipids at the cell surface serve as specific cellular signals. A cell’s glycome is its entire complement of carbohydrate-containing molecules. The lipids, water-insoluble hydrocarbon derivatives, serve as structural components of membranes, energy-rich fuel stores, pigments, and intracellular signals. The lipid-containing molecules in a cell constitute its lipidome.

Proteins, polynucleotides, and polysaccharides have large numbers of monomeric subunits and thus high molecular weights — in the range of 5,000 to more than 1 million for proteins, up to several billion for DNA, and in the millions for polysaccharides such as starch. Individual lipid molecules are much smaller ( $M_{r}$ $upper M Subscript r$ 750 to 1,500) and are not classified as macromolecules, but they can associate noncovalently into very large structures. Cellular membranes are built of enormous noncovalent aggregates of lipid and protein molecules.

Given their characteristic information-rich subunit sequences, proteins and nucleic acids are often referred to as informational macromolecules. Some oligosaccharides, as noted above, also serve as informational molecules.

Three-Dimensional Structure Is Described by Configuration and Conformation

The covalent bonds and functional groups of a biomolecule are, of course, central to its function, but so also is the arrangement of the molecule’s constituent atoms in three-dimensional space — its stereochemistry. Carbon-containing compounds commonly exist as stereoisomers, molecules with the same chemical bonds and same chemical formula but different configuration, the fixed spatial arrangement of atoms. Interactions between biomolecules are typically stereospecific, requiring specific configurations in the interacting molecules.

Figure 1-17 shows three ways to illustrate the stereochemistry, or configuration, of simple molecules. The perspective diagram specifies stereochemistry unambiguously, but bond angles and center-to-center bond lengths are better represented with ball-and-stick models. In space-filling models, the radius of each “atom” is proportional to its van der Waals radius, and the contours of the model define the space occupied by the molecule (the volume of space from which atoms of other molecules are excluded).

A three-part figure, a, b, c, shows three different representations of a molecule with part a showing a structural model in perspective form, part b showing a ball-and-stick model, and part c showing a space-filling model. — FIGURE 1-17 Representations of molecules. Three ways to represent the structure of the amino acid alanine (shown here in the ionic form found at neutral pH). (a) Structural formula in perspective form: a solid wedge represents a bond in which the atom at the wide end projects out of the plane of the paper, toward the reader; a dashed wedge represents a bond extending behind the plane of the paper. (b) Ball-and-stick model, showing bond angles and relative bond lengths. (c) Space-filling model, in which each atom is shown with its correct relative van der Waals radius.

FIGURE 1-17 Representations of molecules. Three ways to represent the structure of the amino acid alanine (shown here in the ionic form found at neutral pH). (a) Structural formula in perspective form: a solid wedge represents a bond in which the atom at the wide end projects out of the plane of the paper, toward the reader; a dashed wedge represents a bond extending behind the plane of the paper. (b) Ball-and-stick model, showing bond angles and relative bond lengths. (c) Space-filling model, in which each atom is shown with its correct relative van der Waals radius.

Configuration is conferred by the presence of either (1) double bonds, around which there is little or no freedom of rotation, or (2) chiral centers, around which substituent groups are arranged in a specific orientation. The identifying characteristic of stereoisomers is that they cannot be interconverted without the temporary breaking of one or more covalent bonds. Figure 1-18a shows the configurations of maleic acid and its isomer, fumaric acid. These compounds are geometric isomers, or cis-trans isomers; they differ in the arrangement of their substituent groups with respect to the nonrotating double bond (Latin cis, “on this side” — groups on the same side of the double bond; trans, “across” — groups on opposite sides). Maleic acid (maleate at the neutral pH of cytoplasm) is the cis isomer, and fumaric acid (fumarate) is the trans isomer; each is a well-defined compound that can be separated from the other, and each has its own unique chemical properties. A binding site (on an enzyme, for example) that is complementary to one of these molecules would not be complementary to the other, which explains why the two compounds have distinct biological roles despite their similar chemical makeup. The visual pigment in the vertebrate eye, rhodopsin, contains retinal, a vitamin A–derived lipid (Fig. 1-18b). In the primary event of vision, light converts one isomer of retinal to another, triggering a neuronal signal to the brain (see Fig. 12-19).

A two-part figure, a and b, shows cis and trans configurations of geometric isomers with part a showing maleic acid open parenthesis cis close parenthesis and fumaric acid open parenthesis trans close parenthesis and part b showing 11-italicized cis end italics-retinal and all italicized trans end italics retinal. — FIGURE 1-18 Configurations of geometric isomers. (a) Isomers such as maleic acid (maleate at pH 7) and fumaric acid (fumarate) cannot be interconverted without breaking covalent bonds, which requires the input of much more energy than the average kinetic energy of molecules at physiological temperatures. (b) In the vertebrate retina, the initial event in light detection is the absorption of visible light by 11-*cis*-retinal. The energy of the absorbed light (about 250 kJ/mol) converts 11-*cis*-retinal to all-*trans*-retinal, triggering electrical changes in the retinal cell that lead to a nerve impulse. (Note that the hydrogen atoms are omitted from the ball-and-stick models of the retinals.)

Part a, at the top of the figure, shows maleic acid (cis) and fumaric acid (trans) using structural formulas and ball-and-stick models. Maleic acid has C bonded to H on the upper left, bonded to C O O H on the lower left, and double bonded to C on the right that is further bonded to H above and to C O O H below. Fumaric acid is the same except that the C on the left has C O O H on the upper left and H below. Part b, at the bottom of the figure, shows 11-cis-retinal and all-trans-retinal. 11-cis-retinal can be converted into all-trans-retinal when light is added. 11-cis-retinal has a six-membered ring with two C H 3 bonded to the top vertex, C H 3 bonded to the lower right vertex, a double bond on the right side, and a chain extending from the right side. The chain consists of a single bond, then double bond, then single bond to C 9, which is bonded to C H 3 above and double bonded to C 10, which is single bonded to C 11. The double bond between C 11 and C 12 is highlighted. C 12 is single bonded to C 13 from the upper left. C 13 is bonded to C H 3 to the lower left and has a double bond to the right. C 14 is bonded to C below that is double bonded to O and single bonded to H. The overall chain is straight and then kinks downward at the double bond between C 11 and C 12, which is cis. All-trans-retinal has the same structure except that C 12 is bonded to the remaining chain above instead of below. The overall chain is straight as a result.

In the second type of stereoisomer, four different substituents bonded to a tetrahedral carbon atom may be arranged in two different ways in space — that is, have two configurations — yielding two stereoisomers that have similar or identical chemical properties but differ in certain physical and biological properties. A carbon atom with four different substituents is said to be asymmetric, and asymmetric carbons are called chiral centers (Greek chiros, “hand”; some stereoisomers are related structurally as the right hand is to the left hand). A molecule with only one chiral carbon can have two stereoisomers; when two or more (n) chiral carbons are present, there can be $2^{n}$ $2 Superscript n$ stereoisomers. Stereoisomers that are mirror images of each other are called enantiomers (Fig. 1-19). Pairs of stereoisomers that are not mirror images of each other are called diastereomers (Fig. 1-20).

A two-part drawing, a and b, shows how chiral and achiral molecules differ by comparing a chiral molecule shown in part a with an achiral molecule shown in part b. — FIGURE 1-19 Molecular asymmetry: chiral and achiral molecules. (a) When a carbon atom has four different substituent groups (A, B, X, Y), they can be arranged in two ways that represent nonsuperposable mirror images of each other (enantiomers). This asymmetric carbon atom is called a chiral atom or chiral center. (b) When a tetrahedral carbon has only three dissimilar groups (that is, the same group occurs twice), only one configuration is possible and the molecule is symmetric, or achiral. In this case, the molecule is superposable on its mirror image: the molecule on the left can be rotated counterclockwise (when looking down the vertical bond from A to C) to create the molecule in the mirror.

Part a, at the left, shows a ball-and-stick model with C in the center bonded to “A” above, “X" to the right rear, “B” to the front, and “Y” to the back rear in a regular tetrahedron. A mirror image of the original molecule has the same structure except that “X” is in front and “B” is in the back rear position. A counterclockwise arrow is shown rotating “Y”, “B", and “X” to produce a similar molecule that still has “A” above but has “B” to the right rear, “Y” in front, and “X” in the left rear. Text reads, chiral molecule: rotated molecule cannot be superimposed on its mirror image. Part b shows a similar molecule with “X” in the right and left rear positions and no “Y”. The mirror image of the original has “B” in the right rear position and “X” in the front and left rear positions. When rotated counterclockwise, the original molecule is the same as the mirror image. Text reads, achiral molecule: rotated molecule can be superimposed on its mirror image.

A figure compares the structures of enantiomers (mirror images) and diastereomers. Two sets of enantiomers are shown as structural models in perspective form and as ball-and-stick models. — FIGURE 1-20 Enantiomers and diastereomers. There are four different stereoisomers of 2,3-disubstituted butane (n = 2 asymmetric carbons, hence $2^{n} = 4$ $2 Superscript n Baseline equals 4$ stereoisomers). Each is shown in a box as a perspective formula and a ball-and-stick model, which has been rotated to show all of the groups. Two pairs of stereoisomers are mirror images of each other, or enantiomers. All other possible pairs are not mirror images, and so are diastereomers. [Information from F. Carroll, *Perspectives on Structure and Mechanism in Organic Chemistry*, p. 63, Brooks/Cole Publishing Co., 1998.]

FIGURE 1-20 Enantiomers and diastereomers. There are four different stereoisomers of 2,3-disubstituted butane (n = 2 asymmetric carbons, hence $2^{n} = 4$ $2 Superscript n Baseline equals 4$ stereoisomers). Each is shown in a box as a perspective formula and a ball-and-stick model, which has been rotated to show all of the groups. Two pairs of stereoisomers are mirror images of each other, or enantiomers. All other possible pairs are not mirror images, and so are diastereomers. [Information from F. Carroll, *Perspectives on Structure and Mechanism in Organic Chemistry*, p. 63, Brooks/Cole Publishing Co., 1998.]

Four molecules are shown using a structural formula in perspective form and ball-and-stick molecules. The first molecule has C bonded to C H 3 by a hashed wedge, bonded to H on the right by a solid wedge, bonded to “X” on the left by a solid wedge, and bonded by a line to C below that is further bonded to H on the right by a solid wedge, “Y” on the left by a solid wedge, and C H 3 below by a hashed wedge. The ball-and-stick figure shows that “X” and “Y” are positioned in the front of the molecule, toward the observer. The second molecule is similar except “X” and “Y” have switched to the right side and the two Hs that were on the right side are now on the left. The ball-and-stick model shows that “X” and Y are now toward the back on the right. The first and second molecules are enantiomers. The third and fourth molecules are enantiomers of each other, but not of the first two molecules. The third molecule is similar to the first molecule except that “Y” is on the right instead of the left. The ball-and-stick model shows “X” in the front and “Y” in the back. The fourth molecule has “X” on the right on the first carbon and “Y” on the left on the second carbon. The ball-and-stick model shows that “X” is now toward the back and “Y” is now toward the front.

As the biologist, microbiologist, and chemist Louis Pasteur first observed in 1843 (Box 1-2), enantiomers have nearly identical chemical reactivities but differ in a characteristic physical property: optical activity. In separate solutions, two enantiomers rotate the plane of plane-polarized light in opposite directions, but an equimolar solution of the two enantiomers (a racemic mixture) shows no optical rotation. Compounds without chiral centers do not rotate the plane of plane-polarized light.

Box 1-2

Louis Pasteur and Optical Activity: In Vino, Veritas

Louis Pasteur encountered the phenomenon of optical activity in 1843, during his investigation of the crystalline sediment that accumulated in wine casks (a form of tartaric acid called paratartaric acid — also called racemic acid, from Latin racemus, “bunch of grapes”). He used fine forceps to separate two types of crystals identical in shape but mirror images of each other. Both types proved to have all the chemical properties of tartaric acid, but in solution one type rotated plane-polarized light to the left (levorotatory), whereas the other rotated it to the right (dextrorotatory). Pasteur later described the experiment and its interpretation:

In isomeric bodies, the elements and the proportions in which they are combined are the same, only the arrangement of the atoms is different … We know, on the one hand, that the molecular arrangements of the two tartaric acids are asymmetric, and, on the other hand, that these arrangements are absolutely identical, excepting that they exhibit asymmetry in opposite directions. Are the atoms of the dextro acid grouped in the form of a right-handed spiral, or are they placed at the apex of an irregular tetrahedron, or are they disposed according to this or that asymmetric arrangement? We do not know.*

Louis Pasteur 1822–1895

Now we do know. In 1951, x-ray crystallographic studies confirmed that the levorotatory and dextrorotatory forms of tartaric acid are mirror images of each other at the molecular level and established the absolute configuration of each (Fig. 1). The same approach has been used to demonstrate that although the amino acid alanine has two stereoisomeric forms (designated d and l), alanine in proteins exists exclusively in one form (the l isomer; see Chapter 3).

FIGURE 1 Pasteur separated crystals of two stereoisomers of tartaric acid and showed that solutions of the separated forms rotated plane-polarized light to the same extent but in opposite directions. These dextrorotatory and levorotatory forms were later shown to be the (R,R) and (S,S) isomers represented here.

Key convention

Given the importance of stereochemistry in reactions between biomolecules (see below), biochemists must name and represent the structure of each biomolecule so that its stereochemistry is unambiguous. For compounds with more than one chiral center, the most useful system of nomenclature is the RS system. In this system, each group attached to a chiral carbon is assigned a priority. The priorities of some common substituents are

— {OCH}_{3} > — OH > — {NH}_{2} > — COOH > — CHO > — {CH}_{2} OH > — {CH}_{3} > — H

$em-dash OCH Subscript 3 Baseline greater-than em-dash OH greater-than em-dash NH Subscript 2 Baseline greater-than em-dash COOH greater-than em-dash CHO greater-than em-dash CH Subscript 2 Baseline OH greater-than em-dash CH Subscript 3 Baseline greater-than em-dash upper H$

For naming in the RS system, the chiral atom is viewed with the group of lowest priority (4 in the following diagram) pointing away from the viewer. If the priority of the other three groups (1 to 3) decreases in clockwise order, the configuration is (R) (Latin rectus, “right”); if counterclockwise, the configuration is (S) (Latin sinister, “left”). In this way, each chiral carbon is designated either (R) or (S), and the inclusion of these designations in the name of the compound provides an unambiguous description of the stereochemistry at each chiral center.

A figure compares the R and S forms of a molecule with a central atom that has four substituents numbered 1 through 4.

Another naming system for stereoisomers, the d and l system, is described in Chapter 3. A molecule with a single chiral center can be named unambiguously by either system, as shown here. The two naming systems are based on different criteria, so no general correlation can be made between, say, the l isomer and the (S) isomer seen in this example.

A figure compares the L and (S) isomers of glyceraldehyde.

Distinct from configuration is molecular conformation, the spatial arrangement of substituent groups that, without breaking any bonds, are free to assume different positions in space because of the freedom of rotation about single bonds. In the simple hydrocarbon ethane, for example, there is nearly complete freedom of rotation around the $C — C$ $upper C em-dash upper C$ bond. Many different, interconvertible conformations of ethane are possible, depending on the degree of rotation (Fig. 1-21). Two conformations are of special interest: the staggered, which is more stable than all others and thus predominates, and the eclipsed, which is the least stable. We cannot isolate either of these conformational forms, because they are freely interconvertible. However, when one or more of the hydrogen atoms on each carbon is replaced by a functional group that is either very large or electrically charged, freedom of rotation around the $C — C$ $upper C em-dash upper C$ bond is hindered. This limits the number of stable conformations of the ethane derivative.

A graph shows the relationship between the torsion angle plotted against potential energy for ethane in different conformations, ranging from fully eclipsed to fully staggered. — FIGURE 1-21 Conformations. Many conformations of ethane are possible because of freedom of rotation around the $C — C$ $upper C em-dash upper C$ bond. In the ball-and-stick model, when the front carbon atom (as viewed by the reader) with its three attached hydrogens is rotated relative to the rear carbon atom, the potential energy of the molecule rises to a maximum in the fully eclipsed conformation (torsion angle $0 degree comma$ $0 degree comma$ $120 °,$ $120 degree comma$ and so on), then falls to a minimum in the fully staggered conformation (torsion angle $60 degree comma$ $60 degree comma$ $180 °,$ $180 degree comma$ and so on). Because the energy differences are small enough to allow rapid interconversion of the two forms (millions of times per second), the eclipsed and staggered forms cannot be separately isolated.

FIGURE 1-21 Conformations. Many conformations of ethane are possible because of freedom of rotation around the $C — C$ $upper C em-dash upper C$ bond. In the ball-and-stick model, when the front carbon atom (as viewed by the reader) with its three attached hydrogens is rotated relative to the rear carbon atom, the potential energy of the molecule rises to a maximum in the fully eclipsed conformation (torsion angle $0 degree comma$ $0 degree comma$ $120 °,$ $120 degree comma$ and so on), then falls to a minimum in the fully staggered conformation (torsion angle $60 degree comma$ $60 degree comma$ $180 °,$ $180 degree comma$ and so on). Because the energy differences are small enough to allow rapid interconversion of the two forms (millions of times per second), the eclipsed and staggered forms cannot be separately isolated.

The graph plots torsion angle (degrees) on the horizontal axis, ranging from 0 to 360, labeled in increments of 60, and potential energy on the vertical axis, ranging from 0 to 12, labeled in increments of 4. A sine curve begins at (0, 12), drops to (60, 0), rises to (120, 12), drops to (180, 0), rises to (240, 12), drops to (300, 0), and rises to (360, 12). The distance between 0 and 12 on the vertical axis is shown to be 12.1 k J per mol. All data are approximate. Ethane is shown as a ball-and-stick model above each peak and below each curve. It has two gray atoms connected by a single bond. Each gray atom has three evenly spaced white atoms extending out from it. At the peaks, the three atoms around the central atom are lined up so that someone looking at the atom directly from the front would not see the atoms in the back. This is labeled, fully eclipsed conformation. At the minima on the curve, the front atom has been rotated clockwise so that the atoms around the central atom are not lined up. If viewed from the front, all six atoms would appear evenly spaced. This is labeled, fully staggered conformation.

Interactions between Biomolecules Are Stereospecific

When biomolecules interact, the “fit” between them is often stereochemically correct; they are complementary. The three-dimensional structure of biomolecules large and small — the combination of configuration and conformation — is of the utmost importance in their biological interactions: reactant with its enzyme, hormone with its receptor, antigen with its specific antibody, for example (Fig. 1-22). The study of biomolecular stereochemistry, with precise physical methods, is an important part of modern research on cell structure and biochemical function.

A macromolecule is shown as a large, deeply textured mass in a rough crescent shape. In the center of the mass, a small red structure is visible in a recess in the surface texture. A close-up shows a small molecule fitting into a tiny cavity in the large molecule. — FIGURE 1-22 Complementary fit between a macromolecule and a small molecule. A glucose molecule fits into a pocket on the surface of the enzyme hexokinase and is held in this orientation by several noncovalent interactions between the protein and the sugar. This representation of the hexokinase molecule is produced with software that can calculate the shape of the outer surface of a macromolecule, defined either by the van der Waals radii of all the atoms in the molecule or by the “solvent exclusion volume,” the volume that a water molecule cannot penetrate. [Data from PDB ID 3B8A, P. Kuser et al., *Proteins* 72:731, 2008.]

In living organisms, chiral molecules are usually present in only one of their chiral forms. For example, the amino acids in proteins occur only as their l isomers; glucose occurs only as its d isomer. (The conventions for naming stereoisomers of the amino acids are described in Chapter 3; those for sugars, in Chapter 7. The RS system, described above, is the most useful for some biomolecules.) In contrast, when a compound with an asymmetric carbon atom is chemically synthesized in the laboratory, the reaction usually produces both possible chiral forms: a mixture of the d and l forms, for example. Living cells produce only one chiral form of a biomolecule because the enzymes that synthesize that molecule are also chiral.

Stereospecificity, the ability to distinguish between stereoisomers, is a property of enzymes and other proteins and a characteristic feature of biochemical interactions. If the binding site on a protein is complementary to one isomer of a chiral compound, it will not be complementary to the other isomer, for the same reason that a left-handed glove does not fit a right hand. Two striking examples of the ability of biological systems to distinguish stereoisomers are shown in Figure 1-23.

A two-part figure, a and b, compares the stereoisomers of two molecules with sweet-tasting L-aspartyl-L-phenylalanine methyl ester (aspartame) and bitter L-aspartyl-d-phenylalanine methyl ester in part a and with therapeutically active (S) citalopram and therapeutically inactive (R) citalopram in part b. — FIGURE 1-23 Stereoisomers have different effects in humans. (a) Aspartame, the artificial sweetener sold under the trade name Nutra-Sweet, is easily distinguishable by taste receptors from its bitter-tasting stereoisomer, although the two differ only in the configuration at one of the two chiral carbon atoms. (b) The antidepressant medication citalopram (trade name Celexa), a selective serotonin reuptake inhibitor, is a racemic mixture of these two stereoisomers, but only (S)-citalopram has the therapeutic effect. A stereochemically pure preparation of (S)-citalopram (escitalopram oxalate) is sold under the trade name Lexapro. As you might predict, the effective dose of Lexapro is one-half the effective dose of Celexa.

Part a, at the top, shows L-Aspartyl-L-phenylalanine methyl ester (aspartame) (sweet). It has a seven-member chain with N substituted for C 3. C 1 is bonded to O C H 3 and double bonded to O; highlighted C 2 is connected to C 1 and N at position 3 by highlighted lines, to highlighted H by a solid wedge, and to highlighted C H 2 further bonded to a non-highlighted phenyl group; N bonded to H is substituted for C at position 3; C 4 is double bonded to O; C 5 is bonded to H by a solid wedge and bonded to N H 3 plus by a hashed wedge; C 6 is bonded to 2 H; and C 6 is bonded to C O O minus (C7). L-Aspartyl-D-phenylalanine methyl ester (bitter) has a similar structure but differs in the highlighted chiral center at C 2 in that it is bonded to C H 2 by a solid wedge and to H by a hashed wedge. Part b, at the bottom, shows (S)-Citalopram as a benzene ring fused to a five-membered ring with C triple bonded to N at the lower left vertex of the benzene ring, the right side of benzene fused to the five-membered ring, O at the right vertex of the five-membered ring, a highlighted hashed wedge connecting the top right vertex to a phenyl group with F at the para position, and a highlighted solid wedge connecting the top right vertex to a three carbon chain that terminates with N further bonded to two C. (R)-Citalopram has a similar structure, except that the highlighted wedges are switched so that the phenyl group with F at the para position is bonded to the top right vertex of the five-membered ring by a solid wedge and the carbon chain ending with N further bonded to two C is bonded to the top right vertex of the five-membered ring by a hashed wedge.

The common classes of chemical reactions encountered in biochemistry are described in Chapter 13, as an introduction to the reactions of metabolism.

SUMMARY 1.2 Chemical Foundations

Because of its bonding versatility, carbon can produce a broad array of carbon–carbon skeletons with a variety of functional groups; these groups give biomolecules their biological and chemical personalities.
A nearly universal set of several thousand small molecules is found in living cells; the interconversions of these molecules in the central metabolic pathways have been conserved in evolution.
Proteins and nucleic acids are macromolecules — long, linear polymers of simple monomeric subunits; their sequences contain the information that gives each molecule its three-dimensional structure and its biological functions.
Molecular configuration can be changed only by breaking and re-forming covalent bonds. For a carbon atom with four different substituents (a chiral carbon), the substituent groups can be arranged in two different ways, generating stereoisomers with distinct properties. Only one stereoisomer is biologically active. Molecular conformation is the position of atoms in space that can be changed by rotation about single bonds, without covalent bonds being broken.
Interactions between biological molecules are often stereospecific: there is a close fit between complementary structures in the interacting molecules.