In addition to their important roles as fuel stores (starch, glycogen, dextran) and as structural materials (cellulose, chitin, peptidoglycans), polysaccharides and oligosaccharides are information carriers. Some provide communication between cells and their extracellular surroundings; others label proteins for transport to and localization in specific organelles, or for destruction when the protein is malformed or superfluous; and others serve as recognition sites for extracellular signal molecules (growth factors, for example) or extracellular parasites (bacteria or viruses). On almost every eukaryotic cell, specific oligosaccharide chains attached to components of the plasma membrane form a carbohydrate layer (the glycocalyx), several nanometers thick, that serves as an information-rich surface that the cell shows to its surroundings. These oligosaccharides are central players in cell-cell recognition and adhesion, cell migration during development, blood clotting, the immune response, wound healing, and other cellular processes. In most of these cases, the informational carbohydrate is covalently joined to a protein or a lipid to form a glycoconjugate, which is the biologically active molecule (Fig. 7-20).
FIGURE 7-20 Glycoconjugates. The structures of some typical proteoglycans, glycoproteins, and glycosphingolipids described in the text.
Proteoglycans are macromolecules of the cell surface or ECM in which one or more sulfated glycosaminoglycan chains are joined covalently to a membrane protein or a secreted protein. The glycosaminoglycan chain can bind to extracellular proteins through electrostatic interactions between the protein and the negatively charged sugar moieties on the proteoglycan. Proteoglycans are major components of all extracellular matrices.
Glycoproteins have one or several oligosaccharides of varying complexity joined covalently to a protein. They are usually found on the outer face of the plasma membrane (as part of the glycocalyx), in the ECM, and in the blood. Inside cells, they are found in specific organelles such as Golgi complexes (where the oligosaccharide moieties are added to the proteins), secretory granules, and lysosomes. The oligosaccharide portions of glycoproteins are very heterogeneous and, like glycosaminoglycans, are rich in information, forming highly specific sites for recognition and high-affinity binding by carbohydrate-binding proteins called lectins. Some cytosolic and nuclear proteins can be glycosylated as well.
Glycolipids are plasma membrane components in which the hydrophilic head groups are oligosaccharides. Glycosphingolipids are a class of glycolipids with a specific backbone structure that will be considered in more detail in Chapter 10. They have complex oligosaccharide components that act as specific sites for recognition by lectins, as they do in glycoproteins. Neurons are rich in glycosphingolipids, which act in nerve conduction and myelin formation. Glycosphingolipids also play a role in signal transduction in cells.
Mammalian cells can produce dozens of types of proteoglycans. Many are secreted into the ECM, where they act as tissue organizers and influence cellular activities, such as growth factor activation and adhesion. The basic proteoglycan unit consists of a “core protein” with covalently attached glycosaminoglycan(s). The point of attachment is a Ser residue, to which the glycosaminoglycan is joined through a tetrasaccharide linker (Fig. 7-21).
FIGURE 7-21 Proteoglycan structure, showing the tetrasaccharide bridge. A typical tetrasaccharide linker (blue) connects a glycosaminoglycan — in this case, chondroitin sulfate (orange) — to a Ser residue in the core protein. The xylose residue at the reducing end of the linker is joined by its anomeric carbon to the hydroxyl of the Ser residue.
Some proteoglycans are integral membrane proteins (see Fig. 11-9). For example, the basal lamina, a specialized layer of the ECM that separates two organized groups of cells, contains a family of core proteins ( 20,000 to 40,000), each with several covalently attached heparan sulfate chains. There are two major families of membrane heparan sulfate proteoglycans. Syndecans have a single transmembrane domain and an extracellular domain bearing three to five chains of heparan sulfate and, in some cases, chondroitin sulfate (Fig. 7-22a). Glypicans are attached to the membrane by a GPI anchor, a glycosylated derivative of the membrane lipid phosphatidylinositol (see Fig. 11-16). Both syndecans and glypicans can be shed into the extracellular space. A protease in the ECM that cuts proteins close to the membrane surface releases syndecan ectodomains (domains outside the plasma membrane), and a phospholipase that breaks the connection to the GPI anchor releases glypicans. These mechanisms provide a way for a cell to change its surface features quickly. Shedding is highly regulated and is activated in proliferating cells, such as cancer cells. Proteoglycan shedding is involved in cell-cell recognition and adhesion, and in the proliferation and differentiation of cells. Numerous chondroitin sulfate and dermatan sulfate proteoglycans also exist, some as membrane-bound entities, others as secreted products in the ECM.
FIGURE 7-22 Two families of membrane proteoglycans. (a) Syndecans and glypicans can be shed from the membrane by enzymatic cleavage near the outer membrane surface. (b) Along a heparan sulfate chain, regions rich in sulfated sugars, the NS domains (green), alternate with regions with chiefly unmodified residues of GlcNAc and GlcA, the NA domains (gray). One of the NS domains is shown in more detail, revealing a high density of modified residues: GlcNS (N-sulfoglucosamine) and both GlcA and IdoA, with a sulfate ester at C-2. [(a) Information from U. Häcker et al., Nature Rev. Mol. Cell Biol. 6:530, 2005. (b) Information from J. Turnbull et al., Trends Cell Biol. 11:75, 2001.]
The glycosaminoglycan chains in proteoglycans can bind to a variety of extracellular ligands and thereby modulate the ligands’ interaction with specific receptors of the cell surface. Detailed studies of heparan sulfate demonstrate a domain structure that is not random; some domains (typically three to eight disaccharide units long) differ from neighboring domains in sequence and in ability to bind to specific proteins. Highly sulfated domains (called NS domains) alternate with domains having unmodified GlcNAc and GlcA residues (N-acetylated, or NA, domains) (Fig. 7-22b). The exact pattern of sulfation in the NS domain depends on the particular proteoglycan; given the number of possible modifications of the GlcNAc–IdoA (iduronic acid) dimer, at least 32 different disaccharide units are possible. Furthermore, the same core protein can display different heparan sulfate structures when synthesized in different cell types.
Heparan sulfate molecules with precisely organized NS domains bind specifically to extracellular proteins and signaling molecules to alter their activities. The change in activity may result from a conformational change in the protein that is induced by the binding (Fig. 7-23a), or it may be due to the ability of adjacent domains of heparan sulfate to bind to two different proteins, bringing them into close proximity and enhancing protein-protein interactions (Fig. 7-23b). A third general mechanism of action is the binding of extracellular signal molecules (growth factors, for example) to heparan sulfate, which increases their local concentrations and enhances their interaction with growth factor receptors on the cell surface; in this case, the heparan sulfate acts as a coreceptor (Fig. 7-23c). For example, fibroblast growth factor (FGF), an extracellular protein signal that stimulates cell division, first binds to heparan sulfate moieties of syndecan molecules in the target cell’s plasma membrane. Syndecan presents FGF to the FGF plasma membrane receptor, and only then can FGF interact productively with its receptor to trigger cell division. Finally, in another type of mechanism, the NS domains interact — electrostatically and otherwise — with a variety of soluble molecules outside the cell, maintaining high local concentrations at the cell surface (Fig. 7-23d).
FIGURE 7-23 Four types of protein interactions with NS domains of heparan sulfate. [Information from J. Turnbull et al., Trends Cell Biol. 11:75, 2001.]
The protease thrombin, essential to blood coagulation (see Fig. 6-44), is inhibited by another blood protein, antithrombin, which prevents premature blood clotting. Antithrombin does not bind to or inhibit thrombin in the absence of heparan sulfate. In the presence of heparan sulfate or heparin, the binding affinity of thrombin for antithrombin increases 2,000-fold, and thrombin is strongly inhibited. When thrombin and antithrombin are crystallized in the presence of a short (16 residue) segment of heparan sulfate, the negatively charged heparan sulfate mimic is seen to bridge positively charged regions of the two proteins, causing an allosteric change that inhibits thrombin’s protease activity (Fig. 7-24). The binding sites for heparan sulfate and heparin in both proteins are rich in Arg and Lys residues; the amino acids’ positive charges interact electrostatically with the sulfates of the glycosaminoglycans. Antithrombin also inhibits two other blood coagulation proteins (factors IXa and Xa) in a heparan sulfate–dependent process.
FIGURE 7-24 Molecular basis for heparan sulfate enhancement of the binding of thrombin to antithrombin. In this crystal structure of thrombin, antithrombin, and a 16 residue heparan sulfate–like polymer, all crystallized together, the binding sites for heparan sulfate in both proteins are rich in Arg and Lys residues. These positively charged regions, shown in blue in this electrostatic representation of the proteins, allow strong electrostatic interaction with multiple negatively charged sulfates and carboxylates of the heparan sulfate. Consequently, the affinity of antithrombin for thrombin is three orders of magnitude greater in the presence of heparan sulfate than in its absence. [Data from PDB ID 1TB6, W. Li et al., Nat. Struct. Mol. Biol. 11:857, 2004.]
The importance of correctly synthesizing sulfated domains in heparan sulfate is demonstrated in mutant (“knockout”) mice lacking the enzyme that sulfates the C-2 hydroxyl of iduronate (IdoA). These animals are born without kidneys and with severe developmental abnormalities of the skeleton and eyes. Other studies demonstrate that membrane proteoglycans are important in the liver for clearing lipoproteins from the blood. Finally, there is growing evidence that proteoglycans containing heparan sulfate and chondroitin sulfate provide directional cues for axon outgrowth, influencing the path taken by developing axons in the nervous system.
The functional importance of proteoglycans and the glycosaminoglycans associated with them can also be seen in the effects of mutations that block the synthesis or degradation of these polymers in humans (Box 7-3).
Some proteoglycans can form proteoglycan aggregates, enormous supramolecular assemblies of many core proteins all bound to a single molecule of hyaluronan (also called hyaluronate). Aggrecan core protein has multiple chains of chondroitin sulfate and keratan sulfate, joined to Ser residues in the core protein through trisaccharide linkers, to give an aggrecan monomer of . When a hundred or more of these “decorated” core proteins bind a single, extended molecule of hyaluronan (Fig. 7-25), the resulting proteoglycan aggregate and its associated water of hydration occupy a volume about equal to that of a bacterial cell! Aggrecan interacts strongly with collagen in the ECM of cartilage, contributing to the development, tensile strength, and resilience of this connective tissue.
FIGURE 7-25 Proteoglycan aggregate of the extracellular matrix. Schematic drawing of a proteoglycan with many aggrecan molecules. One very long molecule of hyaluronan is associated noncovalently with about 100 molecules of the core protein aggrecan. Each aggrecan molecule contains many covalently bound chondroitin sulfate and keratan sulfate chains. Link proteins at the junction between each core protein and the hyaluronan backbone mediate the core protein–hyaluronan interaction.
A wavy strand represents the hyaluronan molecule. There are repeating units of a brushy structure extending out vertically in pairs from its top and bottom sides. A close-up shows that each brushy structure is connected to hyaluronan by a small link protein that wraps around to form a circle at the base of the brushy structure, where it meets hyaluronan. A strand labeled aggrecan core protein extends up from this link protein. Near the link protein, blue threads extend out horizontally from the aggrecan core protein to the left and right. After a short region of blue threads, most of the aggrecan core protein has red threads extending to left and right.
Interwoven with these enormous extracellular proteoglycans are fibrous matrix proteins such as collagen, elastin, and fibronectin, forming a cross-linked meshwork that gives the whole ECM strength and resilience. Some of these proteins are multiadhesive, a single protein having binding sites for several different matrix molecules. Fibronectin, for example, has separate domains that bind fibrin, heparan sulfate, and collagen. Fibronectin and a number of other proteins in the extracellular matrix contain the conserved RGD sequence (Arg–Gly–Asp), through which they bind to a family of proteins called integrins. Integrins mediate signaling between the cell interior and the network of molecules in the ECM. The overall picture of cell-matrix interactions that emerges (Fig. 7-26) shows an array of interactions between cellular and extracellular molecules. These interactions serve not merely to anchor cells to the ECM, providing the strength and elasticity of skin and joints. They also provide paths that direct the migration of cells in developing tissue and serve to convey information in both directions across the plasma membrane.
FIGURE 7-26 Interactions between cells and the extracellular matrix. The association between cells and the proteoglycan of the extracellular matrix is mediated by a membrane protein (integrin) and by an extracellular protein (fibronectin, with its RGD motif) with binding sites for both integrin and the proteoglycan. Note the close association of collagen fibers with the fibronectin and proteoglycan.
The figure shows a horizontal plasma membrane as two sheets of white spheres with tails extending irregularly into the space between them. Beneath the plasma membrane on the left, two intertwined chains of spheres are labeled actin filament. Two strands hook over the actin filament, one to the left and one to the right. Each strand is connected to a long vertical strand that runs through the plasma membrane. The left strand ends with a series of three vertical ovals that end with a larger oval. The right strand ends with a series of four small ovals that end with another small oval to the side and a larger oval above that connects to an even larger oval above. Both of these structures are bent to the right and are labeled integrins. Two similar structures are present in the middle of the membrane and on the far right of the membrane, but these are both entirely vertical instead of bent. Above the right two pairs of integrins, there are two roughly horizontal strands made up of rods and labeled fibronectin. The two strands are close together to the left of the central pair of integrins and separate with the top one extending upward and the bottom one extending to the right above the pair of integrins on the right. The top fibronectin runs through three blue strands each made up of intertwined strands and labeled cross-linked fibers of collagen. The top fibronectin also runs over two proteoglycans, shown as long strands with brush-like structures extending above and below them along their entire lengths.
Glycoproteins are carbohydrate-protein conjugates in which the glycans are branched and are much smaller and more structurally diverse than the huge glycosaminoglycans of proteoglycans. The carbohydrate is attached at its anomeric carbon through a glycosidic link to the —OH of a Ser or Thr residue (O-linked), or through an N-glycosyl link to the amide nitrogen of an Asn residue (N-linked) (Fig. 7-27). N-glycosyl bonds join the anomeric carbon of a sugar to a nitrogen atom in glycoproteins and nucleotides (see Fig. 8-1). Some glycoproteins have a single oligosaccharide chain, but many have more than one; the carbohydrate may constitute from 1% to 70% of the glycoprotein by mass. About half of all proteins of mammals are glycosylated.
FIGURE 7-27 Oligosaccharide linkages in glycoproteins. (a) O-linked oligosaccharides have a glycosidic bond to the hydroxyl group of Ser or Thr residues (light red), illustrated here with GalNAc as the sugar at the reducing end of the oligosaccharide. One simple chain and one complex chain are shown. (b) N-linked oligosaccharides have an N-glycosyl bond to the amide nitrogen of an Asn residue (green), illustrated here with GlcNAc as the terminal sugar. Three common types of oligosaccharide chains that are N-linked in glycoproteins are shown. A complete description of oligosaccharide structure requires specification of the position and stereochemistry (α or β) of each glycosidic linkage.
Part a is titled O-linked and shows a six-membered ring labeled G lowercase A L uppercase N A lowercase C bonded to S e r. The ring has O at its top right vertex between C 5 and C 1. C 1 is bonded to H above the ring and has a bond that curves down and to the right to O of S e r; C 2 is bonded to H above the ring and to N H below the ring that is further bonded to C that is double bonded to O and bonded to C H 3; C 3 is bonded to O H above the ring and to H below the ring; C 4 is bonded to O H above the ring and to H below the ring; C 5 is bonded to C 6 above the ring and to H below the ring; and C 6 is bonded to 2 H and O that is further bonded above. S e r is shown as a brown highlighted molecule with a vertical chain. C at the top is bonded to a wavy line above, double bonded to O on the right, and bonded to a central C H below that is further bonded to N H below that is further bonded to a wavy line. The central C H is bonded to C H 2 on the left that is further bonded to O that is bonded to C 1 of the ring. A key shows that a blue square is G lowercase L C uppercase N A lowercase C, a green circle is M lowercase A N, a yellow circle is G lowercase A L, a purple triangle is N lowercase E U 5 uppercase A lowercase C, and a yellow square is G lowercase A lowercase L uppercase N A lowercase C. Two examples of O-linked oligosaccharides are shown using these symbols. The first example has a brown bar labeled S e r / T h r. On the left, S e r is bonded to a yellow square that is further bonded to a purple triangle. The second example also has a brown bar labeled S e r / T h r. On the left, S e r is bonded to a yellow square that is bonded to a yellow circle above and a blue square below. The yellow circle above is further bonded to a blue square bonded to a yellow circle bonded to a blue square bonded to a yellow circle bonded to a yellow square and a purple square. The blue square below is further bonded to a yellow circle bonded to a purple diamond. Part b is titled N-linked and shows a six-membered ring labeled G lowercase L C uppercase N A lowercase C bonded to A s n. The ring has O at the top right vertex between C 5 and C 1. C 1 is bonded to N of A s n above the ring and to H below the ring; C 2 is bonded to H above the ring and to N H below the ring that is further bonded to C that is double bonded to O and further bonded to C H 3; C 3 is bonded to O H above the ring and H below the ring; C 4 is bonded to H above the ring and to O that is further bonded below the ring; C 5 is bonded to C 6 above the ring and to H below the ring; and C 6 is bonded to 2 H and O H. A s n has a top C that is bonded to a wavy line above, double bonded to O, and bonded to a central C below that is bonded to H on the right, N H below that is further bonded to a wavy line, and C H 2 on the left that is further bonded to C that is double bonded to O and further bonded to N H that is bonded to C 1 of the ring. Three examples are shown, each with a bar labeled A s n that has a chain extending to the left. The first chain is blue square, blue square, green circle bonded to green circle above and green circle below. The top green circle is further bonded to two green circles and the bottom green circle is bonded to a blue square that is bonded to a yellow circle. The second chain is blue square, blue square, green circle bonded to a green circle above and a green circle below. The top green circle is further bonded to two chains of two green circles each. The bottom green circle is further bonded to two more green circles. The third chain is blue square, blue square, and then green circle bonded to a green circle above and a green circle below. The top green circle is further bonded to blue square, yellow circle, and purple diamond. The bottom green circle is bonded to two chains that each have blue square, yellow circle, and purple diamond.
As we shall see in Chapter 11, the external surface of the plasma membrane has many membrane glycoproteins with arrays of covalently attached oligosaccharides of varying complexity. Mucins are secreted or membrane glycoproteins that can contain large numbers of O-linked oligosaccharide chains. Mucins are present in most secretions; they are what gives mucus its characteristic slipperiness. Many of the proteins secreted by eukaryotic cells are glycoproteins, including most of the proteins of blood. For example, immunoglobulins (antibodies) and certain hormones, such as follicle-stimulating hormone, luteinizing hormone, and thyroid-stimulating hormone, are glycoproteins. Some of the proteins secreted by the pancreas are glycosylated, as are most of the proteins contained in lysosomes. Many milk proteins, including the major whey protein α-lactalbumin, are glycosylated.
Glycomics is the systematic characterization of all carbohydrate components of a given cell or tissue, including those attached to proteins and to lipids. For glycoproteins, this also means determining which proteins are glycosylated and where in the amino acid sequence each oligosaccharide is attached. This is a challenging undertaking, but worthwhile because of the potential insights it offers into normal patterns of glycosylation and the ways in which they are altered during development or in genetic diseases or cancer. Current methods of characterizing the entire carbohydrate complement of cells depend heavily on sophisticated application of nuclear magnetic resonance and mass spectrometry.
The structures of a large number of O- and N-linked oligosaccharides from a variety of glycoproteins are known; Figures 7-20 and 7-27 show a few typical examples. In Chapter 27, we consider the mechanisms by which specific proteins acquire specific oligosaccharide moieties.
What are the biological advantages of adding oligosaccharides to proteins? The very hydrophilic carbohydrate clusters alter the polarity and solubility of the proteins with which they are conjugated. Figs. 27-38, 27-39). When numerous negatively charged oligosaccharide chains are clustered in a single region of a protein, the charge repulsion among them favors the formation of an extended, rodlike structure in that region. The bulkiness and negative charge of oligosaccharide chains also protect some proteins from attack by proteolytic enzymes.
Oligosaccharide chains that are attached to newly synthesized proteins in the endoplasmic reticulum (ER) and elaborated in the Golgi complex serve as destination labels and also act in protein quality control, targeting misfolded proteins for degradation (seeBeyond these global physical effects on protein structure, there are also more specific biological effects of oligosaccharide chains in glycoproteins (Section 7.4). The importance of normal protein glycosylation is clear from at least 40 different genetic disorders of glycosylation having been found in humans, all causing severely defective physical or mental development; some of these disorders are fatal.
Glycoproteins are not the only cellular components that bear complex oligosaccharide chains; some lipids, too, have covalently bound oligosaccharides. Gangliosides are membrane lipids of eukaryotic cells in which the polar head group, the part of the lipid that forms the outer surface of the membrane, is a complex oligosaccharide containing a sialic acid (Fig. 7-9) and other monosaccharide residues. Some of the oligosaccharide moieties of gangliosides, such as those that determine human blood groups (see Fig. 10-13), are identical with those found in certain glycoproteins, which therefore also contribute to blood group type. Like the oligosaccharide moieties of glycoproteins, those of membrane lipids are generally, perhaps always, found on the outer face of the plasma membrane.
Fig. 7-28). E. coli has similar but unique lipopolysaccharides. The lipid A portion of the lipopolysaccharides of some bacteria is called endotoxin; its toxicity to humans and other animals is responsible for the dangerously lowered blood pressure that occurs in toxic shock syndrome resulting from gram-negative bacterial infections.
Lipopolysaccharides are the dominant surface feature of the outer membrane of gram-negative bacteria such as Escherichia coli and Salmonella typhimurium. These molecules are prime targets of the antibodies produced by the vertebrate immune system in response to bacterial infection and are therefore important determinants of the serotype of bacterial strains. (Serotypes are strains that are distinguished on the basis of antigenic properties.) The lipopolysaccharides of S. typhimurium contain six fatty acids bound to two glucosamine residues, one of which is the point of attachment for a complex oligosaccharide (FIGURE 7-28 Bacterial lipopolysaccharides. Schematic diagram of the lipopolysaccharide of the outer membrane of S. typhimurium. Kdo is 3-deoxy-d-manno-octulosonic acid (previously called ketodeoxyoctonic acid); Hep is l-glycero-d-manno-heptose; AbeOAc is abequose (a 3,6-dideoxyhexose) acetylated on one of its hydroxyls. Different bacterial species have subtly different lipopolysaccharide structures, but they have in common a lipid A region, composed of six fatty acid residues and two phosphorylated glucosamines, a core oligosaccharide, and an “O-specific” chain, which is the principal determinant of the serotype (immunological reactivity) of the bacterium. The outer membranes of the gram-negative bacteria S. typhimurium and E. coli contain so many lipopolysaccharide molecules that the cell surface is almost completely covered with O-specific chains.
A key shows that a blue square is G lowercase L C uppercase N A lowercase C, a green circle is M lowercase A N, a blue circle is G lowercase L C, a yellow circle is G lowercase A L, a hexagon labeled A is A lowercase B E uppercase O A lowercase C, a hexagon labeled R is R lowercase H A, a hexagon labeled K is K lowercase D O, and a hexagon labeled H is H lowercase E P. The molecule is divided into three parts: O-specific chain, core, and lipid A. The top portion of lipid A has two sugar rings. The left-hand ring has a chair shape with O at the top right corner with a bond extending down to C 1, which has a bond extending up to O that is further bonded to C 6 of the right-hand ring. C 2 is bonded to N H below the ring that is further bonded to C that is double bonded to O and further bonded to C that is bonded to an 11-carbon chain below and hashed wedge bonded to O that is further bonded to C double bonded to O and to an 11-carbon chain below; C 3 is bonded to O above the ring that has a bond downward to C that is double bonded to O and further bonded to C that is bonded to an 11-carbon chain below and hashed wedge bonded to O that is further bonded to C that is double bonded to O and further bonded to an 11-carbon chain extending downward; C 4 is bonded to O below the ring that is further bonded to P that is double bonded to O above, bonded to O minus on the left, and bonded to O H below; C 5 is bonded to C 6 above the ring that is further bonded to O that is bonded to a hexagon labeled K in the core. The second sugar ring has O at the top right vertex between C 5 and C 1. C 1 is bonded to O below the ring that is further bonded to P that is double bonded to O above, bonded to O minus to the back right, and bonded to O H to the front right; C 2 is bonded to N H below the ring that is further bonded to C that is double bonded to O and single bonded to C that is hashed wedge bonded to O H and further bonded to an 11-carbon chain; C 3 is bonded to O above the ring that has a bond extending down to C that is double bonded to O and further bonded to C that is hashed wedge bonded to O H and further bonded to an 11-carbon chain. All of the 11-carbon chains in lipid A extend vertically downward. A bond between a hexagon labeled K in the core to O of the left-hand sugar ring connects the core with lipid A. The hexagon labeled K is connected on the right to a chain of two hexagons labeled K and is connected above to a hexagon labeled H that is connected above to a hexagon labeled H that is connected to a hexagon labeled H on the right and to a blue circle above. The blue circle is connected to a yellow circle on the right and to a yellow circle above that is connected to a blue circle above that is connected to a blue square on its right and a yellow circle above where the O-specific chain starts that is further connected to a hexagon labeled R that is bonded to a green circle above that is further bonded on the right and bonded to a yellow circle above that is connected to a green circle on the right and to a hexagon labeled R above. The hexagon labeled R is connected to a green circle above that is connected to a hexagon labeled A on the right and to a yellow circle of a repeating subunit above. The repeating subunit has the yellow circle at the bottom, which is bonded to a blue circle on its right and a hexagon labeled R above that is further bonded to a green circle above that is bonded to a hexagon labeled A on tis right. Text next to the subunit reads n greater than or equal to 10.