当前位置: 首页 > 期刊 > 《核酸研究》 > 2006年第4期 > 正文
编号:11367425
Nucleic acid visualization with UCSF Chimera
http://www.100md.com 《核酸研究医学期刊》
     Computer Graphics Laboratory, Department of Pharmaceutical Chemistry, University of California 600 16th Street, San Francisco, CA 94143-2240, USA 1Department of Plant and Microbial Biology 111 Koshland Hall # 3102 University of California Berkeley, CA 94720-3102, USA

    *To whom correspondence should be addressed. Tel: +1 415 476 2299; Fax: +1 415 502 1755; Email: tef@cgl.ucsf.edu

    ABSTRACT

    With the increase in the number of large, 3D, high-resolution nucleic acid structures, particularly of the 30S and 50S ribosomal subunits and the intact bacterial ribosome, advancements in the visualization of nucleic acid structural features are essential. Large molecular structures are complicated and detailed, and one goal of visualization software is to allow the user to simplify the display of some features and accent others. We describe an extension to the UCSF Chimera molecular visualization system for the purpose of displaying and highlighting nucleic acid characteristics, including a new representation of sugar pucker, several options for abstraction of base geometries that emphasize stacking and base pairing, and an adaptation of the ribbon backbone to accommodate the nucleic acid backbone. Molecules are displayed and manipulated interactively, allowing the user to change the representations as desired for small molecules, proteins and nucleic acids. This software is available as part of the UCSF Chimera molecular visualization system and thus is integrated with a suite of existing tools for molecular graphics.

    INTRODUCTION

    There has been an enormous increase in the size and number of deposited structures of nucleic acids in recent years, including high-resolution X-ray crystal structures of two riboswitches (1,2), two ribonuclease P structures (3,4), several ribozymes (5,6) and structures of multiple macromolecular-assemblages that include nucleic acids, such as the 30S (7) and 50S (8,9) ribosomal subunits, as well as the intact ribosome (10), and the nucleosome core particle (11,12). This growth is reflected in the increase of nucleic acid structures available from the Nucleic Acid Database (NDB) (13) and structure-related databases, such as the Structural Classification of RNA (SCOR) database (14), a classification of 3D structural motifs, with larger structures having many more motifs than smaller structures. SCOR doubled in the number of structures characterized between 2002 and 2004, and simultaneously increased by nearly 20-fold in the number of structural features characterized, from 423 internal and hairpin loops in version 1.1 to 8270 in version 2.0.3 (15).

    The viewing of macromolecular structures is improved by the use of tools that highlight, or if needed, abstract the details, of molecular features. And while many visualization tools exist and representations are standardized for protein structures, these tools are often inadequate for the structural features of nucleic acids. It is a challenge of visualization methods to display and emphasize key concepts and features without overwhelming the viewer and while maintaining the accuracy of the data. For proteins, abstractions exist such as stylized ribbon representations of alpha helices and beta sheets accenting the different secondary structures and displaying the direction of the chain, and for displaying the backbone and side chains (16); however, these methods fall short for nucleic acids, where the structural features are quite different from proteins.

    Here, we present a new tool for nucleic acid visualization that highlights the features of nucleic acids. We present a new representation to emphasize sugar pucker, a modification to the backbone ribbon for the nucleic acid backbone, and several options for displaying bases and their interactions, emphasizing base pairing and base stacking.

    MATERIALS AND METHODS

    UCSF Chimera (17) (henceforth referred to as ‘Chimera’) is a molecular graphics program designed to maximize interactive visualization and collaboration, that also allows users to easily write additional tools to extend its capabilities. Chimera is available free of charge to academic and non-profit users and is available on a wide array of platforms, including Microsoft Windows, Apple OS X and Linux. The nucleic acid visualization features presented here are an extension to the basic Chimera visualization system, and are currently available within the Chimera package and available from the Chimera web site (http://www.cgl.ucsf.edu/chimera/).

    The new features available in the Nucleotides extension are discussed below, and examples of the features can be seen in Figures 2–7 and in the image gallery on the Chimera web site. All features are easily accessible via a menu interface within Chimera, as shown in Figure 1.

    Figure 1 The Nucleotides menu for the nucleic acid visualization tool, showing the many options for the display of the backbone, sugars and bases. Chimera has separate menus for controlling the parameters used for coloring and for drawing ribbons. See http://www.cgl.ucsf.edu/chimera/docs/ContributedSoftware/nucleotides/nucleotides.html for a detailed description.

    Representations of sugars

    The furanose ring found in nucleic acids generally takes either the envelope form, with four atoms in a plane, or the twist form, with three atoms in a plane and two adjacent atoms on either side of that plane (18). We have created a new representation to emphasize the plane and twist forms of the ring. This representation is unique to the Chimera Nucleotides extension.

    In order to elucidate sugar pucker, we fill the furanose ring by drawing either two or four planes. For the envelope form, a single atom is out of the plane, so we draw two planes: one defined by the four atoms in-plane, and a triangle that extends to the outlying atom. For the twist form, we emphasize the location of the twist by drawing one plane defined by the three atoms in-plane together with a fourth point located towards the twist of the ring and combine this with three triangles connecting the other atoms (Figure 2). The decision of when to consider atoms coplanar and thus which form to use when drawing a sugar is driven by geometric considerations only; in the future we plan to add an option so that the user can adjust the threshold for controlling this behavior. (Source code for the representation of sugars is available in the Supplementary Data.)

    Figure 2 Sugar pucker, in the envelope form (red) and twist form (yellow). The envelope form is highlighted using two planes, while the twist form is accented by the drawing of four planes, achieved by the introduction of a non-atom vertex.

    Another abstraction of the sugar is as a tube that connects the base to the backbone (atoms or ribbon). The tube is drawn from the C4' atom of the sugar to the N1 atom of the base for pyrimidines or N9 for purines. If the user chooses to display the glycosidic bond, then the tube terminates at the sugar C1'. Thus, the simplified connection of base to backbone can either by shown as a single cylinder (as in the G–C pairs in Figure 4) or as two cylinders broken at the C1' (as in Figures 3 and 5).

    Figure 3 Backbone ribbon representations of B-form DNA (PDB identifier 1bna) (27). In the image on the left, the ribbon is drawn with the C1' atom, located in the sugar, as the orientation atom and the plane of the ribbon along the sugar. The image on the right shows the new nucleic acid ribbon representation, with the ribbon axis rotated by 90°.

    Figure 5 Netropsin bound to double-stranded DNA (28) (PDB identifier 6bna). Each strand of the DNA is colored with the ‘rainbow’ option, with the base colors changing over a range from blue to red from the 5' to the 3' end, respectively. The DNA is shown with the backbone represented as a smooth ribbon, the sugars drawn as elliptical tubes and the bases as ellipsoids. Netropsin is colored by element and shown in the ball-and-stick representation, with a transparent pink molecular surface.

    Backbone ribbon

    In order to more accurately represent the backbone of the nucleic acid, we modified the ribbon representation that is used for proteins. For proteins, we draw a ribbon with the backbone oxygen as the orientation atom, and the backbone alpha carbon as the guide atom, with the resulting ribbon perpendicular to the side chain. For nucleic acids, we choose C1' as the orientation atom and C5' as the guide atom, both located in the sugar. Similar to the method first described by Carson and Bugg (19), our ribbon representations are based on B-splines (20) with the coordinates of the guide atoms used as spline control points and the orientation atom used to determine the plane of the ribbon. But with the default approach used for proteins, the resulting ribbon was parallel to the base rather than perpendicular. We created a new option for ribbon representations for nucleic acids such that the ribbon axis is rotated by 90°, and the resulting backbone ribbon is perpendicular to the bases (Figure 3). The original representation of the ribbon remains available as the ‘classic ribbon’ option in the Nucleotides menu. Additional options to the ribbon, such as changing the cross-section of the ribbon, are available as a standard option within Chimera's Ribbon Style Editor menu and the color of the backbone ribbon is customizable by the user.

    Bases

    Base stacking is a key structural feature and is important for the stabilization of nucleic acids (18). Visual emphasis of this feature was one of the motivating factors for building the Nucleotides extension to Chimera. Four representations of bases are available: filled, box slab (or box), elliptical tube slab and ellipsoid slab. Examples are shown in Figure 4. In order to display the orientation of the base, each base is drawn with a rounded protuberance (a dot) at the center of each ring on the positive face of the base. The positive face of the base is defined by a coordinate frame such that, in an idealized right-handed A-form or B-form helix, the X-axis points toward the major groove, the Y-axis is parallel to a C1'–C1' vector within paired bases, and the Z-axis points along the 5'–3' direction. In the case of A- or B-form helices, dots appear on the 5' side of the bases.

    Figure 4 Base representations, including, from top to bottom, filled rings, boxes, ellipsoids and elliptical tubes.

    The filled base option simply fills the rings of the pyrimidines and purines. By selecting this option, the user preserves the purine versus pyrimidine identity of the base, but also emphasizes the position of the base by simply making the rings much more visible.

    The slab options are designed to emphasize base pairing and stacking. With bases displayed in the box formation the ‘spiral stair case’ quality of a helix is much more prominent than with the filled or atom/bond representation (as shown in Figure 4). Several options for the position and size of the slab with respect to the base are available, as well as a custom setting, and these settings are easily obtained by the user through a simple interface. The default setting for the slab option draws the slab as a box, with the purine box anchored at the base N9 and the pyrimidine box anchored at the base N1. The default slab covers most of the base, and extends slightly beyond the base rings to emphasize base pairing. The user can also adjust the thickness of the box.

    Additional representations of bases are displayed in Figure 4. These include the platter-like ellipsoid and the elliptical tube, which is elliptical in cross-section, along the axis of the -orbitals of the base, but with the rings abstracted as squares or rectangles, much like the slab representation.

    In addition to multiple representation of the bases, the Nucleotides extension also offers a quick way to color bases by the NDB convention (22) by selecting the ‘NDB Colors’ button, with yellow for C, red for A, green for G, cyan for U, and blue for T. Bases may also be colored using the Rainbow tool, varying in color from blue at the 5' end to red at the 3' end of a chain, and may be colored in numerous other ways, all standard within Chimera, including by atom, by residue and by chain. The backbone ribbon color can differ from the color of the base.

    RESULTS

    Several examples of our new representations of nucleic acids can be seen in Figures 5–7, for both DNA and RNA molecules and their complexes. (Refer to the figure captions for explanatory details.) The representations described here are easily created from menu- and command-line options within Chimera and the representations are drawn on-demand, quickly and interactively, and publication-quality figures can be generated from the interactive sessions.

    Figure 7 The Escherichia coli L25 ribosomal protein with 5S ribosomal RNA fragment (29) (PDB identifier 1dfu). Atomic interactions are highlighted between the RNA and the protein by filling the base and sugar rings and leaving the nucleic acid backbone fully represented. The protein is drawn in the stick representation, and the bound metal ions are in yellow. Hydrogen bonds between the RNA and protein were calculated by Chimera and are drawn in yellow.

    DISCUSSION

    This report describes a new set of tools for nucleic acid visualization that are packaged as extensions to the UCSF Chimera molecular visualization suite. The original goals of the extensions were to emphasize base stacking, rearrange the ribbon and attempt to create and extend the kinds of representations often seen in textbooks (23). The new representations for nucleic acids include filled bases, alternate representations of bases as slabs and ellipsoids, a modified ribbon backbone and a new representation of sugar geometry. While other approaches provide beautiful representations, e.g. the ribbon representations of nucleic acids available in the Ribbons (24) and DRAWNA (25) programs, and are integrated into existing, interactive software packages, such as the nuccyl extension (http://www.biosci.ki.se/groups/ljo/software/nuccyl.html) to PyMol (http://pymol.sourceforge.net/), our tools are unique in their representations of sugar pucker, base sidedness and ease of changing between alternative conventions. As part of UCSF Chimera, the Nucleotides extension is integrated with a mature, well-supported software suite and can easily be used in conjunction with Chimera's chemical knowledge (e.g. hydrogen bonding and atom typing) and multi-scale models (26) for large molecules (i.e. ribosomes, viruses, nucleosome core).

    SUPPLEMENTARY DATA

    Supplementary Data are available at NAR Online.

    Figure 6 The Thermus thermophilus 30S ribosomal subunit (7) (PDB identifier 1j5e ). Drawn with the Nucleotides extension and Chimera's MultiScale extension (26), the single-stranded RNA chain is colored with the rainbow option, and the proteins are shown as low-resolution surfaces. From this perspective, the helices appear to be formed from local interactions within the RNA, e.g. red bases pairing with other red bases and green bases pairing with other green bases. Tertiary interactions between the differently colored helices are highlighted as the different colored helices are brought together. Distinct from the other figures, the ribbon representation of the 16S ribosomal RNA has a rounded cross-section, and thus its smooth edges make the orientation of the ribbon less visible.

    ACKNOWLEDGEMENTS

    This work has been supported by NIH grants P41 RR001081 (to TE Ferrin) and R01 GM066199 (to SR Holbrook). The authors thank Nikolai B. Ulyanov for numerous helpful discussions, and Steven E. Brenner and Stephen R. Holbrook for their encouragement. Funding to pay the Open Access publication charges for this article was provided by NIH grant P41 RR001081.

    REFERENCES

    Batey, R.T., Gilbert, S.D., Montange, R.K. (2004) Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine Nature, 432, 411–415 .

    Serganov, A., Yuan, Y.R., Pikovskaya, O., Polonskaia, A., Malinina, L., Phan, A.T., Hobartner, C., Micura, R., Breaker, R.R., Patel, D.J. (2004) Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs Chem. Biol, . 11, 1729–1741 .

    Krasilnikov, A.S., Yang, X., Pan, T., Mondragon, A. (2003) Crystal structure of the specificity domain of ribonuclease P Nature, 421, 760–764 .

    Krasilnikov, A.S., Xiao, Y., Pan, T., Mondragon, A. (2004) Basis for structural diversity in homologous RNAs Science, 306, 104–107 .

    Adams, P.L., Stahley, M.R., Kosek, A.B., Wang, J., Strobel, S.A. (2004) Crystal structure of a self-splicing group I intron with both exons Nature, 430, 45–50 .

    Golden, B.L., Kim, H., Chase, E. (2005) Crystal structure of a phage Twort group I ribozyme-product complex Nature Struct. Mol. Biol, . 12, 82–89 .

    Wimberly, B.T., Brodersen, D.E., Clemons, W.M., Jr, Morgan-Warren, R.J., Carter, A.P., Vonrhein, C., Hartsch, T., Ramakrishnan, V. (2000) Structure of the 30S ribosomal subunit Nature, 407, 327–339 .

    Ban, N., Nissen, P., Hansen, J., Moore, P.B., Steitz, T.A. (2000) The complete atomic structure of the large ribosomal subunit at 2.4 A resolution Science, 289, 905–920 .

    Harms, J., Schluenzen, F., Zarivach, R., Bashan, A., Gat, S., Agmon, I., Bartels, H., Franceschi, F., Yonath, A. (2001) High resolution structure of the large ribosomal subunit from a mesophilic eubacterium Cell, 107, 679–688 .

    Schuwirth, B.S., Borovinskaya, M.A., Hau, C.W., Zhang, W., Vila-Sanjurjo, A., Holton, J.M., Cate, J.H. (2005) Structures of the bacterial ribosome at 3.5 A resolution Science, 310, 827–834 .

    Luger, K., Mader, A.W., Richmond, R.K., Sargent, D.F., Richmond, T.J. (1997) Crystal structure of the nucleosome core particle at 2.8 A resolution Nature, 389, 251–260 .

    Edayathumangalam, R.S., Weyermann, P., Gottesfeld, J.M., Dervan, P.B., Luger, K. (2004) Molecular recognition of the nucleosomal ‘supergroove’ Proc. Natl Acad. Sci. USA, 101, 6864–6869 .

    Holbrook, S.R. (2005) RNA structure: the long and the short of it Curr. Opin. Struct. Biol, . 15, 302–308 .

    Klosterman, P.S., Tamura, M., Holbrook, S.R., Brenner, S.E. (2002) SCOR: a Structural Classification of RNA database Nucleic Acids Res, . 30, 392–394 .

    Tamura, M., Hendrix, D.K., Klosterman, P.S., Schimmelman, N.R., Brenner, S.E., Holbrook, S.R. (2004) SCOR: Structural Classification of RNA, version 2.0 Nucleic Acids Res, . 32, D182–D184 .

    Richardson, J.S. (1981) The anatomy and taxonomy of protein structure Adv. Protein Chem, . 34, 167–339 .

    Pettersen, E.F., Goddard, T.D., Huang, C.C., Couch, G.S., Greenblatt, D.M., Meng, E.C., Ferrin, T.E. (2004) UCSF Chimera—a visualization system for exploratory research and analysis J. Comput. Chem, . 25, 1605–1612 .

    Saenger, W. Principles of Nucleic Acid Structure, (1983) New York, NY Springer-Verlag .

    Carson, M. and Bugg, C.E. (1986) Algorithm for ribbon models of proteins J. Mol. Graph, . 4, 121–122 .

    Foley, J.D., van Dam, A., Feiner, S.K., Hughes, J.F. Computer Graphics: Principles and Practice, 2nd edn, . (1990) Reading, MA Addison-Wesley .

    Olson, W.K., Bansal, M., Burley, S.K., Dickerson, R.E., Gerstein, M., Harvey, S.C., Heinemann, U., Lu, X.J., Neidle, S., Shakked, Z., et al. (2001) A standard reference frame for the description of nucleic acid base-pair geometry J. Mol. Biol, . 313, 229–237 .

    Lu, X.J. and Olson, W.K. (2003) 3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures Nucleic Acids Res, . 31, 5108–5121 .

    Branden, C.-I. and Tooze, J. Introduction to Protein Structure, 1st edn, . (1991) Garland Publishing, Inc .

    Carson, M. (1997) Ribbons In Carter, C.W., Jr and Sweet, R.M. (Eds.). Methods in Enzymology, Macromolecular Crystallography Part B, Academic Press 277, pp. 493–502 .

    Massire, C., Gaspin, C., Westhof, E. (1994) DRAWNA: a program for drawing schematic views of nucleic acids J. Mol. Graph, . 12, 201–206 196 .

    Goddard, T.D., Huang, C.C., Ferrin, T.E. (2005) Software extensions to UCSF Chimera for interactive visualization of large molecular assemblies Structure, 13, 473–482 .

    Drew, H.R., Wing, R.M., Takano, T., Broka, C., Tanaka, S., Itakura, K., Dickerson, R.E. (1981) Structure of a B-DNA dodecamer: conformation and dynamics Proc. Natl Acad. Sci. USA, 78, 2179–2183 .

    Kopka, M.L., Yoon, C., Goodsell, D., Pjura, P., Dickerson, R.E. (1985) Binding of an antitumor drug to DNA, Netropsin and C-G-C-G-A-A-T-T-BrC-G-C-G J. Mol. Biol, . 183, 553–563 .

    Lu, M. and Steitz, T.A. (2000) Structure of Escherichia coli ribosomal protein L25 complexed with a 5S rRNA fragment at 1.8-A resolution Proc. Natl Acad. Sci. USA, 97, 2023–2028 .(Gregory S. Couch, Donna K. Hendrix1 and )