Analysis of the genome sequence of the flowering plant Arabidopsis thaliana

Cited by: 6625
Authors
Kaul, S
Koo, HL
Jenkins, J
Rizzo, M
Rooney, T
Tallon, LJ
Feldblyum, T
Nierman, W
Benito, MI
Lin, XY
Town, CD
Venter, JC
Fraser, CM
Tabata, S
Nakamura, Y
Kaneko, T
Sato, S
Asamizu, E
Kato, T
Kotani, H
Sasamoto, S
Ecker, JR
Theologis, A
Federspiel, NA
Palm, CJ
Osborne, BI
Shinn, P
Conway, AB
Vysotskaia, VS
Dewar, K
Conn, L
Lenz, CA
Kim, CJ
Hansen, NF
Liu, SX
Buehler, E
Altafi, H
Sakano, H
Dunn, P
Lam, B
Pham, PK
Chao, Q
Nguyen, M
Yu, GX
Chen, HM
Southwick, A
Lee, JM
Miranda, M
Toriumi, MJ
Davis, RW
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
[2] Kazusa DNA Res Inst, Kisarazu, Chiba 292, Japan
[3] Univ Penn, Dept Biol, Inst Plant Sci, Philadelphia, PA 19104 USA
[4] Univ Calif Berkeley, USDA, Ctr Plant Gene Express, Albany, CA 94710 USA
[5] Stanford Genome Technol Ctr, Palo Alto, CA 94304 USA
[6] Salk Inst Biol Studies, La Jolla, CA 92037 USA
[7] Exelixis Inc, San Francisco, CA 94083 USA
[8] AGOWA Gmbh, D-12489 Berlin, Germany
[9] John Innes Ctr, Norwich NR4 7UH, Norfolk, England
[10] QIAGEN Gmbh, D-40724 Hilden, Germany
[11] Plant Res Int, Greenomics, NL-6700 AA Wageningen, Netherlands
[12] GATC Gmbh, D-78467 Constance, Germany
[13] SRD Gmbh, D-61440 Oberursel, Germany
[14] Univ Ghent VIB, Dept Plant Genet, B-9000 Ghent, Belgium
[15] Katholieke Univ Leuven, Lab Gene Technol, B-3001 Leuven, Belgium
[16] Genoscope & CNRS FRE2231, F-91057 Evry, France
[17] Genotype Gmbh, D-69259 Wilhelmsfeld, Germany
[18] European Mol Biol Lab, Biochem Instrumentat Program, D-69117 Heidelberg, Germany
[19] LION Biosci AG, D-69120 Heidelberg, Germany
[20] MWG Biotech AG, D-69117 Ebersberg, Germany
[21] Univ Padua, CRIBI, I-35131 Padua, Italy
[22] Washington Univ, St Louis Sch Med, Washington Univ Genome Sequencing Ctr, St Louis, MO 63108 USA
[23] Cold Spring Harbor Lab, Lita Annenberg Hazen Genome Ctr, Cold Spring Harbor, NY 11724 USA
[24] Celera Genomics, Foster City, CA 94494 USA
[25] Cold Spring Harbor Lab, Plant Biol Grp, Cold Spring Harbor, NY 11724 USA
[26] Max Planck Inst Biochem, GSF Forschungszentrum Umwelt & Gesundheit, Munich Informat Ctr Protein Sequences, D-82152 Munich, Germany
[27] Inst Genomic Res, Rockville, MD 20850 USA
[28] John Innes Ctr, Molec Genet Dept, Norwich NR4 7UH, Norfolk, England
[29] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
[30] Cereon Genomics LLC, Cambridge, MA 02139 USA
[31] Max Planck Gesell, Max Delbruck Lab, D-50829 Cologne, Germany
[32] John Innes Ctr, Brassicas & Oilseeds Res Dept, Norwich NR4 7UJ, England
[33] Ctr Natl Sequencage, Genoscope, F-91057 Evry, France
[34] Univ Ulm, Molek Bot, D-89069 Ulm, Germany
[35] McGill Univ, Dept Biol, Montreal, PQ H3A 1B1, Canada
[36] Univ Chicago, Howard Hughes Med Inst, Chicago, IL USA
[37] Washington Univ, Biol Dept, St Louis, MO 63130 USA
[38] Univ Wisconsin, Biotechnol Ctr, Madison, WI 53706 USA
[39] Univ Calif Davis, Sect Plant Biol, Davis, CA 95616 USA
[40] Univ Arizona, Dept Plant Sci, Tucson, AZ 85721 USA
[41] Univ Arizona, Dept Molec & Cell Biol, Tucson, AZ 85721 USA
[42] ZMBP, Ctr Plant Molec Biol, Entwicklungsgenet, D-72076 Tubingen, Germany
[43] Calif Inst Technol, Div Biol, Pasadena, CA 91125 USA
[44] Univ N Carolina, Dept Biol, Coker Hall, Chapel Hill, NC 27599 USA
[45] John Innes Ctr, Sainsbury Lab, Norwich NR4 7UI, Norfolk, England
[46] Salk Inst Biol Studies, Howard Hughes Med Inst, Plant Biol Lab, La Jolla, CA 92037 USA
[47] Carnegie Inst, Stanford, CA 94305 USA
DOI
10.1038/35048692
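Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09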
Abstract
The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Pages: 796-815
Page count: 20