Sequence and analysis of rice chromosome 4

Cited by: 418
Authors
Feng, Q
Zhang, YJ
Hao, P
Wang, SY
Fu, G
Huang, YC
Li, Y
Zhu, JJ
Liu, YL
Hu, X
Jia, PX
Zhang, Y
Zhao, Q
Ying, K
Yu, SL
Tang, YS
Weng, QJ
Zhang, L
Lu, Y
Mu, J
Lu, YQ
Zhang, LS
Yu, Z
Fan, DL
Liu, XH
Lu, TT
Li, C
Wu, YR
Sun, TG
Lei, HY
Li, T
Hu, H
Guan, JP
Wu, M
Zhang, RQ
Zhou, B
Chen, ZH
Chen, L
Jin, ZQ
Wang, R
Yin, HF
Cai, Z
Ren, SX
Lv, G
Gu, WY
Zhu, GF
Tu, YF
Jia, J
Zhang, Y
Chen, J
Affiliations
[1] Chinese Acad Sci, Shanghai Inst Biol Sci, Natl Ctr Gene Res, Shanghai 200233, Peoples R China
[2] Chinese Natl Human Genome Ctr Shanghai, Shanghai 201203, Peoples R China
[3] Chinese Acad Sci, Inst Genet & Dev Biol, Beijing 100101, Peoples R China
[4] Yangzhou Univ, Yangzhou 225009, Jiangsu, Peoples R China
[5] Univ Wisconsin, Dept Hort, Madison, WI 53706 USA
DOI
10.1038/nature01183
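Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 [Science]; 0710 [Biology]; 09 [Agriculture];
Abstract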
Rice is the principal food for over half of the population of the world. With its genome size of 430 megabase pairs (Mb), the cultivated rice species Oryza sativa is a model plant for genome research (1). Here we report the sequence analysis of chromosome 4 of O. sativa, one of the first two rice chromosomes to be sequenced completely (2). The finished sequence spans 34.6 Mb and represents 97.3% of the chromosome. In addition, we report the longest known sequence for a plant centromere, a completely sequenced contig of 1.16 Mb corresponding to the centromeric region of chromosome 4. We predict 4,658 protein-coding genes and 70 transfer RNA genes. A total of 1,681 predicted genes match available unique rice expressed sequence tags. Transposable elements have a pronounced bias towards the euchromatic regions, indicating a close correlation of their distributions to genes along the chromosome. Comparative genome analysis between cultivated rice subspecies shows that there is an overall syntenic relationship between the chromosomes and divergence at the level of single-nucleotide polymorphisms and insertions and deletions. By contrast, there is little conservation in gene order between rice and Arabidopsis.
Pages: 316-320
Page count: 6
Related papers
30 in total
[1]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]
Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[3]
An integrated physical and genetic map of the rice genome [J].
Chen, MS ;
Presting, G ;
Barbazuk, WB ;
Goicoechea, JL ;
Blackmon, B ;
Fang, FC ;
Kim, H ;
Frisch, D ;
Yu, YS ;
Sun, SH ;
Higingbottom, S ;
Phimphilai, J ;
Phimphilai, D ;
Thurmond, S ;
Gaudette, B ;
Li, P ;
Liu, JD ;
Hatfield, J ;
Main, D ;
Farrar, K ;
Henderson, C ;
Barnett, L ;
Costa, R ;
Williams, B ;
Walser, S ;
Atkins, M ;
Hall, C ;
Budiman, MA ;
Tomkins, JP ;
Luo, MZ ;
Bancroft, I ;
Salse, J ;
Regad, F ;
Mohapatra, T ;
Singh, NK ;
Tyagi, AK ;
Soderlund, C ;
Dean, RA ;
Wing, RA .
PLANT CELL, 2002, 14 (03) :537-545
[4]
Genetic definition and sequence analysis of Arabidopsis centromeres [J].
Copenhaver, GP ;
Nickel, K ;
Kuromori, T ;
Benito, MI ;
Kaul, S ;
Lin, XY ;
Bevan, M ;
Murphy, G ;
Harris, B ;
Parnell, LD ;
McCombie, WR ;
Martienssen, RA ;
Marra, M ;
Preuss, D .
SCIENCE, 1999, 286 (5449) :2468-2474
[5]
Rice (Oryza sativa) centromeric regions consist of complex DNA [J].
Dong, FG ;
Miller, JT ;
Jackson, SA ;
Wang, GL ;
Ronald, PC ;
Jiang, JM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (14) :8135-8140
[6]
Base-calling of automated sequencer traces using phred. II. Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[7]
Comparative genetics in the grasses [J].
Gale, MD ;
Devos, KM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (05) :1971-1974
[8]
A draft sequence of the rice genome (Oryza sativa L. ssp japonica) [J].
Goff, SA ;
Ricke, D ;
Lan, TH ;
Presting, G ;
Wang, RL ;
Dunn, M ;
Glazebrook, J ;
Sessions, A ;
Oeller, P ;
Varma, H ;
Hadley, D ;
Hutchinson, D ;
Martin, C ;
Katagiri, F ;
Lange, BM ;
Moughamer, T ;
Xia, Y ;
Budworth, P ;
Zhong, JP ;
Miguel, T ;
Paszkowski, U ;
Zhang, SP ;
Colbert, M ;
Sun, WL ;
Chen, LL ;
Cooper, B ;
Park, S ;
Wood, TC ;
Mao, L ;
Quail, P ;
Wing, R ;
Dean, R ;
Yu, YS ;
Zharkikh, A ;
Shen, R ;
Sahasrabudhe, S ;
Thomas, A ;
Cannings, R ;
Gutin, A ;
Pruss, D ;
Reid, J ;
Tavtigian, S ;
Mitchell, J ;
Eldredge, G ;
Scholl, T ;
Miller, RM ;
Bhatnagar, S ;
Adey, N ;
Rubano, T ;
Tusneem, N .
SCIENCE, 2002, 296 (5565) :92-100
[9]
Consed: A graphical tool for sequence finishing [J].
Gordon, D ;
Abajian, C ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :195-202
[10]
Harushima Y, 1998, GENETICS, V148, P479