Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data

Cited by: 1532
Authors
Kawahara, Yoshihiro [1]
de la Bastide, Melissa [2]
Hamilton, John P. [3]
Kanamori, Hiroyuki [1]
McCombie, W. Richard [2]
Ouyang, Shu [4]
Schwartz, David C. [5]
Tanaka, Tsuyoshi [1]
Wu, Jianzhong [1]
Zhou, Shiguo [5]
Childs, Kevin L. [3]
Davidson, Rebecca M. [3,6]
Lin, Haining [3,7]
Quesada-Ocampo, Lina [3]
Vaillancourt, Brieanne [3]
Sakai, Hiroaki [1]
Lee, Sung Shin [1]
Kim, Jungsok [1]
Numa, Hisataka [1]
Itoh, Takeshi [1]
Buell, C. Robin [3]
Matsumoto, Takashi [1]
Affiliations
[1] Natl Inst Agrobiol Sci, Agrogen Res Ctr, Tsukuba, Ibaraki 3058602, Japan
[2] CSHL, Cold Spring Harbor, NY 11723 USA
[3] Michigan State Univ, Dept Plant Biol, Plant Biol Labs, E Lansing, MI 48824 USA
[4] Perkin Elmer, Frederick, MD 21701 USA
[5] Univ Wisconsin, Lab Mol & Computat Genom, UW Biotechnol Ctr, Madison, WI 53706 USA
[6] Natl Jewish Hlth, Integrated Ctr Genes Environm & Hlth, Denver, CO USA
[7] Dupont Pioneer, Johnston, IA 50131 USA
Funding
National Science Foundation (USA);
Keywords
Oryza sativa; Nipponbare; Unified rice reference genome; Pseudomolecules; Minimum tiling path; Optical mapping; Genome re-sequencing; Next-generation sequencing; Single-nucleotide polymorphisms; Rice genome; Annotation; DNA; Association; Resolution; Alignment; Resource; Japonica; Indica
DOI
10.1186/1939-8433-6-4
Chinese Library Classification number
S3 [Agriculture (Agronomy)];
Discipline classification code
0901;
Abstract
Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group). The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community. A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies.
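The error rate quoted in the abstract can be checked with simple arithmetic. The sketch below is only an illustration: it assumes that "321 Mb" means 321 x 10^6 bases, and the function name `errors_per_10k` is hypothetical rather than anything taken from the paper.

```python
# Figures quoted in the abstract.
ERRORS_FOUND = 4886
# Assumption: "321 Mb" is read as 321 x 10^6 assembled bases.
ASSEMBLED_BASES = 321_000_000


def errors_per_10k(errors: int, bases: int) -> float:
    """Return the number of errors per 10,000 nucleotides."""
    return errors / bases * 10_000


rate = errors_per_10k(ERRORS_FOUND, ASSEMBLED_BASES)
print(f"{rate:.2f} errors per 10,000 nucleotides")
```

Under that assumption the result rounds to the 0.15 per 10,000 nucleotides reported in the abstract.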
Pages: 3 / 10
Page count: 10