Sorghum genome sequencing by methylation filtration

Cited by: 111
Authors
Bedell, JA [1]
Budiman, MA
Nunberg, A
Citek, RW
Robbins, D
Jones, J
Flick, E
Rohlfing, T
Fries, J
Bradford, K
McMenamy, J
Smith, M
Holeman, H
Roe, BA
Wiley, G
Korf, IF
Rabinowicz, PD
Lakey, N
McCombie, WR
Jeddeloh, JA
Martienssen, RA
Affiliations
[1] Orion Genom, Bioinformat, St Louis, MO 63110 USA
[2] Orion Genom, Lib Construct, St Louis, MO USA
[3] Orion Genom, Sequencing, St Louis, MO USA
[4] Orion Genom, Biomarkers, St Louis, MO USA
[5] Univ Oklahoma, Dept Chem & Biochem, Norman, OK 73019 USA
[6] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[7] Inst Genom Res, Rockville, MD USA
[8] Orion Genom, Business, St Louis, MO USA
[9] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
DOI
10.1371/journal.pbio.0030013
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis.
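The coverage figures quoted in the abstract can be put side by side with a few lines of arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, 300 Mb is treated as an upper bound because the abstract says "less than 300 megabases", and the combined gene-space estimate assumes the 65% average length coverage applies uniformly to the tagged genes, which the abstract does not state.

```python
# Figures taken from the abstract; all names here are illustrative, not from the paper.
RAW_SEQUENCE_MB = 300.0          # upper bound: "less than 300 megabases"
GENOME_SIZE_MB = 735.0           # sorghum genome size quoted in the abstract
GENES_TAGGED_FRACTION = 0.96     # "96% of the genes have been sequence tagged"
MEAN_GENE_LENGTH_COVERED = 0.65  # "average coverage of 65% across their length"


def raw_fold_coverage(raw_mb: float, genome_mb: float) -> float:
    """Return raw sequence generated divided by genome size."""
    return raw_mb / genome_mb


fold = raw_fold_coverage(RAW_SEQUENCE_MB, GENOME_SIZE_MB)

# Assumption (not from the abstract): the 65% average applies evenly to every
# tagged gene, so the product approximates the share of gene space sampled.
gene_space_sampled = GENES_TAGGED_FRACTION * MEAN_GENE_LENGTH_COVERED

print(f"raw coverage upper bound: {fold:.2f}x")
print(f"approximate gene space sampled: {gene_space_sampled:.1%}")
```

Read this way, well under half a genome equivalent of raw sequence is associated with tags on nearly all genes, which is the contrast the abstract draws.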
Pages: 103-115
Number of pages: 13