Identification and characterization of lineage-specific genes within the Poaceae

被引:63
作者
Campbell, Matthew A.
Zhu, Wei
Jiang, Ning
Lin, Haining
Ouyang, Shu
Childs, Kevin L.
Haas, Brian J.
Hamilton, John P.
Buell, C. Robin
机构
[1] J Craig Venter Inst, Inst Genom Res, Rockville, MD 20850 USA
[2] Michigan State Univ, Dept Hort, E Lansing, MI 48824 USA
基金
美国国家科学基金会;
关键词
D O I
10.1104/pp.107.104513
中图分类号
Q94 [植物学];
学科分类号
071001 [植物学];
摘要
Using the rice ( Oryza sativa) sp. japonica genome annotation, along with genomic sequence and clustered transcript assemblies from 184 species in the plant kingdom, we have identified a set of 861 rice genes that are evolutionarily conserved among six diverse species within the Poaceae yet lack significant sequence similarity with plant species outside the Poaceae. This set of evolutionarily conserved and lineage-specific rice genes is termed conserved Poaceae-specific genes (CPSGs) to reflect the presence of significant sequence similarity across three separate Poaceae subfamilies. The vast majority of rice CPSGs (86.6%) encode proteins with no putative function or functionally characterized protein domain. For the remaining CPSGs, 8.8% encode an F-box domain-containing protein and 4.5% encode a protein with a putative function. Onaverage, the CPSGs have fewer exons, shorter total gene length, and elevatedGC content when compared with genes annotated as either transposable elements (TEs) or those genes having significant sequence similarity in a species outside the Poaceae. Multiple sequence alignments of the CPSGs with sequences from other Poaceae species show conservation across a putative domain, a novel domain, or the entire coding length of the protein. At the genome level, syntenic alignments between sorghum ( Sorghum bicolor) and 103 of the 861 rice CPSGs (12.0%) could be made, demonstrating an additional level of conservation for this set of genes within the Poaceae. The extensive sequence similarity in evolutionarily distinct species within the Poaceae family and an additional screen for TE-related structural characteristics and sequence discounts these CPSGs as being misannotated TEs. Collectively, these data confirm that we have identified a specific set of genes that are highly conserved within, as well as specific to, the Poaceae.
引用
收藏
页码:1311 / 1322
页数:12
相关论文
共 64 条
[1]
Assaying gene content in Arabidopsis [J].
Allen, KD .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (14) :9568-9572
[2]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]
Automated de novo identification of repeat sequence families in sequenced genomes [J].
Bao, ZR ;
Eddy, SR .
GENOME RESEARCH, 2002, 12 (08) :1269-1276
[4]
Sorghum genome sequencing by methylation filtration [J].
Bedell, JA ;
Budiman, MA ;
Nunberg, A ;
Citek, RW ;
Robbins, D ;
Jones, J ;
Flick, E ;
Rohlfing, T ;
Fries, J ;
Bradford, K ;
McMenamy, J ;
Smith, M ;
Holeman, H ;
Roe, BA ;
Wiley, G ;
Korf, IF ;
Rabinowicz, PD ;
Lakey, N ;
McCombie, WR ;
Jeddeloh, JA ;
Martienssen, RA .
PLOS BIOLOGY, 2005, 3 (01) :103-115
[5]
Consistent over-estimation of gene number in complex plant genomes [J].
Bennetzen, JL ;
Coleman, C ;
Liu, RY ;
Ma, JX ;
Ramakrishna, W .
CURRENT OPINION IN PLANT BIOLOGY, 2004, 7 (06) :732-736
[6]
Intraspecies sequence comparisons for annotating genomes [J].
Boffelli, D ;
Weer, CV ;
Weng, L ;
Lewis, KD ;
Shoukry, MI ;
Pachter, L ;
Keys, DN ;
Rubin, EM .
GENOME RESEARCH, 2004, 14 (12) :2406-2411
[7]
Comparative genomics at the vertebrate extremes [J].
Boffelli, D ;
Nobrega, MA ;
Rubin, EM .
NATURE REVIEWS GENETICS, 2004, 5 (06) :456-465
[8]
Databases and information integration for the Medicago truncatula genome and transcriptome [J].
Cannon, SB ;
Crow, JA ;
Heuer, ML ;
Wang, XH ;
Cannon, EKS ;
Dwan, C ;
Lamblin, AF ;
Vasdewani, J ;
Mudge, J ;
Cook, A ;
Gish, J ;
Cheung, F ;
Kenton, S ;
Kunau, TM ;
Brown, D ;
May, GD ;
Kim, D ;
Cook, DR ;
Roe, BA ;
Town, CD ;
Young, ND ;
Retzel, EF .
PLANT PHYSIOLOGY, 2005, 138 (01) :38-46
[9]
The SCF ubiquitin ligase: Insights into a molecular machine [J].
Cardozo, T ;
Pagano, M .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2004, 5 (09) :739-751
[10]
Carels N, 2000, GENETICS, V154, P1819