Comparative genomics as a tool for gene discovery

被引:27
作者
Windsor, AJ [1 ]
Mitchell-Olds, T [1 ]
机构
[1] Max Planck Inst Chem Oekol, Abt Genet & Evolut, D-07745 Jena, Germany
关键词
D O I
10.1016/j.copbio.2006.01.007
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the increasing availability of data from multiple eukaryotic genome sequencing projects, attention has focused on interspecific comparisons to discover novel genes and transcribed genomic sequences. Generally, these extrinsic strategies combine ab initio gene prediction with expression and/or homology data to identify conserved gene candidates between two or more genomes. Interspecific sequence analyses have proven invaluable for the improvement of existing annotations, automation of annotation, and identification of novel coding regions and splice variants. Further, comparative genomic approaches hold the promise of improved prediction of terminal or small exons, microRNA precursors, and small peptide-encoding open reading frames - sequence elements that are difficult to identify through purely intrinsic methodologies in the absence of experimental data.
引用
收藏
页码:161 / 167
页数:7
相关论文
共 52 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   SLAM: Cross-species gene finding and alignment with a generalized pair hidden Markov model [J].
Alexandersson, M ;
Cawley, S ;
Pachter, L .
GENOME RESEARCH, 2003, 13 (03) :496-502
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   Adaptive evolution of non-coding DNA in Drosophila [J].
Andolfatto, P .
NATURE, 2005, 437 (7062) :1149-1152
[5]  
[Anonymous], 1998, SCIENCE, V282, P2012
[6]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[7]   Whole genome shotgun sequencing of Brassica oleracea and its application to gene discovery and annotation in Arabidopsis [J].
Ayele, M ;
Haas, BJ ;
Kumar, N ;
Wu, H ;
Xiao, YL ;
Van Aken, S ;
Utterback, TR ;
Wortman, JR ;
White, OR ;
Town, CD .
GENOME RESEARCH, 2005, 15 (04) :487-495
[8]   Phylogenetic shadowing and computational identification of human microRNA genes [J].
Berezikov, E ;
Guryev, V ;
van de Belt, J ;
Wienholds, E ;
Plasterk, RHA ;
Cuppen, E .
CELL, 2005, 120 (01) :21-24
[9]   Detection of 91 potential in plant conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes [J].
Bonnet, E ;
Wuyts, J ;
Rouzé, P ;
Van de Peer, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (31) :11511-11516
[10]   Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus [J].
Brendel, V ;
Xing, LQ ;
Zhu, W .
BIOINFORMATICS, 2004, 20 (07) :1157-1169