Integrating alternative splicing detection into gene prediction

Cited by: 23
Authors
Foissac, S [1]
Schiex, T [1]
Affiliation
[1] INRA, Unite Biometrie & Intelligence Artificielle, F-31326 Castanet Tolosan, France
DOI
10.1186/1471-2105-6-25
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Alternative splicing (AS) is now considered a major contributor to transcriptome/proteome diversity, and it cannot be neglected when annotating a new genome. Despite considerable progress in the accuracy of computational gene prediction, reliably predicting AS variants where there is local experimental evidence for them remains an open challenge for gene finders. Results: We have used a new integrative approach that incorporates AS detection into ab initio gene prediction. The method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs) and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the transcripts that carry evidence of alternative splicing events and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those consistent with the detected AS events). A single gene can thus receive multiple annotations, with each predicted variant supported by transcript evidence (though not necessarily with full-length coverage). Conclusions: This automatic combination of experimental data analysis and ab initio gene finding integrates the prediction of alternatively spliced genes into a single annotation pipeline.
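The detection step described in the abstract, finding aligned transcripts that carry evidence of alternative splicing, can be illustrated with a minimal Python sketch. This is not the EuGENE implementation: it assumes that each transcript alignment is given as a sorted list of inclusive (start, end) exon coordinates on the genome, and it uses a simplified criterion in which two transcripts are treated as incompatible when an exon of one overlaps an intron of the other. The function names (`introns`, `conflict`, `as_evidence`) and the example coordinates are hypothetical.

```python
from itertools import combinations


def introns(exons):
    """Introns implied by a sorted list of inclusive (start, end) exons."""
    return [(a_end + 1, b_start - 1)
            for (_, a_end), (b_start, _) in zip(exons, exons[1:])]


def overlaps(a, b):
    """True if two inclusive intervals share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


def conflict(t1, t2):
    """Simplified incompatibility test (an assumption, not EuGENE's rule):
    an exon of one transcript overlaps an intron of the other."""
    return (any(overlaps(e, i) for e in t1 for i in introns(t2))
            or any(overlaps(e, i) for e in t2 for i in introns(t1)))


def as_evidence(transcripts):
    """Names of transcripts involved in at least one pairwise conflict."""
    flagged = set()
    for (n1, t1), (n2, t2) in combinations(transcripts.items(), 2):
        if conflict(t1, t2):
            flagged.update((n1, n2))
    return flagged


# Hypothetical alignments: est3 skips the exon at 300-400 used by est1/est2;
# est4 lies at a different locus and conflicts with nothing.
transcripts = {
    "est1": [(100, 200), (300, 400)],
    "est2": [(100, 200), (300, 400), (500, 600)],
    "est3": [(100, 200), (500, 600)],
    "est4": [(1000, 1100), (1200, 1300)],
}

print(sorted(as_evidence(transcripts)))
```

In this toy setting the flagged transcripts would be the ones handed to the gene finder as constraints for alternative predictions; how EuGENE actually scores and enforces those constraints inside its dynamic programming is not specified in the abstract and is not modeled here.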
Pages: 10