The Ensembl gene annotation system

Cited by: 523
Authors
Aken, Bronwen L. [1, 2]
Ayling, Sarah [2, 3]
Barrell, Daniel [1, 2, 4]
Clarke, Laura [2, 5]
Curwen, Valery [2]
Fairley, Susan [2, 5]
Banet, Julio Fernandez [2, 6]
Billis, Konstantinos [1, 2]
Giron, Carlos Garcia [1, 2]
Hourlier, Thibaut [1, 2]
Howe, Kevin [2, 5]
Kahari, Andreas [2, 7]
Kokocinski, Felix [2]
Martin, Fergal J. [1, 2]
Murphy, Daniel N. [1, 2]
Nag, Rishi [1, 2]
Ruffier, Magali [2, 5]
Schuster, Michael [1, 8]
Tang, Y. Amy [2, 5]
Vogel, Jan-Hinnerk [2, 9]
White, Simon [2, 10]
Zadissa, Amonida [2, 5]
Flicek, Paul [1, 2]
Searle, Stephen M. J. [2]
Affiliations
[1] European Bioinformat Inst Wellcome Genome Campus, European Mol Biol Lab, Cambridge CB10 1SD, England
[2] Wellcome Trust Sanger Inst Wellcome Genome Campus, Cambridge CB10 1SA, England
[3] Genome Anal Ctr, Norwich Res Pk, Norwich NR4 7UH, Norfolk, England
[4] Eagle Genom Ltd, Babraham Res Campus, Cambridge CB22 3AT, England
[5] European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England
[6] Pfizer Inc, 10646 Sci Ctr Dr, San Diego, CA 92121 USA
[7] Uppsala Univ, Inst Cell Molekylarbiol, Husargatan 3, S-75237 Uppsala, Sweden
[8] Austrian Acad Sci, CeMM Res Ctr Mol Med, A-1090 Vienna, Austria
[9] Genentech Inc, 1 DNA Way, San Francisco, CA 94080 USA
[10] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
Source
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2016
Funding
Biotechnology and Biological Sciences Research Council (UK); Wellcome Trust (UK);
Keywords
GENOME PROVIDES INSIGHTS; SEQUENCE; REVEALS; EVOLUTION; ALIGNMENT; DATABASE; ZEBRAFISH; IDENTIFICATION; COMPLEXITY; GENERATION;
DOI
10.1093/database/baw093
Chinese Library Classification number
Q [Biological sciences];
Discipline classification codes
07; 0710; 09;
Abstract
The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets. The system is based on the alignment of biological sequences, including cDNAs, proteins and RNA-seq reads, to the target genome in order to construct candidate transcript models. Careful assessment and filtering of these candidate transcripts ultimately leads to the final gene set, which is made available on the Ensembl website. Here, we describe the annotation process in detail.
Pages: 19