GENCODE reference annotation for the human and mouse genomes

被引:2040
作者
Frankish, Adam [1 ]
Diekhans, Mark [2 ]
Ferreira, Anne-Maud [3 ]
Johnson, Rory [4 ,5 ]
Jungreis, Irwin [6 ,7 ]
Loveland, Jane [1 ]
Mudge, Jonathan M. [1 ]
Sisu, Cristina [8 ,9 ]
Wright, James [10 ]
Armstrong, Joel [2 ]
Barnes, If [1 ]
Berry, Andrew [1 ]
Bignell, Alexandra [1 ]
Sala, Silvia Carbonell [11 ]
Chrast, Jacqueline [3 ]
Cunningham, Fiona [1 ]
Di Domenico, Tomas [12 ]
Donaldson, Sarah [1 ]
Fiddes, Ian T. [2 ]
Giron, Carlos Garcia [1 ]
Gonzalez, Jose Manuel [1 ]
Grego, Tiago [1 ]
Hardy, Matthew [1 ]
Hourlier, Thibaut [1 ]
Hunt, Toby [1 ]
Izuogu, Osagie G. [1 ]
Lagarde, Julien [11 ]
Martin, Fergal J. [1 ]
Martinez, Laura [12 ]
Mohanan, Shamika [1 ]
Muir, Paul [13 ,14 ]
Navarro, Fabio C. P. [8 ]
Parker, Anne [1 ]
Pei, Baikang [8 ]
Pozo, Fernando [12 ]
Ruffier, Magali [1 ]
Schmitt, Bianca M. [1 ]
Stapleton, Eloise [1 ]
Suner, Marie-Marthe [1 ]
Sycheva, Irina [1 ]
Uszczynska-Ratajczak, Barbara [15 ]
Xu, Jinuri [8 ]
Yates, Andrew [1 ]
Zerbino, Daniel [1 ]
Zhang, Yan [8 ,16 ]
Aken, Bronwen [1 ]
Choudhary, Jyoti S. [10 ]
Gerstein, Mark [8 ,17 ,18 ]
Guigo, Roderic [11 ,19 ]
Hubbard, Tim J. P. [20 ]
机构
[1] European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England
[2] Univ Calif Santa Cruz, UC Santa Cruz Genom Inst, Santa Cruz, CA 95064 USA
[3] Univ Lausanne, CTr Integrat Genom, CH-1015 Lausanne, Switzerland
[4] Univ Bern, Univ Hosp, Inselspital, Dept Med Oncol, Bern, Switzerland
[5] Univ Bern, Dept Biomed Res DBMR, Bern, Switzerland
[6] MIT, Comp Sci & Artificial Intelligence Lab, 32 Vasser St, Cambridge, MA 02139 USA
[7] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
[8] Yale Univ, Dept Mol Biophys & Biochem, POB 6666, New Haven, CT 06520 USA
[9] Brunel Univ London, Dept Biosci, Uxbridge UB8 3PH, Middx, England
[10] Inst Canc Res, Div Canc Biol, Funct Prote, 123 Old Brompton Rd, London SW7 3RP, England
[11] Barcelona Inst Sci & Technol, CRG, Dr Aiguader 88, E-08003 Barcelona, Spain
[12] Spanish Natl Canc Res Ctr CNIO, Bioinformat Unit, Madrid, Spain
[13] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[14] Yale Univ, Syst Biol Inst, West Haven, CT 06516 USA
[15] Univ Warsaw, Ctr New Technol, Warsaw, Poland
[16] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
[17] Yale Univ, Program Computat Biol & Bioinformat, Bass 432,266 Whitney Ave, New Haven, CT 06520 USA
[18] Yale Univ, Dept Comp Sci, Bass 432,266 Whitney Ave, New Haven, CT 06520 USA
[19] UPF, E-08003 Barcelona, Catalonia, Spain
[20] Guys Hosp, Kings Coll London, Dept Med & Mol Genet, London SE1 9RT, England
基金
英国惠康基金; 英国生物技术与生命科学研究理事会; 美国国家卫生研究院; 瑞士国家科学基金会;
关键词
LONG NONCODING RNAS; DNA ELEMENTS; PROTEIN; GENES; IDENTIFICATION; ENCYCLOPEDIA; PRINCIPAL; DATABASE; PROGRAM; CATALOG;
D O I
10.1093/nar/gky955
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.
引用
收藏
页码:D766 / D773
页数:8
相关论文
共 48 条
[1]
Loose ends: almost one in five human genes still have unresolved coding status [J].
Abascal, Federico ;
Juan, David ;
Jungreis, Irwin ;
Martinez, Laura ;
Rigau, Maria ;
Manuel Rodriguez, Jose ;
Vazquez, Jesus ;
Tress, Michael L. .
NUCLEIC ACIDS RESEARCH, 2018, 46 (14) :7070-7084
[2]
Genetic effects on gene expression across human tissues [J].
Aguet, Francois ;
Brown, Andrew A. ;
Castel, Stephane E. ;
Davis, Joe R. ;
He, Yuan ;
Jo, Brian ;
Mohammadi, Pejman ;
Park, Yoson ;
Parsana, Princy ;
Segre, Ayellet V. ;
Strober, Benjamin J. ;
Zappala, Zachary ;
Cummings, Beryl B. ;
Gelfand, Ellen T. ;
Hadley, Kane ;
Huang, Katherine H. ;
Lek, Monkol ;
Li, Xiao ;
Nedzel, Jared L. ;
Nguyen, Duyen Y. ;
Noble, Michael S. ;
Sullivan, Timothy J. ;
Tukiainen, Taru ;
MacArthur, Daniel G. ;
Getz, Gad ;
Management, Nih Program ;
Addington, Anjene ;
Guan, Ping ;
Koester, Susan ;
Little, A. Roger ;
Lockhart, Nicole C. ;
Moore, Helen M. ;
Rao, Abhi ;
Struewing, Jeffery P. ;
Volpi, Simona ;
Collection, Biospecimen ;
Brigham, Lori E. ;
Hasz, Richard ;
Hunter, Marcus ;
Johns, Christopher ;
Johnson, Mark ;
Kopen, Gene ;
Leinweber, William F. ;
Lonsdale, John T. ;
McDonald, Alisa ;
Mestichelli, Bernadette ;
Myer, Kevin ;
Roe, Bryan ;
Salvatore, Michael ;
Shad, Saboor .
NATURE, 2017, 550 (7675) :204-+
[3]
The Ensembl gene annotation system [J].
Aken, Bronwen L. ;
Ayling, Sarah ;
Barrell, Daniel ;
Clarke, Laura ;
Curwen, Valery ;
Fairley, Susan ;
Banet, Julio Fernandez ;
Billis, Konstantinos ;
Giron, Carlos Garcia ;
Hourlier, Thibaut ;
Howe, Kevin ;
Kahari, Andreas ;
Kokocinski, Felix ;
Martin, Fergal J. ;
Murphy, Daniel N. ;
Nag, Rishi ;
Ruffier, Magali ;
Schuster, Michael ;
Tang, Y. Amy ;
Vogel, Jan-Hinnerk ;
White, Simon ;
Zadissa, Amonida ;
Flicek, Paul ;
Searle, Stephen M. J. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[4]
BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
[6]
Retrocopy contributions to the evolution of the human genome [J].
Baertsch, Robert ;
Diekhans, Mark ;
Kent, W. James ;
Haussler, David ;
Brosius, Juergen .
BMC GENOMICS, 2008, 9 (1)
[7]
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[8]
The UCSC Genome Browser database: 2018 update [J].
Casper, Jonathan ;
Zweig, Ann S. ;
Villarreal, Chris ;
Tyner, Cath ;
Speir, Matthew L. ;
Rosenbloom, Kate R. ;
Raney, Brian J. ;
Lee, Christopher M. ;
Lee, Brian T. ;
Karolchik, Donna ;
Hinrichs, Angie S. ;
Haeussler, Maximilian ;
Guruvadoo, Luvina ;
Gonzalez, Jairo Navarro ;
Gibson, David ;
Fiddes, Ian T. ;
Eisenhart, Christopher ;
Diekhans, Mark ;
Clawson, Hiram ;
Barber, Galt P. ;
Armstrong, Joel ;
Haussler, David ;
Kuhn, Robert M. ;
Kent, W. James .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D762-D769
[9]
Consortium G. P., 2015, NATURE, V526, P68, DOI [DOI 10.1038/NATURE15393, 10.1038/nature15393]
[10]
The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression [J].
Derrien, Thomas ;
Johnson, Rory ;
Bussotti, Giovanni ;
Tanzer, Andrea ;
Djebali, Sarah ;
Tilgner, Hagen ;
Guernec, Gregory ;
Martin, David ;
Merkel, Angelika ;
Knowles, David G. ;
Lagarde, Julien ;
Veeravalli, Lavanya ;
Ruan, Xiaoan ;
Ruan, Yijun ;
Lassmann, Timo ;
Carninci, Piero ;
Brown, James B. ;
Lipovich, Leonard ;
Gonzalez, Jose M. ;
Thomas, Mark ;
Davis, Carrie A. ;
Shiekhattar, Ramin ;
Gingeras, Thomas R. ;
Hubbard, Tim J. ;
Notredame, Cedric ;
Harrow, Jennifer ;
Guigo, Roderic .
GENOME RESEARCH, 2012, 22 (09) :1775-1789