Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation

被引:126
作者
Karro, John E.
Yan, Yangpan
Zheng, Deyou
Zhang, Zhaolei
Carriero, Nicholas
Cayting, Philip
Harrrison, Paul
Gerstein, Mark
机构
[1] Penn State Univ, Ctr Comparat Genomics & Bioinformat, University Pk, PA 16802 USA
[2] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[3] Univ Toronto, Donnelly CCBR, Banting & Best Dept Med Res, Toronto, ON M5S 3E1, Canada
[4] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[5] McGill Univ, Dept Biol, Montreal, PQ H3A 1B1, Canada
[6] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/nar/gkl851
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Pseudogene.org knowledgebase serves as a comprehensive repository for pseudogene annotation. The definition of a pseudogene varies within the literature, resulting in significantly different approaches to the problem of identification. Consequently, it is difficult to maintain a consistent collection of pseudogenes in detail necessary for their effective use. Our database is designed to address this issue. It integrates a variety of heterogeneous resources and supports a subset structure that highlights specific groups of pseudogenes that are of interest to the research community. Tools are provided for the comparison of sets and the creation of layered set unions, enabling researchers to derive a current 'consensus' set of pseudogenes. Additional features include versatile search, the capacity for robust interaction with other databases, the ability to reconstruct older versions of the database (accounting for changing genome builds) and an underlying object-oriented interface designed for researchers with a minimal knowledge of programming. At the present time, the database contains more than 100 000 pseudogenes spanning 64 prokaryote and 11 eukaryote genomes, including a collection of human annotations compiled from 16 sources.
引用
收藏
页码:D55 / D60
页数:6
相关论文
共 28 条
  • [1] HOPPSIGEN: a database of human and mouse processed pseudogenes
    Adel, K
    Laurent, D
    Dominique, M
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D59 - D66
  • [2] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
  • [3] Benson Dennis A, 2005, Nucleic Acids Res, V33, pD34
  • [4] Ensembl: A genome infrastructure
    Birney, E
    [J]. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 2003, 68 : 213 - 215
  • [5] Reevaluating human gene annotation: A second-generation analysis of chromosome 22
    Collins, JE
    Goward, ME
    Cole, CG
    Smink, LJ
    Huckle, EJ
    Knowles, S
    Bye, JM
    Beare, DM
    Dunham, I
    [J]. GENOME RESEARCH, 2003, 13 (01) : 27 - 36
  • [6] DENNIS W, 2003, ISWC BIOINFORMATICS
  • [7] Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability
    Harrison, PM
    Zheng, DY
    Zhang, ZL
    Carriero, N
    Gerstein, M
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 (08) : 2374 - 2383
  • [8] Identification of pseudogenes in the Drosophila melanogaster genome
    Harrison, PM
    Milburn, D
    Zhang, Z
    Bertone, P
    Gerstein, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (03) : 1033 - 1037
  • [9] Molecular fossils in the human genome: Identification and analysis of the pseudogenes in chromosomes 21 and 22
    Harrison, PM
    Hegyi, H
    Balasubramanian, S
    Luscombe, NM
    Bertone, P
    Echols, N
    Johnson, T
    Gerstein, M
    [J]. GENOME RESEARCH, 2002, 12 (02) : 272 - 280
  • [10] Ensembl 2005
    Hubbard, T
    Andrews, D
    Caccamo, M
    Cameron, G
    Chen, Y
    Clamp, M
    Clarke, L
    Coates, G
    Cox, T
    Cunningham, F
    Curwen, V
    Cutts, T
    Down, T
    Durbin, R
    Fernandez-Suarez, XM
    Gilbert, J
    Hammond, M
    Herrero, J
    Hotz, H
    Howe, K
    Iyer, V
    Jekosch, K
    Kahari, A
    Kasprzyk, A
    Keefe, D
    Keenan, S
    Kokocinsci, F
    London, D
    Longden, I
    McVicker, G
    Melsopp, C
    Meidl, P
    Potter, S
    Proctor, G
    Rae, M
    Rios, D
    Schuster, M
    Searle, S
    Severin, J
    Slater, G
    Smedley, D
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Storey, R
    Trevanion, S
    Ureta-Vidal, A
    Vogel, J
    White, S
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D447 - D453