Inparanoid: a comprehensive database of eukaryotic orthologs

Cited by: 543
Authors
O'Brien, KP
Remm, M
Sonnhammer, ELL [1]
Affiliations
[1] Karolinska Inst, Ctr Genom & Bioinformat, S-17177 Stockholm, Sweden
[2] Univ Tartu, Inst Mol & Cell Biol, Dept Bioinformat, EE-50090 Tartu, Estonia
Keywords
DOI
10.1093/nar/gki107
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The Inparanoid eukaryotic ortholog database (http://inparanoid.cgb.ki.se/) is a collection of pairwise ortholog groups between 17 whole genomes: Anopheles gambiae, Caenorhabditis briggsae, Caenorhabditis elegans, Drosophila melanogaster, Danio rerio, Takifugu rubripes, Gallus gallus, Homo sapiens, Mus musculus, Pan troglodytes, Rattus norvegicus, Oryza sativa, Plasmodium falciparum, Arabidopsis thaliana, Escherichia coli, Saccharomyces cerevisiae and Schizosaccharomyces pombe. Complete proteomes for these genomes were derived from Ensembl and UniProt and compared pairwise using Blast, followed by a clustering step using the Inparanoid program. An Inparanoid cluster is seeded by a reciprocally best-matching ortholog pair, around which inparalogs (should they exist) are gathered independently, while outparalogs are excluded. The ortholog clusters can be searched on the website using Ensembl gene/protein or UniProt identifiers, annotation text or by Blast alignment against our protein datasets. The entire dataset can be downloaded, as can the Inparanoid program itself.
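The clustering idea in the abstract (a reciprocally best-matching seed pair, with inparalogs gathered around each seed independently) can be illustrated with a minimal Python sketch. This is not the Inparanoid program: the function names, the toy score dictionaries, the use of the mean of the two directional scores as the seed score, and the simplified inparalog rule (a same-species protein joins if it scores at least as high against the seed as the seed pair scores against each other) are all assumptions made for illustration. Confidence values, cluster merging and score cutoffs are omitted.

```python
def best_hits(scores):
    """Map each query protein to its (target, score) with the highest score."""
    best = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return best


def reciprocal_best_pairs(a_vs_b, b_vs_a):
    """Return sorted (a, b, seed_score) for pairs that are each other's best hit."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    pairs = []
    for a, (b, score_ab) in best_ab.items():
        if b in best_ba and best_ba[b][0] == a:
            # Assumption: seed score is the mean of the two directional scores.
            pairs.append((a, b, (score_ab + best_ba[b][1]) / 2))
    return sorted(pairs)


def gather_inparalogs(seed, seed_score, within_scores):
    """Same-species proteins at least as similar to the seed as its ortholog is."""
    return sorted(
        other
        for (query, other), score in within_scores.items()
        if query == seed and other != seed and score >= seed_score
    )


def cluster(a_vs_b, b_vs_a, a_vs_a, b_vs_b):
    """Build one cluster per seed pair; each side is gathered independently."""
    return [
        {
            "seed": (a, b),
            "inparalogs_a": gather_inparalogs(a, score, a_vs_a),
            "inparalogs_b": gather_inparalogs(b, score, b_vs_b),
        }
        for a, b, score in reciprocal_best_pairs(a_vs_b, b_vs_a)
    ]


# Hypothetical similarity scores keyed by (query, target).
a_vs_b = {("a1", "b1"): 500, ("a1", "b2"): 200, ("a2", "b1"): 450, ("a3", "b2"): 300}
b_vs_a = {("b1", "a1"): 500, ("b1", "a2"): 450, ("b2", "a3"): 300, ("b2", "a1"): 200}
a_vs_a = {("a1", "a2"): 600, ("a1", "a3"): 100}
b_vs_b = {("b1", "b2"): 150}

clusters = cluster(a_vs_b, b_vs_a, a_vs_a, b_vs_b)
for c in clusters:
    print(c)
```

In this toy data, a2 joins the a1/b1 cluster as an inparalog because it is closer to a1 than a1 is to b1, while a3 is not, and instead seeds its own cluster with b2.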
Pages: D476-D480
Number of pages: 5
Related papers
10 in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Apweiler, R, et al.
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D115, DOI 10.1093/nar/gkh131
  • [3] Ensembl 2004
    Birney, E
    Andrews, D
    Bevan, P
    Caccamo, M
    Cameron, G
    Chen, Y
    Clarke, L
    Coates, G
    Cox, T
    Cuff, J
    Curwen, V
    Cutts, T
    Down, T
    Durbin, R
    Eyras, E
    Fernandez-Suarez, XM
    Gane, P
    Gibbins, B
    Gilbert, J
    Hammond, M
    Hotz, H
    Iyer, V
    Kahari, A
    Jekosch, K
    Kasprzyk, A
    Keefe, D
    Keenan, S
    Lehvaslaiho, H
    McVicker, G
    Melsopp, C
    Meidl, P
    Mongin, E
    Pettett, R
    Potter, S
    Proctor, G
    Rae, M
    Searle, S
    Slater, G
    Smedley, D
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Storey, R
    Ureta-Vidal, A
    Woodwark, C
    Clamp, M
    Hubbard, T
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D468 - D470
  • [4] The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    Boeckmann, B
    Bairoch, A
    Apweiler, R
    Blatter, MC
    Estreicher, A
    Gasteiger, E
    Martin, MJ
    Michoud, K
    O'Donovan, C
    Phan, I
    Pilbout, S
    Schneider, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 365 - 370
  • [5] Genome sequence of the human malaria parasite Plasmodium falciparum
    Gardner, MJ
    Hall, N
    Fung, E
    White, O
    Berriman, M
    Hyman, RW
    Carlton, JM
    Pain, A
    Nelson, KE
    Bowman, S
    Paulsen, IT
    James, K
    Eisen, JA
    Rutherford, K
    Salzberg, SL
    Craig, A
    Kyes, S
    Chan, MS
    Nene, V
    Shallom, SJ
    Suh, B
    Peterson, J
    Angiuoli, S
    Pertea, M
    Allen, J
    Selengut, J
    Haft, D
    Mather, MW
    Vaidya, AB
    Martin, DMA
    Fairlamb, AH
    Fraunholz, MJ
    Roos, DS
    Ralph, SA
    McFadden, GI
    Cummings, LM
    Subramanian, GM
    Mungall, C
    Venter, JC
    Carucci, DJ
    Hoffman, SL
    Newbold, C
    Davis, RW
    Fraser, CM
    Barrell, B
    [J]. NATURE, 2002, 419 (6906) : 498 - 511
  • [6] OrthoDisease: A database of human disease orthologs
    O'Brien, KP
    Westerlund, I
    Sonnhammer, ELL
    [J]. HUMAN MUTATION, 2004, 24 (02) : 112 - 119
  • [7] Automatic clustering of orthologs and in-paralogs from pairwise species comparisons
    Remm, M
    Storm, CEV
    Sonnhammer, ELL
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 314 (05) : 1041 - 1052
  • [8] Orthology, paralogy and proposed classification for paralog subtypes
    Sonnhammer, ELL
    Koonin, EV
    [J]. TRENDS IN GENETICS, 2002, 18 (12) : 619 - 620
  • [9] The bioperl toolkit: Perl modules for the life sciences
    Stajich, JE
    Block, D
    Boulez, K
    Brenner, SE
    Chervitz, SA
    Dagdigian, C
    Fuellen, G
    Gilbert, JGR
    Korf, I
    Lapp, H
    Lehväslaiho, H
    Matsalla, C
    Mungall, CJ
    Osborne, BI
    Pocock, MR
    Schattner, P
    Senger, M
    Stein, LD
    Stupka, E
    Wilkinson, MD
    Birney, E
    [J]. GENOME RESEARCH, 2002, 12 (10) : 1611 - 1618
  • [10] The TIGR rice genome annotation resource: annotating the rice genome and creating resources for plant biologists
    Yuan, QP
    Ouyang, S
    Liu, J
    Suh, B
    Cheung, F
    Sultana, R
    Lee, D
    Quackenbush, J
    Buell, CR
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 229 - 233