The ensembl core software libraries

被引:83
作者
Stabenau, A
McVicker, G
Melsopp, C
Proctor, G
Clamp, M
Birney, E
机构
[1] EMBL European Bioinformat Inst, Hinxton CB10 1SD, England
[2] Broad Inst, Cambridge, MA 02141 USA
基金
英国惠康基金;
关键词
D O I
10.1101/gr.1857204
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Systems for managing genomic data must store a Vast quantity of information. Ensembl stores these data in several MySQL databases. The core software libraries provide a practical and effective means for programmers to access these data. By encapsulating the underlying database structure, the libraries present end users with a simple, abstract interface to a complex data model. Programs that use the libraries rather than SQL to access the data are unaffected by most schema changes. The architecture of the core software libraries, the schema, and the factors influencing their design are described. All code and data are freely available.
引用
收藏
页码:929 / 933
页数:5
相关论文
共 14 条
[1]   Toucan:: deciphering the cis-regulatory logic of coregulated genes [J].
Aerts, S ;
Thijs, G ;
Coessens, B ;
Staes, M ;
Moreau, Y ;
Moor, BD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (06) :1753-1764
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   PlasmoDB:: the Plasmodium genome resource.: A database integrating experimental and computational data [J].
Bahl, A ;
Brunk, B ;
Crabtree, J ;
Fraunholz, MJ ;
Gajria, B ;
Grant, GR ;
Ginsburg, H ;
Gupta, D ;
Kissinger, JC ;
Labo, P ;
Li, L ;
Mailman, MD ;
Milgram, AJ ;
Pearson, DS ;
Roos, DS ;
Schug, J ;
Stoeckert, CJ ;
Whetzel, P .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :212-215
[4]   Connecting sequence and biology in the laboratory mouse [J].
Baldarelli, RM ;
Hill, DP ;
Blake, JA ;
Adachi, J ;
Furuno, M ;
Bradt, D ;
Corbani, LE ;
Cousins, S ;
Frazer, KS ;
Qi, D ;
Yang, LL ;
Ramachandran, S ;
Reed, D ;
Zhu, YX ;
Kasukawa, T ;
Ringwald, M ;
King, BL ;
Maltais, LJ ;
McKenzie, LM ;
Schriml, LM ;
Maglott, D ;
Church, DM ;
Pruitt, K ;
Eppig, JT ;
Richardson, JE ;
Kadin, JA ;
Bult, CJ .
GENOME RESEARCH, 2003, 13 (6B) :1505-1519
[5]   Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins [J].
Bateman, A ;
Birney, E ;
Durbin, R ;
Eddy, SR ;
Finn, RD ;
Sonnhammer, ELL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :260-262
[6]  
Eeckman FH, 1995, METHOD CELL BIOL, V48, P583
[7]   The UCSC Genome Browser Database [J].
Karolchik, D ;
Baertsch, R ;
Diekhans, M ;
Furey, TS ;
Hinrichs, A ;
Lu, YT ;
Roskin, KM ;
Schwartz, M ;
Sugnet, CW ;
Thomas, DJ ;
Weber, RJ ;
Haussler, D ;
Kent, WJ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :51-54
[8]  
MUNGALL C, 2002, GENOME BIOL, V3
[9]   The bioperl toolkit:: Perl modules for the life sciences [J].
Stajich, JE ;
Block, D ;
Boulez, K ;
Brenner, SE ;
Chervitz, SA ;
Dagdigian, C ;
Fuellen, G ;
Gilbert, JGR ;
Korf, I ;
Lapp, H ;
Lehväslaiho, H ;
Matsalla, C ;
Mungall, CJ ;
Osborne, BI ;
Pocock, MR ;
Schattner, P ;
Senger, M ;
Stein, LD ;
Stupka, E ;
Wilkinson, MD ;
Birney, E .
GENOME RESEARCH, 2002, 12 (10) :1611-1618
[10]   WormBase:: network access to the genome and biology of Caenorhabditis elegans [J].
Stein, L ;
Sternberg, P ;
Durbin, R ;
Thierry-Mieg, J ;
Spieth, J .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :82-86