Infrastructure for the life sciences: design and implementation of the UniProt website

被引:371
作者
Jain, Eric [1 ]
Bairoch, Amos [1 ,2 ]
Duvaud, Severine [1 ]
Phan, Isabelle [1 ]
Redaschi, Nicole [1 ]
Suzek, Baris E. [4 ]
Martin, Maria J. [3 ]
McGarvey, Peter [4 ]
Gasteiger, Elisabeth [1 ]
机构
[1] CMU, Swiss Inst Bioinformat, Swiss Prot Grp, CH-1211 Geneva 4, Switzerland
[2] Univ Geneva, Dept Struct Biol & Bioinformat, Fac Med, CH-1211 Geneva 4, Switzerland
[3] EMBL Outstn European Bioinformat Inst, Cambridge CB10 1SD, England
[4] Georgetown Univ, Med Ctr, Washington, DC 20007 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
基金
美国国家卫生研究院;
关键词
FASTA Format; Sequence Similarity Search; European Bioinformatics Institute; Search Form; Protein Information Resource;
D O I
10.1186/1471-2105-10-136
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The UniProt consortium was formed in 2002 by groups from the Swiss Institute of Bioinformatics ( SIB), the European Bioinformatics Institute (EBI) and the Protein Information Resource (PIR) at Georgetown University, and soon afterwards the website http://www.uniprot.org was set up as a central entry point to UniProt resources. Requests to this address were redirected to one of the three organisations' websites. While these sites shared a set of static pages with general information about UniProt, their pages for searching and viewing data were different. To provide users with a consistent view and to cut the cost of maintaining three separate sites, the consortium decided to develop a common website for UniProt. Following several years of intense development and a year of public beta testing, the http://www.uniprot.org domain was switched to the newly developed site described in this paper in July 2008. Description: The UniProt consortium is the main provider of protein sequence and annotation data for much of the life sciences community. The http://www.uniprot.org website is the primary access point to this data and to documentation and basic tools for the data. These tools include full text and field-based text search, similarity search, multiple sequence alignment, batch retrieval and database identifier mapping. This paper discusses the design and implementation of the new website, which was released in July 2008, and shows how it improves data access for users with different levels of experience, as well as to machines for programmatic access. http://www.uniprot.org/ is open for both academic and commercial use. The site was built with open source tools and libraries. Feedback is very welcome and should be sent to help@uniprot.org. Conclusion: The new UniProt website makes accessing and understanding UniProt easier than ever. The two main lessons learned are that getting the basics right for such a data provider website has huge benefits, but is not trivial and easy to underestimate, and that there is no substitute for using empirical data throughout the development process to decide on what is and what is not working for your users.
引用
收藏
页数:19
相关论文
共 9 条
[1]  
[Anonymous], 1999, A practical guide to usability testing
[2]   Swiss-Prot: Juggling between evolution and stability [J].
Bairoch, A ;
Boeckmann, B ;
Ferro, S ;
Gasteiger, E .
BRIEFINGS IN BIOINFORMATICS, 2004, 5 (01) :39-55
[3]   The Universal Protein Resource (UniProt) [J].
Bairoch, Amos ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Puy, Ghislaine Argoud ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
Saux, Virginie Bulliard-Le ;
decastro, Edouard ;
Ciampina, Luciane ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
David, Fabrice ;
Delbard, Gwennaelle ;
Dornevil, Dolnide ;
Duek-Roggli, Paula ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Feuermann, Marc ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gehant, Sebastian ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
Innocenti, Alessandro ;
James, Janet ;
Jain, Eric ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D190-D195
[4]  
Berners-Lee T., COOL URIS DONT CHANG
[5]  
Fielding R.T., 2000, ARCHITECTURAL STYLES
[6]  
Hoekman R., 2006, Designing the obvious: a common sense approach to web application design
[7]   UniSave: the UniProtKB Sequence/Annotation Version database [J].
Leinonen, R ;
Nardone, F ;
Zhu, WM ;
Apweiler, R .
BIOINFORMATICS, 2006, 22 (10) :1284-1285
[8]   Calling on a million minds for community annotation in WikiProteins [J].
Mons, Barend ;
Ashburner, Michael ;
Chichester, Christine ;
van Mulligen, Erik ;
Weeber, Marc ;
den Dunnen, Johan ;
van Ommen, Gert-Jan ;
Musen, Mark ;
Cockerill, Matthew ;
Hermjakob, Henning ;
Mons, Albert ;
Packer, Abel ;
Pacheco, Roberto ;
Lewis, Suzanna ;
Berkeley, Alfred ;
Melton, William ;
Barris, Nickolas ;
Wales, Jimmy ;
Meijssen, Gerard ;
Moeller, Erik ;
Roes, Peter Jan ;
Borner, Katy ;
Bairoch, Amos .
GENOME BIOLOGY, 2008, 9 (05)
[9]  
*SIT, SIT PROT 0 9