ArrayExpress update-simplifying data submissions

Cited by: 497
Authors
Kolesnikov, Nikolay [1]
Hastings, Emma [1]
Keays, Maria [1]
Melnichuk, Olga [1]
Tang, Y. Amy [1]
Williams, Eleanor [1]
Dylag, Miroslaw [1]
Kurbatova, Natalja [1]
Brandizi, Marco [1]
Burdett, Tony [1]
Megy, Karyn [1]
Pilicheva, Ekaterina [1]
Rustici, Gabriella [1, 2]
Tikhonov, Andrew [1]
Parkinson, Helen [1]
Petryszak, Robert [1]
Sarkans, Ugis [1]
Brazma, Alvis [1]
Affiliations
[1] EBI, EMBL, Hinxton CB10 1SD, Cambs, England
[2] Cambridge Syst Biol Ctr, Sch Biol Sci, Cambridge CB2 1QR, England
Funding
US National Institutes of Health (NIH)
Keywords
GENE-EXPRESSION DATA; MICROARRAY DATA; BIOINFORMATICS; ARCHIVE; MIAME
DOI
10.1093/nar/gku1057
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is an international functional genomics database at the European Bioinformatics Institute (EMBL-EBI) recommended by most journals as a repository for data supporting peer-reviewed publications. It contains data from over 7000 public sequencing and 42 000 array-based studies comprising over 1.5 million assays in total. The proportion of sequencing-based submissions has grown significantly over the last few years and has doubled in the last 18 months, whilst the rate of microarray submissions is growing slightly. All data in ArrayExpress are available in the MAGE-TAB format, which allows robust linking to data analysis and visualization tools and standardized analysis. The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold. In the near future, Annotare will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines. ArrayExpress is a stable and highly accessed resource. Our future tasks include automation of data flows and further integration with other EMBL-EBI resources for the representation of multi-omics data.
Pages: D1113-D1116
Page count: 4