Linking entries in protein interaction database to structured text: The FEBS Letters experiment

被引:48
作者
Ceol, Arnaud [1 ]
Chatr-Aryamontri, Andrew [1 ]
Licata, Luana [1 ]
Cesareni, Gianni [1 ,2 ]
机构
[1] Univ Rome, Dept Biol, Rome, Italy
[2] IRCCS, Fdn Santa Lucia, I-00143 Rome, Italy
关键词
protein interaction; database; information extraction; network;
D O I
10.1016/j.febslet.2008.02.071
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The corpus of the scientific literature has reached such size that a lot of useful data, dispersed throughout millions different articles, are now hard to recover. For instance, many articles in the biological domain describe relationships between entities (gene, proteins, small molecules, etc.) yet this crucial information cannot be efficiently used because of the difficulties in retrieving it automatically from unstructured text. Databases are striving to capture this valuable information and to organize it in a structured format ready for automatic analysis. However, the current database model, based on manual curation, is not sustainable because the limited support is not compatible with complete and accurate coverage of published information. Several proposals have been put forward to increase the efficiency and accuracy of the curation process. Here we present an experiment, designed by the editorial board of FEBS Letters, aimed at integrating each manuscript with a structured summary precisely reporting, with database identifiers and predefined controlled vocabularies, the protein interactions reported in the manuscript. The authors play an important role in this process as they are requested to provide structured information to be appended, in the form of human-readable paragraphs, at the end of traditional summaries. It is envisaged that the structured text will become an integral part of Medline abstracts. In 6 months time the experience gained with this experiment will form the basis for a community discussion to propose a widely accepted strategy for information storage and retrieval. (C) 2008 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1171 / 1177
页数:7
相关论文
共 19 条
  • [1] The Biomolecular Interaction Network Database and related tools 2005 update
    Alfarano, C
    Andrade, CE
    Anthony, K
    Bahroos, N
    Bajec, M
    Bantoft, K
    Betel, D
    Bobechko, B
    Boutilier, K
    Burgess, E
    Buzadzija, K
    Cavero, R
    D'Abreo, C
    Donaldson, I
    Dorairajoo, D
    Dumontier, MJ
    Dumontier, MR
    Earles, V
    Farrall, R
    Feldman, H
    Garderman, E
    Gong, Y
    Gonzaga, R
    Grytsan, V
    Gryz, E
    Gu, V
    Haldorsen, E
    Halupa, A
    Haw, R
    Hrvojic, A
    Hurrell, L
    Isserlin, R
    Jack, F
    Juma, F
    Khan, A
    Kon, T
    Konopinsky, S
    Le, V
    Lee, E
    Ling, S
    Magidin, M
    Moniakis, J
    Montojo, J
    Moore, S
    Muskat, B
    Ng, I
    Paraiso, JP
    Parker, B
    Pintilie, G
    Pirone, R
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D418 - D424
  • [2] The Universal Protein Resource (UniProt)
    Bairoch, Amos
    Bougueleret, Lydie
    Altairac, Severine
    Amendolia, Valeria
    Auchincloss, Andrea
    Puy, Ghislaine Argoud
    Axelsen, Kristian
    Baratin, Delphine
    Blatter, Marie-Claude
    Boeckmann, Brigitte
    Bollondi, Laurent
    Boutet, Emmanuel
    Quintaje, Silvia Braconi
    Breuza, Lionel
    Bridge, Alan
    Saux, Virginie Bulliard-Le
    decastro, Edouard
    Ciampina, Luciane
    Coral, Danielle
    Coudert, Elisabeth
    Cusin, Isabelle
    David, Fabrice
    Delbard, Gwennaelle
    Dornevil, Dolnide
    Duek-Roggli, Paula
    Duvaud, Severine
    Estreicher, Anne
    Famiglietti, Livia
    Farriol-Mathis, Nathalie
    Ferro, Serenella
    Feuermann, Marc
    Gasteiger, Elisabeth
    Gateau, Alain
    Gehant, Sebastian
    Gerritsen, Vivienne
    Gos, Arnaud
    Gruaz-Gumowski, Nadine
    Hinz, Ursula
    Hulo, Chantal
    Hulo, Nicolas
    Innocenti, Alessandro
    James, Janet
    Jain, Eric
    Jimenez, Silvia
    Jungo, Florence
    Junker, Vivien
    Keller, Guillaume
    Lachaize, Corinne
    Lane-Guermonprez, Lydie
    Langendijk-Genevaux, Petra
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D190 - D195
  • [3] The BioGRID interaction database:: 2008 update
    Breitkreutz, Bobby-Joe
    Stark, Chris
    Reguly, Teresa
    Boucher, Lorrie
    Breitkreutz, Ashton
    Livstone, Michael
    Oughtred, Rose
    Lackner, Daniel H.
    Bahler, Jurg
    Wood, Valerie
    Dolinski, Kara
    Tyers, Mike
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D637 - D640
  • [4] MINT: the molecular INTeraction database
    Chatr-aryamontri, Andrew
    Ceol, Arnaud
    Palazzi, Luisa Montecchi
    Nardelli, Giuliano
    Schneider, Maria Victoria
    Castagnoli, Luisa
    Cesareni, Gianni
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D572 - D574
  • [5] Gene name ambiguity of eukaryotic nomenclatures
    Chen, LF
    Liu, HF
    Friedman, C
    [J]. BIOINFORMATICS, 2005, 21 (02) : 248 - 256
  • [6] The Ontology Lookup Service, a lightweight cross-platform tool for controlled vocabulary queries
    Côté, RG
    Jones, P
    Apweiler, R
    Hermjakob, H
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [7] Structured digital abstract makes text mining easy
    Gerstein, Mark
    Seringhaus, Michael
    Fields, Stanley
    [J]. NATURE, 2007, 447 (7141) : 142 - 142
  • [8] Text mining: powering the database revolution
    Hahn, Udo
    Wermter, Joachim
    Blasczyk, Rainer
    Horn, Peter A.
    [J]. NATURE, 2007, 448 (7150) : 130 - 130
  • [9] The Gene Ontology (GO) database and informatics resource
    Harris, MA
    Clark, J
    Ireland, A
    Lomax, J
    Ashburner, M
    Foulger, R
    Eilbeck, K
    Lewis, S
    Marshall, B
    Mungall, C
    Richter, J
    Rubin, GM
    Blake, JA
    Bult, C
    Dolan, M
    Drabkin, H
    Eppig, JT
    Hill, DP
    Ni, L
    Ringwald, M
    Balakrishnan, R
    Cherry, JM
    Christie, KR
    Costanzo, MC
    Dwight, SS
    Engel, S
    Fisk, DG
    Hirschman, JE
    Hong, EL
    Nash, RS
    Sethuraman, A
    Theesfeld, CL
    Botstein, D
    Dolinski, K
    Feierbach, B
    Berardini, T
    Mundodi, S
    Rhee, SY
    Apweiler, R
    Barrell, D
    Camon, E
    Dimmer, E
    Lee, V
    Chisholm, R
    Gaudet, P
    Kibbe, W
    Kishore, R
    Schwarz, EM
    Sternberg, P
    Gwinn, M
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D258 - D261
  • [10] IntAct: an open source molecular interaction database
    Hermjakob, H
    Montecchi-Palazzi, L
    Lewington, C
    Mudali, S
    Kerrien, S
    Orchard, S
    Vingron, M
    Roechert, B
    Roepstorff, P
    Valencia, A
    Margalit, H
    Armstrong, J
    Bairoch, A
    Cesareni, G
    Sherman, D
    Apweller, R
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D452 - D455