Mapping biomedical concepts onto the human genome by mining literature on chromosomal aberrations

Cited by: 21
Authors
Van Vooren, Steven
Thienpont, Bernard
Menten, Bjorn
Speleman, Frank
De Moor, Bart
Vermeesch, Joris
Moreau, Yves
Affiliations
[1] Katholieke Univ Leuven, Dept Electrotech Engn, B-3001 Heverlee, Belgium
[2] Katholieke Univ Leuven Hosp, Ctr Human Genet, B-3000 Louvain, Belgium
[3] Ghent Univ Hosp, Ctr Genet Med, B-9000 Ghent, Belgium
DOI
10.1093/nar/gkm054
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Biomedical literature provides a rich but unstructured source of associations between chromosomal regions and biomedical concepts. By mining MEDLINE abstracts, we annotate the human genome at the level of cytogenetic bands. Our method creates a set of chromosomal aberration maps that associate cytogenetic bands with biomedical concepts from a variety of controlled vocabularies, including disease, dysmorphology, anatomy, development and Gene Ontology branches. The association between a band (e.g. 4p16.3) and a concept (e.g. microcephaly) is assessed by the statistical overrepresentation of this concept in the abstracts relating to this band. Our method is validated using existing genome annotation resources and known chromosomal aberration maps and is further illustrated through a case study on heart disease. Our chromosomal aberration maps provide diagnostic support to clinical geneticists, help cytogeneticists interpret and report cytogenetic findings, and support researchers interested in human gene function. The method is available as a web application, aBandApart, at http://www.esat.kuleuven.be/abandapart/.
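The abstract does not say which statistic is used to assess overrepresentation. As a minimal sketch only, one common choice is a one-sided hypergeometric test: given a corpus of N abstracts, K of which mention the concept, it gives the probability that at least k of the n abstracts linked to a band mention the concept by chance. The function name `hypergeom_tail` and all counts below are hypothetical and are not taken from the paper.

```python
from math import comb


def hypergeom_tail(k: int, K: int, n: int, N: int) -> float:
    """Return P(X >= k) for a hypergeometric draw.

    N: total abstracts in the corpus
    K: abstracts in the corpus that mention the concept
    n: abstracts linked to the cytogenetic band
    k: abstracts linked to the band that also mention the concept
    """
    upper = min(K, n)  # X cannot exceed either K or n
    total = comb(N, n)  # number of ways to pick the band's abstracts
    favourable = sum(
        comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1)
    )
    return favourable / total


# Hypothetical counts, for illustration only (not from the paper).
N = 1000  # abstracts in the corpus
K = 50    # abstracts mentioning the concept (e.g. microcephaly)
n = 20    # abstracts linked to the band (e.g. 4p16.3)
k = 6     # abstracts linked to the band that mention the concept

p_value = hypergeom_tail(k, K, n, N)
print(f"p-value for overrepresentation: {p_value:.3g}")
```

A small p-value would suggest that the concept appears in the band's abstracts more often than its corpus-wide frequency predicts. In practice, a correction for testing many band-concept pairs would also be needed; that step is omitted here.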
Pages: 2533-2543
Number of pages: 11