OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization

被引:15
作者
Antanaviciute, Agne [1 ]
Watson, Christopher M. [1 ,2 ]
Harrison, Sally M. [1 ]
Lascelles, Carolina [1 ]
Crinnion, Laura [1 ,2 ]
Markham, Alexander F. [1 ]
Bonthron, David T. [1 ]
Carr, Ian M. [1 ]
机构
[1] Univ Leeds, Sch Med, Inst Biomed & Clin Sci, Genet Sect, Leeds LS2 9JT, W Yorkshire, England
[2] St James Univ Hosp, Yorkshire Reg Genet Serv, Leeds, W Yorkshire, England
基金
英国医学研究理事会;
关键词
SEMANTIC SIMILARITY; GENE ONTOLOGY; ENRICHMENT; RESOURCE; SEQUENCE; DISEASE; UPDATE; TOOL;
D O I
10.1093/bioinformatics/btv473
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Exome sequencing has become a de facto standard method for Mendelian disease gene discovery in recent years, yet identifying disease-causing mutations among thousands of candidate variants remains a non-trivial task. Results: Here we describe a new variant prioritization tool, OVA (ontology variant analysis), in which user-provided phenotypic information is exploited to infer deeper biological context. OVA combines a knowledge-based approach with a variant-filtering framework. It reduces the number of candidate variants by considering genotype and predicted effect on protein sequence, and scores the remainder on biological relevance to the query phenotype. We take advantage of several ontologies in order to bridge knowledge across multiple biomedical domains and facilitate computational analysis of annotations pertaining to genes, diseases, phenotypes, tissues and pathways. In this way, OVA combines information regarding molecular and physical phenotypes and integrates both human and model organism data to effectively prioritize variants. By assessing performance on both known and novel disease mutations, we show that OVA performs biologically meaningful candidate variant prioritization and can be more accurate than another recently published candidate variant prioritization tool.
引用
收藏
页码:3822 / 3829
页数:8
相关论文
共 42 条
[1]
SUSPECTS: enabling fast and effective prioritization of positional candidates [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BIOINFORMATICS, 2006, 22 (06) :773-774
[2]
Adzhubei Ivan, 2013, Curr Protoc Hum Genet, VChapter 7, DOI 10.1002/0471142905.hg0720s76
[3]
When to use the Bonferroni correction [J].
Armstrong, Richard A. .
OPHTHALMIC AND PHYSIOLOGICAL OPTICS, 2014, 34 (05) :502-508
[4]
Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]
An unbiased evaluation of gene prioritization tools [J].
Bornigen, Daniela ;
Tranchevent, Leon-Charles ;
Bonachela-Capdevila, Francisco ;
Devriendt, Koenraad ;
De Moor, Bart ;
De Causmaecker, Patrick ;
Moreau, Yves .
BIOINFORMATICS, 2012, 28 (23) :3081-3088
[6]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]
GPSy: a cross-species gene prioritization system for conserved biological processes-application in male gamete development [J].
Britto, Ramona ;
Sallou, Olivier ;
Collin, Olivier ;
Michaux, Gregoire ;
Primig, Michael ;
Chalmel, Frederic .
NUCLEIC ACIDS RESEARCH, 2012, 40 (W1) :W458-W465
[8]
mentha: a resource for browsing integrated protein-interaction networks [J].
Calderone, Alberto ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NATURE METHODS, 2013, 10 (08) :690-691
[9]
ToppGene Suite for gene list enrichment analysis and candidate gene prioritization [J].
Chen, Jing ;
Bardes, Eric E. ;
Aronow, Bruce J. ;
Jegga, Anil G. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W305-W311
[10]
Chen YT, 2011, PLOS ONE, V6, DOI [10.1371/journal.pone.0017876, 10.1371/journal.pone.0023237]