Gene prioritization through genomic data fusion

被引:685
作者
Aerts, S
Lambrechts, D
Maity, S
Van Loo, P
Coessens, B
De Smet, F
Tranchevent, LC
De Moor, B
Marynen, P
Hassan, B
Carmeliet, P
Moreau, Y
机构
[1] Univ Leuven VIB, Neurogenet Lab, Dept Human Genet, B-3000 Louvain, Belgium
[2] Univ Leuven VIB, Ctr Transgene Technol & Gene Therapy, B-3000 Louvain, Belgium
[3] Univ Leuven VIB, Human Genome Lab, Dept Human Genet, B-3000 Louvain, Belgium
[4] Univ Leuven, ESAT SCD, Bioinformat Grp, Dept Elect Engn, Louvain, Belgium
关键词
D O I
10.1038/nbt1203
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 [微生物学]; 0836 [生物工程]; 090102 [作物遗传育种]; 100705 [微生物与生化药学];
摘要
The identification of genes involved in health and disease remains a challenge. We describe a bioinformatics approach, together with a freely accessible, interactive and flexible software termed Endeavour, to prioritize candidate genes underlying biological processes or diseases, based on their similarity to known genes involved in these phenomena. Unlike previous approaches, ours generates distinct prioritizations for multiple heterogeneous data sources, which are then integrated, or fused, into a global ranking using order statistics. In addition, it offers the flexibility of including additional data sources. Validation of our approach revealed it was able to efficiently prioritize 627 genes in disease data sets and 76 genes in biological pathway sets, identify candidates of 16 mono- or polygenic diseases, and discover regulatory genes of myeloid differentiation. Furthermore, the approach identified a novel gene involved in craniofacial development from a 2-Mb chromosomal region, deleted in some patients with DiGeorge-like birth defects. The approach described here offers an alternative integrative method for gene discovery.
引用
收藏
页码:537 / 544
页数:8
相关论文
共 51 条
[1]
Speeding disease gene discovery by sequence based candidate prioritization [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]
TOUCAN 2: the all-inclusive open source workbench for regulatory sequence analysis [J].
Aerts, S ;
Van Loo, P ;
Thijs, G ;
Mayer, H ;
de Martin, R ;
Moreau, Y ;
De Moor, B .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W393-W396
[3]
A genetic algorithm for the detection of new cis-regulatory modules in sets of coregulated genes [J].
Aerts, S ;
Van Loo, P ;
Moreau, Y ;
De Moor, B .
BIOINFORMATICS, 2004, 20 (12) :1974-1976
[4]
Computational detection of cis-regulatory modules [J].
Aerts, Stein ;
Van Loo, Peter ;
Thijs, Gert ;
Moreau, Yves ;
De Moor, Bart .
BIOINFORMATICS, 2003, 19 :II5-II14
[5]
Mutations in the glucocerebrosidase gene and Parkinson's disease in Ashkenazi Jews [J].
Aharon-Peretz, J ;
Rosenbaum, H ;
Gershoni-Baruch, R .
NEW ENGLAND JOURNAL OF MEDICINE, 2004, 351 (19) :1972-1977
[6]
PathwayVoyager: pathway mapping using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [J].
Altermann, E ;
Klaenhammer, TR .
BMC GENOMICS, 2005, 6 (1)
[7]
Bader GD, 2003, NUCLEIC ACIDS RES, V31, P248, DOI 10.1093/nar/gkg056
[8]
Dissecting contiguous gene defects:: TBX1 [J].
Baldini, A .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2005, 15 (03) :279-284
[9]
Funding high-throughput data sharing [J].
Ball, CA ;
Sherlock, G ;
Brazma, A .
NATURE BIOTECHNOLOGY, 2004, 22 (09) :1179-1183
[10]
A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis [J].
Begovich, AB ;
Carlton, VEH ;
Honigberg, LA ;
Schrodi, SJ ;
Chokkalingam, AP ;
Alexander, HC ;
Ardlie, KG ;
Huang, QQ ;
Smith, AM ;
Spoerke, JM ;
Conn, MT ;
Chang, M ;
Chang, SYP ;
Saiki, RK ;
Catanese, JJ ;
Leong, DU ;
Garcia, VE ;
McAllister, LB ;
Jeffery, DA ;
Lee, AT ;
Batliwalla, F ;
Remmers, E ;
Criswell, LA ;
Seldin, MF ;
Kastner, DL ;
Amos, CI ;
Sninsky, JJ ;
Gregersen, PK .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (02) :330-337