Learning Causal Biological Networks With the Principle of Mendelian Randomization

被引:45
作者
Badsha, Md Bahadur [1 ]
Fu, Audrey Qiuyan [1 ]
机构
[1] Univ Idaho, Inst Bioinformat & Evolutionary Studies, Ctr Modeling Complex Interact, Dept Stat Sci, Moscow, ID 83844 USA
关键词
causal inference; graphical models; biological networks; bioinformatics; Mendelian randomization; GENOME-WIDE ASSOCIATION; GENE-EXPRESSION; RISK LOCI; INFERENCE; TISSUES;
D O I
10.3389/fgene.2019.00460
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Although large amounts of genomic data are available, it remains a challenge to reliably infer causal (i. e., regulatory) relationships among molecular phenotypes (such as gene expression), especially when multiple phenotypes are involved. We extend the interpretation of the Principle of Mendelian randomization (PMR) and present MRPC, a novel machine learning algorithm that incorporates the PMR in the PC algorithm, a classical algorithm for learning causal graphs in computer science. MRPC learns a causal biological network efficiently and robustly from integrating individual-level genotype and molecular phenotype data, in which directed edges indicate causal directions. We demonstrate through simulation that MRPC outperforms several popular general-purpose network inference methods and PMR-based methods. We apply MRPC to distinguish direct and indirect targets among multiple genes associated with expression quantitative trait loci. Our method is implemented in the R package MRPC, available on GRAN (https://cran.r-project.org/web/packages/MRPC/index.html).
引用
收藏
页数:16
相关论文
共 49 条
[1]
Genetic effects on gene expression across human tissues [J].
Aguet, Francois ;
Brown, Andrew A. ;
Castel, Stephane E. ;
Davis, Joe R. ;
He, Yuan ;
Jo, Brian ;
Mohammadi, Pejman ;
Park, Yoson ;
Parsana, Princy ;
Segre, Ayellet V. ;
Strober, Benjamin J. ;
Zappala, Zachary ;
Cummings, Beryl B. ;
Gelfand, Ellen T. ;
Hadley, Kane ;
Huang, Katherine H. ;
Lek, Monkol ;
Li, Xiao ;
Nedzel, Jared L. ;
Nguyen, Duyen Y. ;
Noble, Michael S. ;
Sullivan, Timothy J. ;
Tukiainen, Taru ;
MacArthur, Daniel G. ;
Getz, Gad ;
Management, Nih Program ;
Addington, Anjene ;
Guan, Ping ;
Koester, Susan ;
Little, A. Roger ;
Lockhart, Nicole C. ;
Moore, Helen M. ;
Rao, Abhi ;
Struewing, Jeffery P. ;
Volpi, Simona ;
Collection, Biospecimen ;
Brigham, Lori E. ;
Hasz, Richard ;
Hunter, Marcus ;
Johns, Christopher ;
Johnson, Mark ;
Kopen, Gene ;
Leinweber, William F. ;
Lonsdale, John T. ;
McDonald, Alisa ;
Mestichelli, Bernadette ;
Myer, Kevin ;
Roe, Bryan ;
Salvatore, Michael ;
Shad, Saboor .
NATURE, 2017, 550 (7675) :204-+
[2]
Assessing the Effectiveness of Causality Inference Methods for Gene Regulatory Networks [J].
Ahmed, Syed Sazzad ;
Roy, Swarup ;
Kalita, Jugal .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (01) :56-70
[3]
Network motifs: theory and experimental approaches [J].
Alon, Uri .
NATURE REVIEWS GENETICS, 2007, 8 (06) :450-461
[4]
[Anonymous], 1924, Metron
[5]
Badsha M. B., 2018, BIORXIV, DOI [10.1101/171348, DOI 10.1101/171348]
[6]
Robust complementary hierarchical clustering for gene expression data analysis by β-divergence [J].
Badsha, Md. Bahadur ;
Mollah, Md. Nurul Hague ;
Jahan, Nusrat ;
Kurata, Hiroyuki .
JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2013, 116 (03) :397-407
[7]
Genetics of human gene expression: mapping DNA variants that influence gene expression [J].
Cheung, Vivian G. ;
Spielman, Richard S. .
NATURE REVIEWS GENETICS, 2009, 10 (09) :595-604
[8]
Chickering DM, 1996, Learning from Data: Artificial Intelligence and Statistics V, P121, DOI 10.1007/978-1-4612-2404-4_12
[9]
Colombo D, 2014, J MACH LEARN RES, V15, P3741
[10]
Connor L. J., 2017, BIORXIV205435, DOI [10.1101/205435, DOI 10.1101/205435]