Expression Quantitative Trait Loci Mapping With Multivariate Sparse Partial Least Squares Regression

被引:51
作者
Chun, Hyonho [1 ]
Keles, Suenduez [1 ,2 ]
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53705 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53705 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
BAYESIAN VARIABLE SELECTION; GENE-EXPRESSION; STATISTICAL FRAMEWORK; MICE; MOUSE; MODEL; MOUSE-CHROMOSOME-2; ABUNDANCE; NETWORKS; CLONING;
D O I
10.1534/genetics.109.100362
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Expression qnantitative trait loci (eQTL) mapping concerns finding genomic variation to elucidate variation of expression traits. This problem poses significant challenges due to high dimensionality of both the gene expression and the genomic markerdata. We propose a multivariate response regression approach with simultaneous variable selection and dimension reduction for the eQTL mapping problem. Transcripts with similar expression are clustered into groups, and their expression profiles are viewed as a multivariate response. Then, we employ our recently developed sparse partial least-squares regression methodology to select markers associated with each cluster of genes. We demonstrate with extensive simulations that our eQTL mapping with multivariate response sparse partial least-squares regression (M-SPLS eQTL) method overcomes the issue of multiple transcript- or marker-specific analyses, thereby avoiding potential elevation of type I error. Additionally, joint. analysis of multiple transcripts by multivariate response regression increases power for detecting weak linkages. We illustrate that M-SPLS eQTL compares competitively with other approaches and has a number of significant advantages, including the ability to handle highly correlated genotype data and computational efficiency. We provide an application of this methodology to a mouse data set concerning obesity and diabetes.
引用
收藏
页码:79 / 90
页数:12
相关论文
共 46 条
  • [1] Multiple phenotype modeling in gene-mapping studies of quantitative traits: Power advantages
    Allison, DB
    Thiel, B
    St Jean, P
    Elston, RC
    Infante, MC
    Schork, NJ
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 63 (04) : 1190 - 1201
  • [2] [Anonymous], SPARSE PARTIAL LEAST
  • [3] Prediction by supervised principal components
    Bair, E
    Hastie, T
    Paul, D
    Tibshirani, R
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 119 - 137
  • [4] The landscape of genetic complexity across 5,700 gene expression traits in yeast
    Brem, RB
    Kruglyak, L
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (05) : 1572 - 1577
  • [5] Genetic dissection of transcriptional regulation in budding yeast
    Brem, RB
    Yvert, G
    Clinton, R
    Kruglyak, L
    [J]. SCIENCE, 2002, 296 (5568) : 752 - 755
  • [6] A statistical framework for expression quantitative trait loci mapping
    Chen, Meng
    Kendziorski, Christina
    [J]. GENETICS, 2007, 177 (02) : 761 - 771
  • [7] Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function
    Chesler, EJ
    Lu, L
    Shou, SM
    Qu, YH
    Gu, J
    Wang, JT
    Hsu, HC
    Mountz, JD
    Baldwin, NE
    Langston, MA
    Threadgill, DW
    Manly, KF
    Williams, RW
    [J]. NATURE GENETICS, 2005, 37 (03) : 233 - 242
  • [8] Genetic loci for diet-induced atherosclerotic lesions and plasma lipids in mice
    Colinayo, VV
    Qiao, JH
    Wang, XP
    Krass, KL
    Schadt, E
    Lusis, AJ
    Drake, TA
    [J]. MAMMALIAN GENOME, 2003, 14 (07) : 464 - 471
  • [9] SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION
    DEJONG, S
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) : 251 - 263
  • [10] A novel mouse chromosome 2 congenic strain with obesity phenotypes
    Diament, AL
    Farahani, P
    Chiu, S
    Fisler, J
    Warden, CH
    [J]. MAMMALIAN GENOME, 2004, 15 (06) : 452 - 459