Predicting the change of exon splicing caused by genetic variant using support vector regression

被引:5
作者
Chen, Ken [1 ]
Lu, Yutong [1 ]
Zhao, Huiying [2 ]
Yang, Yuedong [1 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510000, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sun Yat Sen Mem Hosp, Guangzhou 510000, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Key Lab Machine Intelligence & Adv Comp, Minist Educ, Guangzhou, Guangdong, Peoples R China
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
alternative splicing; splice site motif; support vector machine; synonymous mutation; Vex-seq; SYNONYMOUS MUTATIONS; RNA STRUCTURE; IDENTIFICATION;
D O I
10.1002/humu.23785
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Alternative splicing can be disrupted by genetic variants that are related to diseases like cancers. Discovering the influence of genetic variations on the alternative splicing will improve the understanding of the pathogenesis of variants. Here, we developed a new approach, PredPSI-SVR to predict the impact of variants on exon skipping events by using the support vector regression. From the sequence of a particular exon and its flanking regions, 42 comprehensive features related to splicing events were extracted. By using a greedy feature selection algorithm, we found eight features contributing most to the prediction. The trained model achieved a Pearson correlation coefficient (PCC) of 0.570 in the 10-fold cross-validation based on the training data set provided by the "vex-seq" challenge of the 5th Critical Assessment of Genome Interpretation. In the blind test also held by the challenge, our prediction ranked the 2nd with a PCC of 0.566 that demonstrates the robustness of our method. A further test indicated that the PredPSI-SVR is helpful in prioritizing deleterious synonymous mutations. The method is available on .
引用
收藏
页码:1235 / 1242
页数:8
相关论文
共 38 条
[1]   Splicing mutations in human genetic disorders: examples, detection, and confirmation (vol 59, pg 253, 2018) [J].
Abramowicz, Anna ;
Gos, Monika .
JOURNAL OF APPLIED GENETICS, 2019, 60 (02) :231-231
[2]   Vex-seq: high-throughput identification of the impact of genetic variation on pre-mRNA splicing efficiency [J].
Adamson, Scott I. ;
Zhan, Lijun ;
Graveley, Brenton R. .
GENOME BIOLOGY, 2018, 19
[3]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[4]  
[Anonymous], 2016, KDD16 P 22 ACM, DOI DOI 10.1145/2939672.2939785
[5]   Alternative splicing as a regulator of development and tissue identity [J].
Baralle, Francisco E. ;
Giudice, Jimena .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2017, 18 (07) :437-451
[6]   Identification of deleterious synonymous variants in human genomes [J].
Buske, Orion J. ;
Manickaraj, AshokKumar ;
Mital, Seema ;
Ray, Peter N. ;
Brudno, Michael .
BIOINFORMATICS, 2013, 29 (15) :1843-1850
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   MMSplice: modular modeling improves the predictions of genetic variant effects on splicing [J].
Cheng, Jun ;
Thi Yen Duong Nguyen ;
Cygan, Kamil J. ;
Celik, Muhammed Hasan ;
Fairbrother, William G. ;
Avsec, Ziga ;
Gagneur, Julien .
GENOME BIOLOGY, 2019, 20 (1)
[9]   The Functional Impact of Alternative Splicing in Cancer [J].
Climente-Gonzalez, Hector ;
Porta-Pardo, Eduard ;
Godzik, Adam ;
Eyras, Eduardo .
CELL REPORTS, 2017, 20 (09) :2215-2226
[10]   Nonclassical splicing mutations in the coding and noncoding regions of the ATM gene:: Maximum entropy estimates of splice junction strengths [J].
Eng, L ;
Coutinho, G ;
Nahas, S ;
Yeo, G ;
Tanouye, R ;
Babaei, M ;
Dörk, T ;
Burge, C ;
Gatti, RA .
HUMAN MUTATION, 2004, 23 (01) :67-76