CAGI 5 splicing challenge: Improved exon skipping and intron retention predictions with MMSplice

被引:8
作者
Cheng, Jun [1 ,2 ]
Celik, Muhammed Hasan [1 ]
Thi Yen Duong Nguyen [1 ]
Avsec, Ziga [1 ,2 ]
Gagneur, Julien [1 ,2 ]
机构
[1] Tech Univ Munich, Dept Informat, Boltzmannstr 3, D-85748 Garching, Germany
[2] Ludwig Maximilians Univ Munchen, Grad Sch Quantitat Biosci QBM, Munich, Germany
关键词
artificial neural network; splicing; variant effect; variant interpretation; SEQUENCE MOTIFS; RNA; VERTEBRATE; ENHANCERS;
D O I
10.1002/humu.23788
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Pathogenic genetic variants often primarily affect splicing. However, it remains difficult to quantitatively predict whether and how genetic variants affect splicing. In 2018, the fifth edition of the Critical Assessment of Genome Interpretation proposed two splicing prediction challenges based on experimental perturbation assays: Vex-seq, assessing exon skipping, and MaPSy, assessing splicing efficiency. We developed a modular modeling framework, MMSplice, the performance of which was among the best on both challenges. Here we provide insights into the modeling assumptions of MMSplice and its individual modules. We furthermore illustrate how MMSplice can be applied in practice for individual genome interpretation, using the MMSplice VEP plugin and the Kipoi variant interpretation plugin, which are directly applicable to VCF files.
引用
收藏
页码:1243 / 1251
页数:9
相关论文
共 30 条
[1]   Vex-seq: high-throughput identification of the impact of genetic variation on pre-mRNA splicing efficiency [J].
Adamson, Scott I. ;
Zhan, Lijun ;
Graveley, Brenton R. .
GENOME BIOLOGY, 2018, 19
[2]  
[Anonymous], 2004, Molecular Biology of the Cell
[3]   The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[4]   The Kipoi repository accelerates community exchange and reuse of predictive models for genomics [J].
Avsec, Ziga ;
Kreuzhuber, Roman ;
Israeli, Johnny ;
Xu, Nancy ;
Cheng, Jun ;
Shrikumar, Avanti ;
Banerjee, Abhimanyu ;
Kim, Daniel S. ;
Beier, Thorsten ;
Urban, Lara ;
Kundaje, Anshul ;
Stegle, Oliver ;
Gagneur, Julien .
NATURE BIOTECHNOLOGY, 2019, 37 (06) :592-600
[5]   Alternative splicing as a regulator of development and tissue identity [J].
Baralle, Francisco E. ;
Giudice, Jimena .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2017, 18 (07) :437-451
[6]   Deciphering the splicing code [J].
Barash, Yoseph ;
Calarco, John A. ;
Gao, Weijun ;
Pan, Qun ;
Wang, Xinchen ;
Shai, Ofer ;
Blencowe, Benjamin J. ;
Frey, Brendan J. .
NATURE, 2010, 465 (7294) :53-59
[7]   From Structure to Systems: High-Resolution, Quantitative Genetic Analysis of RNA Polymerase II [J].
Braberg, Hannes ;
Jin, Huiyan ;
Moehle, Erica A. ;
Chan, Yujia A. ;
Wang, Shuyi ;
Shales, Michael ;
Benschop, Joris J. ;
Morris, John H. ;
Qiu, Chenxi ;
Hu, Fuqu ;
Tang, Leung K. ;
Fraser, James S. ;
Holstege, Frank C. P. ;
Hieter, Philip ;
Guthrie, Christine ;
Kaplan, Craig D. ;
Krogan, Nevan J. .
CELL, 2013, 154 (04) :775-788
[8]   COSSMO: predicting competitive alternative splice site selection using deep learning [J].
Bretschneider, Hannes ;
Gandhi, Shreshth ;
Deshwar, Amit G. ;
Zuberi, Khalid ;
Frey, Brendan J. .
BIOINFORMATICS, 2018, 34 (13) :429-437
[9]   RNA-BINDING SPECIFICITY OF HNRNP A1 - SIGNIFICANCE OF HNRNP A1 HIGH-AFFINITY BINDING-SITES IN PRE-MESSENGER-RNA SPLICING [J].
BURD, CG ;
DREYFUSS, G .
EMBO JOURNAL, 1994, 13 (05) :1197-1204
[10]   MMSplice: modular modeling improves the predictions of genetic variant effects on splicing [J].
Cheng, Jun ;
Thi Yen Duong Nguyen ;
Cygan, Kamil J. ;
Celik, Muhammed Hasan ;
Fairbrother, William G. ;
Avsec, Ziga ;
Gagneur, Julien .
GENOME BIOLOGY, 2019, 20 (1)