Predicting reservoir hosts and arthropod vectors from evolutionary signatures in RNA virus genomes

被引:126
作者
Babayan, Simon A. [1 ,2 ]
Orton, Richard J. [3 ]
Streicker, Daniel G. [1 ,3 ]
机构
[1] Univ Glasgow, Inst Biodivers Anim Hlth & Comparat Med, Glasgow G12 8QQ, Lanark, Scotland
[2] Pentlands Sci Pk, Moredun Res Inst, Penicuik EH26 0PZ, Midlothian, Scotland
[3] Univ Glasgow, Ctr Virus Res, MRC, Glasgow G61 1QH, Lanark, Scotland
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会; 英国惠康基金;
关键词
RESPIRATORY SYNDROME CORONAVIRUS; TO-HUMAN TRANSMISSION; BAT CORONAVIRUS; DATABASE; ORIGINS; USAGE;
D O I
10.1126/science.aap9072
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying the animal origins of RNA viruses requires years of field and laboratory studies that stall responses to emerging infectious diseases. Using large genomic and ecological datasets, we demonstrate that animal reservoirs and the existence and identity of arthropod vectors can be predicted directly from viral genome sequences via machine learning. We illustrate the ability of these models to predict the epidemiology of diverse viruses across most human-infective families of single-stranded RNA viruses, including 69 viruses with previously elusive or never-investigated reservoirs or vectors. Models such as these, which capitalize on the proliferation of low-cost genomic sequencing, can narrow the time lag between virus discovery and targeted research, surveillance, and management.
引用
收藏
页码:577 / +
页数:39
相关论文
共 44 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Synonymous Virus Genome Recocing as a Tool to Impact Viral Fitness [J].
Angel Martinez, Miguel ;
Jordan-Paiz, Ana ;
Franco, Sandra ;
Nevot, Maria .
TRENDS IN MICROBIOLOGY, 2016, 24 (02) :134-147
[3]   Evidence for Camel-to-Human Transmission of MERS Coronavirus [J].
Azhar, Esam I. ;
El-Kafrawy, Sherif A. ;
Farraj, Suha A. ;
Hassan, Ahmed M. ;
Al-Saeed, Muneera S. ;
Hashem, Anwar M. ;
Madani, Tariq A. .
NEW ENGLAND JOURNAL OF MEDICINE, 2014, 370 (26) :2499-2505
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Charif D., 2007, Sequence Evolution: Molecules, Networks, Populations, P207, DOI [DOI 10.1007/978-3-540-35306-5_10, 10.1007/978-3-540-35306-5_10]
[6]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[7]   Virus attenuation by genome-scale changes in codon pair bias [J].
Coleman, J. Robert ;
Papamichail, Dimitris ;
Skiena, Steven ;
Futcher, Bruce ;
Wimmer, Eckard ;
Mueller, Steffen .
SCIENCE, 2008, 320 (5884) :1784-1787
[8]   Dinucleotide Composition in Animal RNA Viruses Is Shaped More by Virus Family than by Host Species [J].
Di Giallonardo, Francesca ;
Schlub, Timothy E. ;
Shi, Mang ;
Holmes, Edward C. .
JOURNAL OF VIROLOGY, 2017, 91 (08)
[9]   Virus genomes reveal factors that spread and sustained the Ebola epidemic [J].
Dudas, Gytis ;
Carvalho, Luiz Max ;
Bedford, Trevor ;
Tatem, Andrew J. ;
Baele, Guy ;
Faria, Nuno R. ;
Park, Daniel J. ;
Ladner, Jason T. ;
Arias, Armando ;
Asogun, Danny ;
Bielejec, Filip ;
Caddy, Sarah L. ;
Cotten, Matthew ;
D'Ambrozio, Jonathan ;
Dellicour, Simon ;
Di Caro, Antonino ;
Diclaro, Joseph W. ;
Duraffour, Sophie ;
Elmore, Michael J. ;
Fakoli, Lawrence S. ;
Faye, Ousmane ;
Gilbert, Merle L. ;
Gevao, Sahr M. ;
Gire, Stephen ;
Gladden-Young, Adrianne ;
Gnirke, Andreas ;
Goba, Augustine ;
Grant, Donald S. ;
Haagmans, Bart L. ;
Hiscox, Julian A. ;
Jah, Umaru ;
Kugelman, Jeffrey R. ;
Liu, Di ;
Lu, Jia ;
Malboeuf, Christine M. ;
Mate, Suzanne ;
Matthews, David A. ;
Matranga, Christian B. ;
Meredith, Luke W. ;
Qu, James ;
Quick, Joshua ;
Pas, Suzan D. ;
Phan, My V. T. ;
Pollakis, Georgios ;
Reusken, Chantal B. ;
Sanchez-Lockhart, Mariano ;
Schaffner, Stephen F. ;
Schieffelin, John S. ;
Sealfon, Rachel S. ;
Simon-Loriere, Etienne .
NATURE, 2017, 544 (7650) :309-+
[10]   MERS-CoV recombination: implications about the reservoir and potential for adaptation [J].
Dudas, Gytis ;
Rambaut, Andrew .
VIRUS EVOLUTION, 2016, 2 (01)