De Novo Transcriptome Sequencing in Anopheles funestus Using Illumina RNA-Seq Technology

被引:127
作者
Crawford, Jacob E. [1 ]
Guelbeogo, Wamdaogo M. [2 ]
Sanou, Antoine [2 ]
Traore, Alphonse [2 ]
Vernick, Kenneth D. [3 ,4 ]
Sagnon, N'Fale [2 ]
Lazzaro, Brian P. [1 ]
机构
[1] Cornell Univ, Dept Entomol, Ithaca, NY 14853 USA
[2] Ctr Natl Rech & Format Paludisme, Ouagadougou, Burkina Faso
[3] Inst Pasteur, Dept Parasitol & Mycol, CNRS, Unit Hosts Vectors & Infect Agents URA3012, Paris, France
[4] Univ Minnesota, Dept Microbiol, St Paul, MN USA
来源
PLOS ONE | 2010年 / 5卷 / 12期
基金
美国国家卫生研究院;
关键词
MALARIA; VECTOR; TOOL; ANNOTATION; GENERATION; ALIGNMENT; DATABASE; REVEALS; GAMBIAE;
D O I
10.1371/journal.pone.0014202
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Anopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making development and implementation of novel disease control efforts more difficult. The An. funestus genome has not been sequenced, so in order to facilitate genome-scale experimental biology, we have sequenced the adult female transcriptome of An. funestus from a newly founded colony in Burkina Faso, West Africa, using the Illumina GAIIx next generation sequencing platform. Methodology/Principal Findings: We assembled short Illumina reads de novo using a novel approach involving iterative de novo assemblies and "target-based" contig clustering. We then selected a conservative set of 15,527 contigs through comparisons to four Dipteran transcriptomes as well as multiple functional and conserved protein domain databases. Comparison to the Anopheles gambiae immune system identified 339 contigs as putative immune genes, thus identifying a large portion of the immune system that can form the basis for subsequent studies of this important malaria vector. We identified 5,434 1: 1 orthologues between An. funestus and An. gambiae and found that among these 1: 1 orthologues, the protein sequence of those with putative immune function were significantly more diverged than the transcriptome as a whole. Short read alignments to the contig set revealed almost 367,000 genetic polymorphisms segregating in the An. funestus colony and demonstrated the utility of the assembled transcriptome for use in RNA-seq based measurements of gene expression. Conclusions/Significance: We developed a pipeline that makes de novo transcriptome sequencing possible in virtually any organism at a very reasonable cost ($6,300 in sequencing costs in our case). We anticipate that our approach could be used to develop genomic resources in a diversity of systems for which full genome sequence is currently unavailable. Our An. funestus contig set and analytical results provide a valuable resource for future studies in this non-model, but epidemiologically critical, vector insect.
引用
收藏
页数:12
相关论文
共 53 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], SCIENCE
[3]  
[Anonymous], GEN AN VECT CAP MAJ
[4]  
[Anonymous], WORLD MAL REP
[5]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[6]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[7]   Into the unknown: expression profiling without genome sequence information in CHO by next generation sequencing [J].
Birzele, Fabian ;
Schaub, Jochen ;
Rust, Werner ;
Clemens, Christoph ;
Baum, Patrick ;
Kaufmann, Hitto ;
Weith, Andreas ;
Schulz, Torsten W. ;
Hildebrandt, Tobias .
NUCLEIC ACIDS RESEARCH, 2010, 38 (12) :3999-4010
[8]   An insight into the sialome of Anopheles funestus reveals an emerging pattern in anopheline salivary protein families [J].
Calvo, Eric ;
Dao, Adama ;
Pham, Van M. ;
Ribeiro, Jose M. C. .
INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2007, 37 (02) :164-175
[9]  
Charif D., 2007, Structural Approaches to Sequence Evolution: Molecules, Networks, Populations, Biological and Medical Physics, Biomedical Engineering, P207, DOI [DOI 10.1007/978-3-540-35306-5_10, 10.1007/978-3-540-35306-5_10, 10.1007/978-3-540-35306-510]
[10]   Advances in the study of Anopheles funestus, a major vector of malaria in Africa [J].
Coetzee, M ;
Fontenille, D .
INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY, 2004, 34 (07) :599-605