De novo assembly and validation of planaria transcriptome by massive parallel sequencing and shotgun proteomics

被引:87
作者
Adamidi, Catherine [1 ]
Wang, Yongbo [1 ]
Gruen, Dominic [1 ]
Mastrobuoni, Guido [1 ]
You, Xintian [1 ]
Tolle, Dominic [1 ]
Dodt, Matthias [1 ]
Mackowiak, Sebastian D. [1 ]
Gogol-Doering, Andreas [1 ]
Oenal, Pinar [1 ]
Rybak, Agnieszka [1 ]
Ross, Eric [2 ]
Alvarado, Alejandro Sanchez [2 ]
Kempa, Stefan [1 ]
Dieterich, Christoph [1 ]
Rajewsky, Nikolaus [1 ]
Chen, Wei [1 ]
机构
[1] Max Delbruck Ctr Mol Med, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
[2] Univ Utah, Howard Hughes Med Inst, Dept Neurobiol & Anat, Salt Lake City, UT 84132 USA
关键词
DUPLEX-SPECIFIC NUCLEASE; STEM-CELLS; SCHMIDTEA-MEDITERRANEA; RNA-SEQ; REGENERATION; PROTEIN; MODEL; QUANTIFICATION; IDENTIFICATION; ORTHOLOGS;
D O I
10.1101/gr.113779.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Freshwater planaria are a very attractive model system for stem cell biology, tissue homeostasis, and regeneration. The genome of the planarian Schmidtea mediterranea has recently been sequenced and is estimated to contain >20,000 protein-encoding genes. However, the characterization of its transcriptome is far from complete. Furthermore, not a single proteome of the entire phylum has been assayed on a genome-wide level. We devised an efficient sequencing strategy that allowed us to de novo assemble a major fraction of the S. mediterranea transcriptome. We then used independent assays and massive shotgun proteomics to validate the authenticity of transcripts. In total, our de novo assembly yielded 18,619 candidate transcripts with a mean length of 1118 nt after filtering. A total of 17,564 candidate transcripts could be mapped to 15,284 distinct loci on the current genome reference sequence. RACE confirmed complete or almost complete 5' and 3' ends for 22/24 transcripts. The frequencies of frame shifts, fusion, and fission events in the assembled transcripts were computationally estimated to be 4.2%-13%, 0%-3.7%, and 2.6%, respectively. Our shotgun proteomics produced 16,135 distinct peptides that validated 4200 transcripts (FDR <= 1%). The catalog of transcripts assembled in this study, together with the identified peptides, dramatically expands and refines planarian gene annotation, demonstrated by validation of several previously unknown transcripts with stem cell-dependent expression patterns. In addition, our robust transcriptome characterization pipeline could be applied to other organisms without genome assembly. All of our data, including homology annotation, are freely available at SmedGD, the S. mediterranea genome database.
引用
收藏
页码:1193 / 1200
页数:8
相关论文
共 31 条
[1]   Regeneration and gene regulation in planarians [J].
Agata, K .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2003, 13 (05) :492-496
[2]   Automatic clustering of orthologs and inparalogs shared by multiple proteomes [J].
Alexeyenko, Andrey ;
Tamas, Ivica ;
Liu, Gang ;
Sonnhammer, Erik L. L. .
BIOINFORMATICS, 2006, 22 (14) :E9-E15
[3]  
Alvarado A.Sanchez., 2006, CELL, V124, P241, DOI DOI 10.1016/J.CELL.2006.01.012
[4]   MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes [J].
Cantarel, Brandi L. ;
Korf, Ian ;
Robb, Sofia M. C. ;
Parra, Genis ;
Ross, Eric ;
Moore, Barry ;
Holt, Carson ;
Alvarado, Alejandro Sanchez ;
Yandell, Mark .
GENOME RESEARCH, 2008, 18 (01) :188-196
[5]   MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification [J].
Cox, Juergen ;
Mann, Matthias .
NATURE BIOTECHNOLOGY, 2008, 26 (12) :1367-1372
[6]   High-resolution profiling and discovery of planarian small RNAs [J].
Friedlaender, Marc R. ;
Adamidi, Catherine ;
Han, Ting ;
Lebedeva, Svetlana ;
Isenbarger, Thomas A. ;
Hirst, Martin ;
Marra, Marco ;
Nusbaum, Chad ;
Lee, William L. ;
Jenkin, James C. ;
Alvarado, Alejandro Sanchez ;
Kim, John K. ;
Rajewsky, Nikolaus .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (28) :11546-11551
[7]   Stem cells and regeneration in planarians [J].
Handberg-Thorsager, Mette ;
Fernandez, Enrique ;
Salo, Emili .
FRONTIERS IN BIOSCIENCE-LANDMARK, 2008, 13 :6374-6394
[8]   Expression and functional analysis of musashi-like genes in planarian CNS regeneration [J].
Higuchi, Sayaka ;
Hayashi, Tetsutaro ;
Tarui, Hiroshi ;
Nishimura, Osamu ;
Nishimura, Kaneyasu ;
Shibata, Norito ;
Sakamoto, Hiroshi ;
Agata, Kiyokazu .
MECHANISMS OF DEVELOPMENT, 2008, 125 (07) :631-645
[9]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[10]   De novo assembly of human genomes with massively parallel short read sequencing [J].
Li, Ruiqiang ;
Zhu, Hongmei ;
Ruan, Jue ;
Qian, Wubin ;
Fang, Xiaodong ;
Shi, Zhongbin ;
Li, Yingrui ;
Li, Shengting ;
Shan, Gao ;
Kristiansen, Karsten ;
Li, Songgang ;
Yang, Huanming ;
Wang, Jian ;
Wang, Jun .
GENOME RESEARCH, 2010, 20 (02) :265-272