The Drosophila gene collection:: Identification of putative full-length cDNAs for 70% of D-melanogaster genes

被引:155
作者
Stapleton, M [1 ]
Liao, GC
Brokstein, P
Hong, L
Carninci, P
Shiraki, T
Hayashizaki, Y
Champe, M
Pacleb, J
Wan, K
Yu, C
Carlson, J
George, R
Celniker, S
Rubin, GM
机构
[1] Lawrence Berkley Natl Lab, Berkeley Drosophila Genom Project, Berkeley, CA 94720 USA
[2] Lawrence Berkley Natl Lab, Genome Sci Dept, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Mol & Cell Biol, Berkeley, CA 94720 USA
[4] RIKEN, Yokohama Inst, Genom Sci Ctr, Genome Explorat Res Grp,Tsurumi Ku, Yokohama, Kanagawa 2300045, Japan
[5] Univ Calif Berkeley, Howard Hughes Med Inst, Berkeley, CA 94720 USA
关键词
D O I
10.1101/gr.269102
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Collections of full-length nonredundant cDNA clones are critical reagents for functional genomics. The first step toward these resources is the generation and single-pass sequencing of cDNA libraries that contain a high proportion of full-length clones. The first release of the Drosophila Gene Collection Release 1 (DGCr1) was produced from six libraries representing various tissues, developmental stages, and the Cultured S2 cell line. Nearly 80,000 random 5' expressed sequence tags (5' expressed sequence tags [ESTs]from these libraries were collapsed into a nonredundant set of 5849 cDNAs, corresponding to similar to40% of the 13,474 predicted genes in Drosophila. To obtain cDNA clones representing the remaining genes, we have generated an additional 157,835 S' ESTs from two previously existing and three new libraries. One new library is derived from adult testis, a tissue we previously did not exploit for gene discovery; two new cap-trapped normalized libraries are derived from 0-22-h embryos and adult heads. Taking advantage of the annotated D. melanogaster genome sequence, we clustered the ESTs by aligning them to the genome. Clusters that overlap genes not already represented by cDNA clones in the DGCr1 were analyzed further, and putative full-length clones were selected for inclusion in the new DGC. This second release of the DGC (DGCr2) contains 5061 additional clones, extending the collection to 10,910 cDNAs representing >70% of the predicted genes in Drosophila.
引用
收藏
页码:1294 / 1300
页数:7
相关论文
共 19 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   Gene discovery using computational and microarray analysis of transcription in the Drosophila melanogaster testis [J].
Andrews, J ;
Bouffard, GG ;
Cheadle, C ;
Lü, JN ;
Becker, KG ;
Oliver, B .
GENOME RESEARCH, 2000, 10 (12) :2030-2043
[5]   A biologist's view of the Drosophila genome annotation assessment project [J].
Ashburner, M .
GENOME RESEARCH, 2000, 10 (04) :391-393
[6]   Normalization and subtraction: Two approaches to facilitate gene discovery [J].
Bonaldo, MDF ;
Lennon, G ;
Soares, MB .
GENOME RESEARCH, 1996, 6 (09) :791-806
[7]   Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes [J].
Carninci, P ;
Shibata, Y ;
Hayatsu, N ;
Sugahara, Y ;
Shibata, K ;
Itoh, M ;
Konno, H ;
Okazaki, Y ;
Muramatsu, M ;
Hayashizaki, Y .
GENOME RESEARCH, 2000, 10 (10) :1617-1630
[8]   Thermostabilization and thermoactivation of thermolabile enzymes by trehalose and its application for the synthesis of full length cDNA [J].
Carninci, P ;
Nishiyama, Y ;
Westover, A ;
Itoh, M ;
Nagaoka, S ;
Sasaki, N ;
Okazaki, Y ;
Muramatsu, M ;
Hayashizaki, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (02) :520-524
[9]   Balanced-size and long-size cloning of full-length, cap-trapped cDNAs into vectors of the novel λ-FLC family allows enhanced gene discovery rate and functional analysis [J].
Carninci, P ;
Shibata, Y ;
Hayatsu, N ;
Itoh, M ;
Shiraki, T ;
Hirozane, T ;
Watahiki, A ;
Shibata, K ;
Konno, H ;
Muramatsu, M ;
Hayashizaki, Y .
GENOMICS, 2001, 77 (1-2) :79-90
[10]   Y chromosomal fertility factors kl-2 and kl-3 of Drosophila melanogaster encode dynein heavy chain polypeptides [J].
Carvalho, AB ;
Lazzaro, BP ;
Clark, AG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (24) :13239-13244