A comprehensive collection of chicken cDNAs

被引:271
作者
Boardman, PE
Sanz-Ezquerro, J
Overton, IM
Burt, DW
Bosch, E
Fong, WT
Tickle, C
Brown, WRA
Wilson, SA
Hubbard, SJ
机构
[1] Univ Manchester, Inst Sci & Technol, Dept Biomol Sci, Manchester M60 1QD, Lancs, England
[2] Univ Dundee, Wellcome Trust Bioctr, Med Sci Inst, Dundee DD1 5EH, Scotland
[3] Roslin Inst, Dept Genom & Bioinformat, Roslin EH25 9PS, Midlothian, Scotland
[4] Incyte Genom, Palo Alto, CA 94304 USA
[5] Univ Nottingham, Queens Med Ctr, Inst Genet, Nottingham NG7 2UH, England
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
D O I
10.1016/S0960-9822(02)01296-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Birds have played a central role in many biological disciplines, particularly ecology, evolution, and behavior. The chicken, as a model vertebrate, also represents an important experimental system for developmental biologists, immunologists, cell biologists, and geneticists. However, genomic resources for the chicken have lagged behind those for other model organisms, with only 1845 nonredundant full-length chicken cDNA sequences currently deposited in the EMBL databank. We describe a large-scale expressed-sequence-tag (EST) project aimed at gene discovery in chickens (http://www.chick.umist.ac.uk). In total, 339,314 ESTs have been sequenced from 64 cDNA libraries generated from 21 different embryonic and adult tissues. These were clustered and assembled into 85,486 contiguous sequences (contigs). We find that a minimum of 38% of the contigs have orthologs in other organisms and define an upper limit of 13,000 new chicken genes. The remaining contigs may include novel avian specific or rapidly evolving genes. Comparison of the contigs with known chicken genes and orthologs indicates that 30% include cDNAs that contain the start codon and 20% of the contigs represent full-length cDNA sequences. Using this dataset, we estimate that chickens have approximately 35,000 genes in total, suggesting that this number may be a characteristic feature of vertebrates.
引用
收藏
页码:1965 / 1969
页数:5
相关论文
共 16 条
[1]   A large database of chicken bursal ESTs as a resource for the analysis of vertebrate gene function [J].
Abdrakhmanov, I ;
Lodygin, D ;
Geroth, P ;
Arakawa, H ;
Law, A ;
Plachy, J ;
Korn, B ;
Buerstedde, JM .
GENOME RESEARCH, 2000, 10 (12) :2062-2069
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes [J].
Aparicio, S ;
Chapman, J ;
Stupka, E ;
Putnam, N ;
Chia, J ;
Dehal, P ;
Christoffels, A ;
Rash, S ;
Hoon, S ;
Smit, A ;
Gelpke, MDS ;
Roach, J ;
Oh, T ;
Ho, IY ;
Wong, M ;
Detter, C ;
Verhoef, F ;
Predki, P ;
Tay, A ;
Lucas, S ;
Richardson, P ;
Smith, SF ;
Clark, MS ;
Edwards, YJK ;
Doggett, N ;
Zharkikh, A ;
Tavtigian, SV ;
Pruss, D ;
Barnstead, M ;
Evans, C ;
Baden, H ;
Powell, J ;
Glusman, G ;
Rowen, L ;
Hood, L ;
Tan, YH ;
Elgar, G ;
Hawkins, T ;
Venkatesh, B ;
Rokhsar, D ;
Brenner, S .
SCIENCE, 2002, 297 (5585) :1301-1310
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[6]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[7]   Analysis of expressed sequence tags indicates 35,000 human genes [J].
Ewing, B ;
Green, P .
NATURE GENETICS, 2000, 25 (02) :232-234
[8]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[9]  
FU GK, 2002, Patent No. 6387624
[10]   A comparison of the Celera and Ensembl predicted gene sets reveals little overlap in novel genes [J].
Hogenesch, JB ;
Ching, KA ;
Batalov, S ;
Su, AI ;
Walker, JR ;
Zhou, YY ;
Kay, SA ;
Schultz, PG ;
Cooke, MP .
CELL, 2001, 106 (04) :413-415