DBTSS: DataBase of human transcriptional start sites and full-length cDNAs

被引:168
作者
Suzuki, Y
Yamashita, R
Nakai, K
Sugano, S
机构
[1] Univ Tokyo, Inst Med Sci, Ctr Human Genome, Minato Ku, Tokyo 1088639, Japan
[2] Nara Inst Sci & Technol, Taisho Lab Funct Gen, Ikoma, Nara 6300101, Japan
关键词
D O I
10.1093/nar/30.1.328
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In this study, we further constructed human cDNA libraries enriched in clones containing the cap structure to systematically explore the 5' end structure of expressed genes. Of approximately 217 402 5' end sequences obtained, 111 382 have been matched to cDNA sequences of known genes (7889 genes) and are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS; http://elmo.ims.u-tokyo.ac.jp/dbtss/). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4683 (34%) of RefSeq sequences should be extended towards the 5' ends. We also mapped each sequence on the human draft genome sequence to identify its transcriptional start site, which provides us with more detailed information on distribution patterns of transcriptional start sites and adjacent regulatory regions.
引用
收藏
页码:328 / 331
页数:4
相关论文
共 13 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] A computer program for aligning a cDNA sequence with a genomic DNA sequence
    Florea, L
    Hartzell, G
    Zhang, Z
    Rubin, GM
    Miller, W
    [J]. GENOME RESEARCH, 1998, 8 (09) : 967 - 974
  • [3] RAPID PRODUCTION OF FULL-LENGTH CDNAS FROM RARE TRANSCRIPTS - AMPLIFICATION USING A SINGLE GENE-SPECIFIC OLIGONUCLEOTIDE PRIMER
    FROHMAN, MA
    DUSH, MK
    MARTIN, GR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (23) : 8998 - 9002
  • [4] Initial sequencing and analysis of the human genome
    Lander, ES
    Int Human Genome Sequencing Consortium
    Linton, LM
    Birren, B
    Nusbaum, C
    Zody, MC
    Baldwin, J
    Devon, K
    Dewar, K
    Doyle, M
    FitzHugh, W
    Funke, R
    Gage, D
    Harris, K
    Heaford, A
    Howland, J
    Kann, L
    Lehoczky, J
    LeVine, R
    McEwan, P
    McKernan, K
    Meldrim, J
    Mesirov, JP
    Miranda, C
    Morris, W
    Naylor, J
    Raymond, C
    Rosetti, M
    Santos, R
    Sheridan, A
    Sougnez, C
    Stange-Thomann, N
    Stojanovic, N
    Subramanian, A
    Wyman, D
    Rogers, J
    Sulston, J
    Ainscough, R
    Beck, S
    Bentley, D
    Burton, J
    Clee, C
    Carter, N
    Coulson, A
    Deadman, R
    Deloukas, P
    Dunham, A
    Dunham, I
    Durbin, R
    French, L
    [J]. NATURE, 2001, 409 (6822) : 860 - 921
  • [5] MARUYAMA K, 1994, GENE, V138, P171
  • [6] TRANSCRIPTIONAL CONTROL SIGNALS OF A EUKARYOTIC PROTEIN-CODING GENE
    MCKNIGHT, SL
    KINGSBURY, R
    [J]. SCIENCE, 1982, 217 (4557) : 316 - 324
  • [7] Identification and analysis of eukaryotic promoters: recent computational approaches
    Ohler, U
    Niemann, H
    [J]. TRENDS IN GENETICS, 2001, 17 (02) : 56 - 60
  • [8] RefSeq and LocusLink: NCBI gene-centered resources
    Pruitt, KD
    Maglott, DR
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 137 - 140
  • [9] Introducing RefSeq and LocusLink: curated human genome resources at the NCBI
    Pruitt, KD
    Katz, KS
    Sicotte, H
    Maglott, DR
    [J]. TRENDS IN GENETICS, 2000, 16 (01) : 44 - 47
  • [10] REVOLUTIONS IN RAPID AMPLIFICATION OF CDNA ENDS - NEW STRATEGIES FOR POLYMERASE CHAIN-REACTION CLONING OF FULL-LENGTH CDNA ENDS
    SCHAEFER, BC
    [J]. ANALYTICAL BIOCHEMISTRY, 1995, 227 (02) : 255 - 273