Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence

Cited by: 244
Authors
Roest Crollius, H
Jaillon, O
Bernot, A
Dasilva, C
Bouneau, L
Fischer, C
Fizames, C
Wincker, P
Brottier, P
Quétier, F
Saurin, W
Weissenbach, J [1]
Affiliations
[1] Genoscope, Evry, France
[2] CNRS FRE2231, Evry, France
DOI
10.1038/76118
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
The number of genes in the human genome is unknown, with estimates ranging from 50,000 to 90,000 (refs 1, 2), and to more than 140,000 according to unpublished sources. We have developed 'Exofish', a procedure based on homology searches, to identify human genes quickly and reliably. This method relies on the sequence of another vertebrate, the pufferfish Tetraodon nigroviridis, to detect conserved sequences with a very low background. Similar to Fugu rubripes, a marine pufferfish proposed by Brenner et al. (3) as a model for genomic studies, T. nigroviridis is a more practical alternative (4) with a genome also eight times more compact than that of human. Many comparisons have been made between F. rubripes and human DNA that demonstrate the potential of comparative genomics using the pufferfish genome (5). Application of Exofish to the December version of the working draft sequence of the human genome and to Unigene showed that the human genome contains 28,000-34,000 genes, and that Unigene contains less than 40% of the protein-coding fraction of the human genome.
Pages: 235-238
Page count: 4