Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs

被引:158
作者
Jareborg, N [1 ]
Birney, E [1 ]
Durbin, R [1 ]
机构
[1] Sanger Ctr, Cambridge CB10 1SA, England
基金
英国惠康基金;
关键词
D O I
10.1101/gr.9.9.815
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A data set of 77 genomic mouse/human gene pairs has been compiled from the EMBL nucleotide database, and their corresponding Features determined. This set was used to analyze the degree of conservation of noncoding sequences between mouse and human. A new alignment algorithm was developed to cope with the fact that large parts of noncoding sequences are not alignable in a meaningful way because of genetic drift. This new algorithm, DNA Block Aligner (DBA), finds colinear-conserved blocks that are Flanked by nonconserved sequences of varying lengths. The noncoding regions of the data set were aligned with DBA. The proportion of the noncoding regions covered by blocks >60% identical was 36% for upstream regions, 50% for 5' UTRs, 23% For introns, and 56% for 3' UTRs. These blocks of high identity were more or less evenly distributed across the length of the features, except for upstream regions in which the first 100 bp upstream of the transcription start site was covered in up to 70% of the gene pairs. This data set complements earlier sets on the basis of cDNA sequences and will be useful for Further comparative studies.
引用
收藏
页码:815 / 824
页数:10
相关论文
共 29 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] [Anonymous], [No title captured]
  • [3] Ansari-Lari MA, 1998, GENOME RES, V8, P29
  • [4] NUMBER OF CPG ISLANDS AND GENES IN HUMAN AND MOUSE
    ANTEQUERA, F
    BIRD, A
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (24) : 11995 - 11999
  • [5] THE ISOCHORE ORGANIZATION OF THE HUMAN GENOME AND ITS EVOLUTIONARY HISTORY - A REVIEW
    BERNARDI, G
    [J]. GENE, 1993, 135 (1-2) : 57 - 66
  • [6] BIRNEY E, 1997, ISMB, V5, P56
  • [7] New goals for the US Human Genome Project: 1998-2003
    Collins, FS
    Patrinos, A
    Jordan, E
    Chakravarti, A
    Gesteland, R
    Walters, L
    Fearon, E
    Hartwelt, L
    Langley, CH
    Mathies, RA
    Olson, M
    Pawson, AJ
    Pollard, T
    Williamson, A
    Wold, B
    Buetow, K
    Branscomb, E
    Capecchi, M
    Church, G
    Garner, H
    Gibbs, RA
    Hawkins, T
    Hodgson, K
    Knotek, M
    Meisler, M
    Rubin, GM
    Smith, LM
    Smith, RF
    Westerfield, M
    Clayton, EW
    Fisher, NL
    Lerman, CE
    McInerney, JD
    Nebo, W
    Press, N
    Valle, D
    [J]. SCIENCE, 1998, 282 (5389) : 682 - 689
  • [8] STRONG CONSERVATION OF NONCODING SEQUENCES DURING VERTEBRATES EVOLUTION - POTENTIAL INVOLVEMENT IN POSTTRANSCRIPTIONAL REGULATION OF GENE-EXPRESSION
    DURET, L
    DORKELD, F
    GAUTIER, C
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (10) : 2315 - 2322
  • [9] Searching for regulatory elements in human noncoding sequences
    Duret, L
    Bucher, P
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (03) : 399 - 406
  • [10] Quality not quantity: The pufferfish genome
    Elgar, G
    [J]. HUMAN MOLECULAR GENETICS, 1996, 5 : 1437 - 1442