In silico prediction of scaffold/matrix attachment regions in large genomic sequences

被引:44
作者
Frisch, M [1 ]
Frech, K
Klingenhoff, A
Cartharius, K
Liebich, I
Werner, T
机构
[1] Genomatix Software, D-80339 Munich, Germany
[2] Gesell Biotechnol Forsch mbH, Res Grp Bioinformat, D-38124 Braunschweig, Germany
[3] GSF, Natl Res Ctr Environm & Hlth, Inst Expt Genet, D-85764 Neuherberg, Germany
关键词
D O I
10.1101/gr.206602.ArticlepublishedonlinebeforeprintinJanuary2002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Scaffold/matrix attachment regions (S/MARS) are essential regulatory DNA elements of eukaryotic cells. They are major determinants of locus control of gene expression and can shield gene expression from position effects. Experimental detection of S/MARS requires substantial effort and is not suitable for large-scale screening of genomic sequences. In silico prediction of S/MARS can provide a crucial first selection step to reduce the number of candidates. We used experimentally defined S/MAR sequences as the training set and generated a library of new S/MAR-associated, AT-rich patterns described as weight matrices. A new tool called SMARTest was developed that identifies potential S/MARS by performing a density analysis based on the S/MAR matrix library (http://www.genomatix.delcgi-bin/smartest_pd/slllartest.pl). S/MAR predictions were evaluated by using six genomic sequences from animal and plant for which S/MARS and non-S/MARS were experimentally mapped. SMARTest reached a sensitivity of 38% and a specificity of 68%. In contrast to previous algorithms, the SMARTest approach does not depend on the sequence context and is suitable to analyze long genomic sequences tip to the size of whole chromosomes. To demonstrate the feasibility of large-scale S/MAR prediction, we analyzed the recently published chromosome 22 sequence and found 1198 S/MAR candidates.
引用
收藏
页码:349 / 354
页数:6
相关论文
共 43 条
  • [11] WEIGHT MATRIX DESCRIPTIONS OF 4 EUKARYOTIC RNA POLYMERASE-II PROMOTER ELEMENTS DERIVED FROM 502 UNRELATED PROMOTER SEQUENCES
    BUCHER, P
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (04) : 563 - 578
  • [12] CHEN QK, 1995, COMPUT APPL BIOSCI, V11, P563
  • [13] COCKERILL PN, 1987, J BIOL CHEM, V262, P5394
  • [14] CHROMOSOMAL LOOP ANCHORAGE OF THE KAPPA IMMUNOGLOBULIN GENE OCCURS NEXT TO THE ENHANCER IN A REGION CONTAINING TOPOISOMERASE-II SITES
    COCKERILL, PN
    GARRARD, WT
    [J]. CELL, 1986, 44 (02) : 273 - 282
  • [15] Dickinson LA, 1997, J BIOL CHEM, V272, P11463
  • [16] A TISSUE-SPECIFIC MAR/SAR DNA-BINDING PROTEIN WITH UNUSUAL BINDING-SITE RECOGNITION
    DICKINSON, LA
    JOH, T
    KOHWI, Y
    KOHWISHIGEMATSU, T
    [J]. CELL, 1992, 70 (04) : 631 - 645
  • [17] The DNA sequence of human chromosome 22
    Dunham, I
    Shimizu, N
    Roe, BA
    Chissoe, S
    Dunham, I
    Hunt, AR
    Collins, JE
    Bruskiewich, R
    Beare, DM
    Clamp, M
    Smink, LJ
    Ainscough, R
    Almeida, JP
    Babbage, A
    Bagguley, C
    Balley, J
    Barlow, K
    Bates, KN
    Beasley, O
    Bird, CP
    Blakey, S
    Bridgeman, AM
    Buck, D
    Burgess, J
    Burrill, WD
    Burton, J
    Carder, C
    Carter, NP
    Chen, Y
    Clark, G
    Clegg, SM
    Cobley, V
    Cole, CG
    Collier, RE
    Connor, RE
    Conroy, D
    Corby, N
    Coville, GJ
    Cox, AV
    Davis, J
    Dawson, E
    Dhami, PD
    Dockree, C
    Dodsworth, SJ
    Durbin, RM
    Ellington, A
    Evans, KL
    Fey, JM
    Fleming, K
    French, L
    [J]. NATURE, 1999, 402 (6761) : 489 - 495
  • [18] Frech K, 1998, In Silico Biol, V1, P29
  • [19] Common modular structure of lentivirus LTRs
    Frech, K
    BrackWerner, R
    Werner, T
    [J]. VIROLOGY, 1996, 224 (01) : 256 - 267
  • [20] FRISCH M, 2000, P GERM C BIOINF, P27