Protein-DNA interactions: Amino acid conservation and the effects of mutations on binding specificity

被引:216
作者
Luscombe, NM
Thornton, JM
机构
[1] UCL, Biomol Struct & Modelling Unit, Dept Biochem & Mol Biol, London WC1E 6BT, England
[2] Univ London Birkbeck Coll, Dept Crystallog, London WC1E 7HX, England
基金
英国生物技术与生命科学研究理事会;
关键词
bioinformatics; structural biology; protein-DNA interactions; transcription factors; sequence conservation;
D O I
10.1016/S0022-2836(02)00571-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We investigate the conservation of amino acid residue sequences in 21 DNA-binding protein families and study the effects that mutations have on DNA-sequence recognition. The observations are best understood by assigning each protein family to one of three classes: (i) non-specific, where binding is independent of DNA sequence; (ii) highly specific, where binding is specific and all members of the family target the same DNA sequence; and (iii) multi-specific, where binding is also specific, but individual family members target different DNA sequences. Overall, protein residues in contact with the DNA are better conserved than the rest of the protein surface, but there is a complex underlying trend of conservation for individual residue positions. Amino acid residues that interact with the DNA backbone are well conserved across all protein families and provide a core of stabilising contacts for homologous protein-DNA complexes. In contrast, amino acid residues that interact with DNA bases have variable levels of conservation depending on the family classification. In non-specific families, base-contacting residues are well conserved and interactions are always found in the minor groove where there is little discrimination between base types. In highly specific families, base-contacting residues are highly conserved and allow member proteins to recognise the same target sequence. In multi-specific families, base-contacting residues undergo frequent mutations and enable different proteins to recognise distinct target sequences. Finally, we report that interactions with bases in the target sequence often follow (though not always) a universal code of amino acid-base recognition and the effects of amino acid mutations can be most easily understood for these interactions (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:991 / 1009
页数:19
相关论文
共 45 条
[1]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1977, 80 (02) :319-324
[4]   Indirect readout of DNA sequence at the primary-kink site in the CAP-DNA complex: DNA binding specificity based on energetics of DNA kinking [J].
Chen, SF ;
Vojtechovsky, J ;
Parkinson, GN ;
Ebright, RH ;
Berman, HM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 314 (01) :63-74
[5]   Indirect readout of DNA sequence at the primary-kink site in the CAP-DNA complex: Alteration of DNA binding specificity through alteration of DNA kinking [J].
Chen, SF ;
Gunasekera, A ;
Zhang, XP ;
Kunkel, TA ;
Ebright, RH ;
Berman, HM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 314 (01) :75-82
[6]   Physical basis of a protein-DNA recognition code [J].
Choo, Y ;
Klug, A .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (01) :117-125
[7]   DNA bending: The prevalence of kinkiness and the virtues of normality [J].
Dickerson, RE .
NUCLEIC ACIDS RESEARCH, 1998, 26 (08) :1906-1926
[8]   MECHANISTIC IMPLICATIONS FROM THE STRUCTURE OF A CATALYTIC FRAGMENT OF MOLONEY MURINE LEUKEMIA-VIRUS REVERSE-TRANSCRIPTASE [J].
GEORGIADIS, MM ;
JESSEN, SM ;
OGATA, CM ;
TELESNITSKY, A ;
GOFF, SP ;
HENDRICKSON, WA .
STRUCTURE, 1995, 3 (09) :879-892
[9]   THE SUBUNIT INTERFACES OF OLIGOMERIC ENZYMES ARE CONSERVED TO A SIMILAR EXTENT TO THE OVERALL PROTEIN SEQUENCES [J].
GRISHIN, NV ;
PHILLIPS, MA .
PROTEIN SCIENCE, 1994, 3 (12) :2455-2458
[10]   DNA-SEQUENCE RECOGNITION BY CAP - ROLE OF THE ADENINE N6 ATOM OF BASE PAIR-6 OF THE DNA SITE [J].
GUNASEKERA, A ;
EBRIGHT, YW ;
EBRIGHT, RH .
NUCLEIC ACIDS RESEARCH, 1990, 18 (23) :6853-6856