Palindromes in SARS and other coronaviruses

Cited by: 12
Authors
Chew, DSH [1]
Choi, KP
Heidner, H
Leung, MY
Affiliations
[1] Natl Univ Singapore, Dept Math, Singapore 117543, Singapore
[2] Natl Univ Singapore, Dept Stat, Singapore 117543, Singapore
[3] Univ Texas, Dept Biol, San Antonio, TX 78249 USA
[4] Univ Texas, Dept Math Sci, El Paso, TX 79968 USA
Keywords
Markov chain; palindrome counts; simulation; RNA viral genome; severe acute respiratory syndrome
DOI
10.1287/ijoc.1040.0087
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
With the identification of a novel coronavirus associated with the severe acute respiratory syndrome (SARS), computational analysis of its RNA genome sequence is expected to give useful clues to help elucidate the origin, evolution, and pathogenicity of the virus. In this paper, we study the collective counts of palindromes in the SARS genome along with all the completely sequenced coronaviruses. Based on a Markov-chain model for the genome sequence, the mean and standard deviation for the number of palindromes at or above a given length are derived. These theoretical results are complemented by extensive simulations to provide empirical estimates. Using a z score obtained from these mathematical and empirical means and standard deviations, we have observed that palindromes of length four are significantly underrepresented in all the coronaviruses in our data set. In contrast, length-six palindromes are significantly underrepresented only in the SARS coronavirus. Two other features are unique to the SARS sequence. First, there is a length-22 palindrome TCTTTAACAAGCTTGTTAAAGA spanning positions 25962-25983. Second, there are two repeating length-12 palindromes TTATAATTATAA spanning positions 22712-22723 and 22796-22807. Some further investigations into possible biological implications of these palindrome features are proposed.
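The counting and scoring steps described in the abstract can be illustrated with a minimal Python sketch. It assumes the usual reverse-complement definition of a nucleotide palindrome (the sequence equals the complement of its own reversal) and the DNA-style alphabet used for the sequences quoted above. The function names `is_rc_palindrome`, `count_palindromes`, and `z_score` are hypothetical and are not taken from the paper. The mean and standard deviation passed to `z_score` would have to come from the Markov-chain model or from simulation; they are not computed here.

```python
# Complement table for the DNA-style alphabet used in the abstract (assumption).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def is_rc_palindrome(s: str) -> bool:
    """True if s is even-length and equals its own reverse complement."""
    return len(s) % 2 == 0 and s == s.translate(COMPLEMENT)[::-1]


def count_palindromes(seq: str, length: int) -> int:
    """Count windows of exactly `length` bases that are reverse-complement palindromes.

    Any longer reverse-complement palindrome contains a palindromic window of
    this length at its center, so this also counts the centers of palindromes
    of length `length` or more.
    """
    return sum(
        is_rc_palindrome(seq[i:i + length])
        for i in range(len(seq) - length + 1)
    )


def z_score(observed: float, mean: float, sd: float) -> float:
    """Standardized difference between an observed count and its expected value."""
    return (observed - mean) / sd


# The two palindromes quoted in the abstract satisfy the assumed definition.
print(is_rc_palindrome("TCTTTAACAAGCTTGTTAAAGA"))  # length-22 palindrome
print(is_rc_palindrome("TTATAATTATAA"))            # length-12 palindrome
```

A strongly negative z score for a given length would correspond to the underrepresentation the abstract reports for length-four palindromes.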
Pages: 331-340
Number of pages: 10
Related papers
21 in total
  • [1] Lessons from SARS
    Bloom, BR
    [J]. SCIENCE, 2003, 300 (5620) : 701 - 701
  • [2] Palindromic sequence plays a critical role in human foamy virus dimerization
    Cain, D
    Erlwein, O
    Grigg, A
    Russell, RA
    McClure, MO
    [J]. JOURNAL OF VIROLOGY, 2001, 75 (08) : 3731 - 3739
  • [3] Requirements for RNA heterodimerization of the human immunodeficiency virus type 1 (HIV-1) and HIV-2 genomes
    Dirac, AMG
    Huthoff, H
    Kjems, J
    Berkhout, B
    [J]. JOURNAL OF GENERAL VIROLOGY, 2002, 83 : 2533 - 2542
  • [4] Structure, stability and function of RNA pseudoknots involved in stimulating ribosomal frameshifting
    Giedroc, DP
    Theimer, CA
    Nixon, PL
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 298 (02) : 167 - 185
  • [5] The dimer initiation sequence stem-loop of human immunodeficiency virus type 1 is dispensable for viral replication in peripheral blood mononuclear cells
    Hill, MK
    Shehu-Xhilaga, M
    Campbell, SM
    Poumbourios, P
    Crowe, SM
    Mak, J
    [J]. JOURNAL OF VIROLOGY, 2003, 77 (15) : 8329 - 8335
  • [6] STATISTICAL-ANALYSES OF COUNTS AND DISTRIBUTIONS OF RESTRICTION SITES IN DNA-SEQUENCES
    KARLIN, S
    BURGE, C
    CAMPBELL, AM
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (06) : 1363 - 1370
  • [7] LEUNG MY, 2002, IMS PREPRINT SERIES
  • [8] The genome sequence of the SARS-associated coronavirus
    Marra, MA
    Jones, SJM
    Astell, CR
    Holt, RA
    Brooks-Wilson, A
    Butterfield, YSN
    Khattra, J
    Asano, JK
    Barber, SA
    Chan, SY
    Cloutier, A
    Coughlin, SM
    Freeman, D
    Girn, N
    Griffith, OL
    Leach, SR
    Mayo, M
    McDonald, H
    Montgomery, SB
    Pandoh, PK
    Petrescu, AS
    Robertson, AG
    Schein, JE
    Siddiqui, A
    Smailus, DE
    Stott, JE
    Yang, GS
    Plummer, F
    Andonov, A
    Artsob, H
    Bastien, N
    Bernard, K
    Booth, TF
    Bowness, D
    Czub, M
    Drebot, M
    Fernando, L
    Flick, R
    Garbutt, M
    Gray, M
    Grolla, A
    Jones, S
    Feldmann, H
    Meyers, A
    Kabani, A
    Li, Y
    Normand, S
    Stroher, U
    Tipples, GA
    Tyler, S
    [J]. SCIENCE, 2003, 300 (5624) : 1399 - 1404
  • [9] Statistical evidence for a biochemical pathway of natural, sequence-targeted G/C to C/G transversion mutagenesis in Haemophilus influenzae Rd
    Merkl, R
    Fritz, HJ
    [J]. NUCLEIC ACIDS RESEARCH, 1996, 24 (21) : 4146 - 4151
  • [10] Qin L, 2003, ACTA PHARMACOL SIN, V24, P489