Identifying repeat domains in large genomes

Cited by: 24
Authors
Zhi, DG [1]
Raphael, BJ
Price, AL
Tang, HX
Pevzner, PA
Affiliations
[1] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[4] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
[5] Indiana Univ, Ctr Genom & Bioinformat, Bloomington, IN 47408 USA
DOI
10.1186/gb-2006-7-1-r7
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
We present a graph-based method for the analysis of repeat families in a repeat library. We build a repeat domain graph that decomposes a repeat library into repeat domains, short subsequences shared by multiple repeat families, and reveals the mosaic structure of repeat families. Our method recovers documented mosaic repeat structures and suggests additional putative ones. Our method is useful for elucidating the evolutionary history of repeats and annotating de novo generated repeat libraries.
Pages: 14