DNPTrapper: an assembly editing tool for finishing and analysis of complex repeat regions

Cited by: 11
Authors
Arner, E [1]
Tammi, MT [1]
Tran, AN [1]
Kindlund, E [1]
Andersson, B [1]
Affiliations
[1] Karolinska Inst, Ctr Genom & Bioinformat, Stockholm, Sweden
DOI
10.1186/1471-2105-7-155
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Background: Many genome projects are left unfinished due to complex, repeated regions. Finishing is the most time-consuming step in sequencing, and current finishing tools are not designed with particular attention to the repeat problem. Results: We have developed DNPTrapper, a shotgun sequence finishing tool specifically designed to address the problems posed by the presence of repeated regions in the target sequence. The program detects and visualizes single-base differences between nearly identical repeat copies, and offers the overview and flexibility needed to rapidly resolve complex regions within a working session. The use of a database allows large amounts of data to be stored and handled, and allows viewing of mammalian-size genomes. The program is available under an Open Source license. Conclusion: With DNPTrapper, it is possible to separate repeated regions that were previously considered impossible to resolve, and finishing tasks that previously took days or weeks can be resolved within hours or even minutes.
Pages: 11