Assembly of the working draft of the human genome with GigAssembler

Cited by: 85
Authors
Kent, WJ [1]
Haussler, D [2]
Affiliations
[1] Univ Calif Santa Cruz, Dept Biol, Santa Cruz, CA 95064 USA
[2] Univ Calif Santa Cruz, Howard Hughes Med Inst, Dept Comp Sci, Santa Cruz, CA 95064 USA
DOI
10.1101/gr.183201
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The data for the public working draft of the human genome contains roughly 400,000 initial sequence contigs in approximately 30,000 large insert clones. Many of these initial sequence contigs overlap. A program, GigAssembler, was built to merge them and to order and orient the resulting larger sequence contigs based on mRNA, paired plasmid ends, EST, BAC end pairs, and other information. This program produced the first publicly available assembly of the human genome, a working draft that contains roughly 2.7 billion base pairs, covers an estimated 88% of the genome, and has been used for several recent studies of the genome. Here we describe the algorithm used by GigAssembler.
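As a rough illustration of what merging overlapping sequence contigs involves, the sketch below joins two contigs when the end of one exactly matches the start of the other. It is not GigAssembler's algorithm, which this record does not detail. The function names, the `min_overlap` parameter, and the exact-match rule are assumptions made for the example; a real assembler would have to tolerate sequencing errors and consider both strands.

```python
from typing import Optional


def overlap_length(left: str, right: str, min_overlap: int = 3) -> int:
    """Return the length of the longest suffix of `left` that is also a
    prefix of `right`, or 0 if it is shorter than `min_overlap`.

    Hypothetical helper: exact string matching only, no error tolerance.
    """
    longest = min(len(left), len(right))
    for size in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def merge_contigs(left: str, right: str, min_overlap: int = 3) -> Optional[str]:
    """Merge two contigs end to end if they overlap; otherwise return None."""
    size = overlap_length(left, right, min_overlap)
    if size == 0:
        return None
    # Keep `left` whole and append only the non-overlapping tail of `right`.
    return left + right[size:]


if __name__ == "__main__":
    print(merge_contigs("ACGTTGCA", "TGCAGGT"))
    print(merge_contigs("AAAA", "CCCC"))
```

Ordering and orienting the merged contigs with mRNA, EST, or paired-end information is a separate step that this sketch does not attempt.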
Pages: 1541-1548
Page count: 8