Assessing the Drosophila melanogaster and Anopheles gambiae genome annotations using genome-wide sequence comparisons

Cited by: 7
Authors
Jaillon, O
Dossat, C
Eckenberg, R
Eiglmeier, K
Segurens, A
Aury, JM
Roth, CW
Scarpelli, C
Brey, PT
Weissenbach, J
Wincker, P [1]
Affiliations
[1] Ctr Natl Sequencage, Genoscope, F-91057 Evry, France
[2] CNRS UMR 8030, F-91057 Evry, France
[3] Inst Pasteur, Unite Biochim & Biol Mol Insectes, F-75724 Paris 15, France
DOI
10.1101/gr.922503
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
We performed genome-wide sequence comparisons at the protein coding level between the genome sequences of Drosophila melanogaster and Anopheles gambiae. Such comparisons detect evolutionarily conserved regions (ecores) that can be used for a qualitative and quantitative evaluation of the available annotations of both genomes. They also provide novel candidate features for annotation. The percentage of ecores mapping outside annotations in the A. gambiae genome is about fourfold higher than in D. melanogaster. The A. gambiae genome assembly also contains a high proportion of duplicated ecores, possibly resulting from artefactual sequence duplications in the genome assembly. The occurrence of 4063 ecores in the D. melanogaster genome outside annotations suggests that some genes are not yet or only partially annotated. The present work illustrates the power of comparative genomics approaches towards an exhaustive and accurate establishment of gene models and gene catalogues in insect genomes.
Pages: 1595-1599
Number of pages: 5