NCBI Reference Sequence Project: update and current status

Cited by: 136
Authors
Pruitt, KD [1]
Tatusova, T [1]
Maglott, DR [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
DOI
10.1093/nar/gkg111
Chinese Library Classification (CLC) numbers
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The goal of the NCBI Reference Sequence (RefSeq) project is to provide the single best non-redundant and comprehensive collection of naturally occurring biological molecules, representing the central dogma. Nucleotide and protein sequences are explicitly linked on a residue-by-residue basis in this collection. Ideally all molecule types will be available for each well-studied organism, but the initial database collection pragmatically includes only those molecules and organisms that are most readily identified. Thus different amounts of information are available for different organisms at any given time. Furthermore, for some organisms additional intermediate records are provided when the genome sequence is not yet finished. The collection is supplied by NCBI through three distinct pipelines in addition to collaborations with community groups. The collection is curated on an ongoing basis. Additional information about the NCBI RefSeq project is available at http://www.ncbi.nih.gov/RefSeq/.
Pages: 34-37
Page count: 4