The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants

被引:228
作者
Ouyang, S [1 ]
Buell, CR [1 ]
机构
[1] Inst Genome Res, Rockville, MD 20850 USA
关键词
D O I
10.1093/nar/gkh099
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In a number of higher plants, a substantial portion of the genome is composed of repetitive sequences that can hinder genome annotation and sequencing efforts. To better understand the nature of repetitive sequences in plants and provide a resource for identifying such sequences, we constructed databases of repetitive sequences for 12 plant genera: Arabidopsis, Brassica, Glycine, Hordeum, Lotus, Lycopersicon, Medicago, Oryza, Solanum, Sorghum, Triticum and Zea (www.tigr.org/tdb/e2kl /plant. repeats/index.shtml). The repetitive sequences within each database have been coded into super-classes, classes and sub-classes based on sequence and structure similarity. These databases are available for sequence similarity searches as well as downloadable files either as entire databases or subsets of each database. To further the utility for comparative studies and to provide a resource for searching for repetitive sequences in other genera within these families, repetitive sequences have been combined into four databases to represent the Brassicaceae, Fabaceae, Gramineae and Solanaceae families. Collectively, these databases provide a resource for the identification, classification and analysis of repetitive sequences in plants.
引用
收藏
页码:D360 / D363
页数:4
相关论文
共 17 条
[1]  
Arumuganathan K, 1991, PLANT MOL BIOL REP, V9, P208, DOI [DOI 10.1007/BF02672069, 10.1007/BF02672069]
[2]   The maize genome sequencing project [J].
Chandler, VL ;
Brendel, V .
PLANT PHYSIOLOGY, 2002, 130 (04) :1594-1597
[3]   Genetic definition and sequence analysis of Arabidopsis centromeres [J].
Copenhaver, GP ;
Nickel, K ;
Kuromori, T ;
Benito, MI ;
Kaul, S ;
Lin, XY ;
Bevan, M ;
Murphy, G ;
Harris, B ;
Parnell, LD ;
McCombie, WR ;
Martienssen, RA ;
Marra, M ;
Preuss, D .
SCIENCE, 1999, 286 (5449) :2468-2474
[4]  
DESHPANDE VG, 1988, H-S Z PHYSIOL CHEM, V361, P1223
[5]   Sequence and analysis of rice chromosome 4 [J].
Feng, Q ;
Zhang, YJ ;
Hao, P ;
Wang, SY ;
Fu, G ;
Huang, YC ;
Li, Y ;
Zhu, JJ ;
Liu, YL ;
Hu, X ;
Jia, PX ;
Zhang, Y ;
Zhao, Q ;
Ying, K ;
Yu, SL ;
Tang, YS ;
Weng, QJ ;
Zhang, L ;
Lu, Y ;
Mu, J ;
Lu, YQ ;
Zhang, LS ;
Yu, Z ;
Fan, DL ;
Liu, XH ;
Lu, TT ;
Li, C ;
Wu, YR ;
Sun, TG ;
Lei, HY ;
Li, T ;
Hu, H ;
Guan, JP ;
Wu, M ;
Zhang, RQ ;
Zhou, B ;
Chen, ZH ;
Chen, L ;
Jin, ZQ ;
Wang, R ;
Yin, HF ;
Cai, Z ;
Ren, SX ;
Lv, G ;
Gu, WY ;
Zhu, GF ;
Tu, YF ;
Jia, J ;
Zhang, Y ;
Chen, J .
NATURE, 2002, 420 (6913) :316-320
[6]   Plant transposable elements: Where genetics meets genomics [J].
Feschotte, C ;
Jiang, N ;
Wessler, SR .
NATURE REVIEWS GENETICS, 2002, 3 (05) :329-341
[7]   GENOME SIZE AND PROPORTION OF REPEATED NUCLEOTIDE-SEQUENCE DNA IN PLANTS [J].
FLAVELL, RB ;
BENNETT, MD ;
SMITH, JB ;
SMITH, DB .
BIOCHEMICAL GENETICS, 1974, 12 (04) :257-269
[8]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[9]   MOLECULAR MAPPING OF RICE CHROMOSOMES [J].
MCCOUCH, SR ;
KOCHERT, G ;
YU, ZH ;
WANG, ZY ;
KHUSH, GS ;
COFFMAN, WR ;
TANKSLEY, SD .
THEORETICAL AND APPLIED GENETICS, 1988, 76 (06) :815-829
[10]  
MCKNIGHT TD, 1997, BIOCHEMISTRY-MOSCOW, V62, P1432