Identification of novel families of membrane proteins from the model plant Arabidopsis thaliana

被引:86
作者
Ward, JM [1 ]
机构
[1] Univ Minnesota, Dept Plant Biol, St Paul, MN USA
关键词
D O I
10.1093/bioinformatics/17.6.560
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The completion of the Arabidopsis genome offers the first opportunity to analyze all of the membrane protein sequences of a plant. The majority of integral membrane proteins including transporters, channels, and pumps contain hydrophobic alpha -helices and can be selected based on TransMembrane Spanning (TMS) domain prediction. By clustering the predicted membrane proteins based on sequence, it is possible to sort the membrane proteins into families of known function, based on experimental evidence or homology, or unknown function. This provides a way to identify target sequences for future functional analysis. Results: An automated approach was used to select potential membrane protein sequences from the set of all predicted proteins and cluster the sequences into related families. The recently completed sequence of Arabidopsis thaliana, a model plant, was analyzed. Of the 25 470 predicted protein sequences 4589 (18%) were identified as containing two or more membrane spanning domains. The membrane protein sequences clustered into 628 distinct families containing 3208 sequences. Of these, 211 families (1764 sequences) either contained proteins of known function or showed homology to proteins of known function in other species. However, 417 families (1444 sequences) contained only sequences with no known function and no homology to proteins of known function. In addition, 1381 sequences did not cluster with any family and no function could be assigned to 1337 of these.
引用
收藏
页码:560 / 563
页数:4
相关论文
共 14 条
[1]   An overview of membrane transport proteins in Saccharomyces cerevisiae [J].
Andre, B .
YEAST, 1995, 11 (16) :1575-1611
[2]   GeneRAGE: a robust algorithm for sequence clustering and domain detection [J].
Enright, AJ ;
Ouzounis, CA .
BIOINFORMATICS, 2000, 16 (05) :451-457
[3]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[4]  
HIRSCHBERG DS, 1977, J ACM, V24, P644
[5]   A SIMPLE METHOD FOR DISPLAYING THE HYDROPATHIC CHARACTER OF A PROTEIN [J].
KYTE, J ;
DOOLITTLE, RF .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 157 (01) :105-132
[6]   OPTIMAL ALIGNMENTS IN LINEAR-SPACE [J].
MYERS, EW ;
MILLER, W .
COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1988, 4 (01) :11-17
[7]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+
[8]   Classification of all putative permeases and other membrane plurispanners of the major facilitator superfamily encoded by the complete genome of Saccharomyces cerevisiae [J].
Nelissen, B ;
DeWachter, R ;
Goffeau, A .
FEMS MICROBIOLOGY REVIEWS, 1997, 21 (02) :113-134
[9]   Consensus predictions of membrane protein topology [J].
Nilsson, J ;
Persson, B ;
von Heijne, G .
FEBS LETTERS, 2000, 486 (03) :267-269
[10]   Comparative genomics of plant chromosomes [J].
Paterson, AH ;
Bowers, JE ;
Burow, MD ;
Draye, X ;
Elsik, CG ;
Jiang, CX ;
Katsar, CS ;
Lan, TH ;
Lin, YR ;
Ming, RG ;
Wright, RJ .
PLANT CELL, 2000, 12 (09) :1523-1539