Expansion of the Protein Repertoire in Newly Explored Environments: Human Gut Microbiome Specific Protein Families

被引:46
作者
Ellrott, Kyle [1 ]
Jaroszewski, Lukasz [2 ]
Li, Weizhong [3 ]
Wooley, John C. [1 ,2 ,3 ]
Godzik, Adam [1 ,2 ,3 ,4 ]
机构
[1] Univ Calif San Diego, Joint Ctr Struct Genom, La Jolla, CA 92093 USA
[2] Burnham Inst Med Res, Joint Ctr Struct Genom, La Jolla, CA USA
[3] Univ Calif San Diego, Calif Inst Telecommun & Informat Technol, La Jolla, CA 92093 USA
[4] Burnham Inst Med Res, Joint Ctr Mol Modeling, La Jolla, CA USA
关键词
DATABASE; METAGENOMICS; SETS;
D O I
10.1371/journal.pcbi.1000798
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The microbes that inhabit particular environments must be able to perform molecular functions that provide them with a competitive advantage to thrive in those environments. As most molecular functions are performed by proteins and are conserved between related proteins, we can expect that organisms successful in a given environmental niche would contain protein families that are specific for functions that are important in that environment. For instance, the human gut is rich in polysaccharides from the diet or secreted by the host, and is dominated by Bacteroides, whose genomes contain highly expanded repertoire of protein families involved in carbohydrate metabolism. To identify other protein families that are specific to this environment, we investigated the distribution of protein families in the currently available human gut genomic and metagenomic data. Using an automated procedure, we identified a group of protein families strongly overrepresented in the human gut. These not only include many families described previously but also, interestingly, a large group of previously unrecognized protein families, which suggests that we still have much to discover about this environment. The identification and analysis of these families could provide us with new information about an environment critical to our health and well being.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 27 条
[1]  
Altschul SF, 1998, PROTEINS, V32, P88, DOI 10.1002/(SICI)1097-0134(19980701)32:1<88::AID-PROT10>3.0.CO
[2]  
2-J
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[5]   Extensive DNA inversions in the B. fragilis genome control variable gene expression [J].
Cerdeño-Tárraga, AM ;
Patrick, S ;
Crossman, LC ;
Blakely, G ;
Abratt, V ;
Lennard, N ;
Poxton, I ;
Duerden, B ;
Harris, B ;
Quail, MA ;
Barron, A ;
Clark, L ;
Corton, C ;
Doggett, J ;
Holden, MTG ;
Larke, N ;
Line, A ;
Lord, A ;
Norbertczak, H ;
Ormond, D ;
Price, C ;
Rabbinowitsch, E ;
Woodward, J ;
Barrell, B ;
Parkhill, J .
SCIENCE, 2005, 307 (5714) :1463-1465
[6]   Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB [J].
DeSantis, T. Z. ;
Hugenholtz, P. ;
Larsen, N. ;
Rojas, M. ;
Brodie, E. L. ;
Keller, K. ;
Huber, T. ;
Dalevi, D. ;
Hu, P. ;
Andersen, G. L. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2006, 72 (07) :5069-5072
[7]   PSI-2: Structural Genomics to Cover Protein Domain Family Space [J].
Dessailly, Benoit H. ;
Nair, Rajesh ;
Jaroszewski, Lukasz ;
Fajardo, J. Eduardo ;
Kouranov, Andrei ;
Lee, David ;
Fiser, Andras ;
Godzik, Adam ;
Rost, Burkhard ;
Orengo, Christine .
STRUCTURE, 2009, 17 (06) :869-881
[8]   Diversity of the human intestinal microbial flora [J].
Eckburg, PB ;
Bik, EM ;
Bernstein, CN ;
Purdom, E ;
Dethlefsen, L ;
Sargent, M ;
Gill, SR ;
Nelson, KE ;
Relman, DA .
SCIENCE, 2005, 308 (5728) :1635-1638
[9]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365
[10]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584