Inferring function using patterns of native disorder in proteins
Cited by: 101
Authors: Lobley, Anna; Swindells, Mark B.; Orengo, Christine A.; Jones, David T.
Affiliations:
[1] UCL, Dept Comp Sci, Bioinformat Unit, London, England
[2] Inpharm, London, England
[3] UCL, Dept Biochem, Biocomp Grp, London, England
DOI: 10.1371/journal.pcbi.0030162
Chinese Library Classification: Q5 [Biochemistry]
Discipline classification codes: 071010; 081704
Abstract:
Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein function. In this study, we directly address the potential contribution of protein disorder in predicting protein function using standard Gene Ontology (GO) categories. Initially we analyse the occurrence of protein disorder in the human proteome and report ontology categories that are enriched in disordered proteins. Pattern analysis of the distributions of disordered regions in human sequences demonstrated that the functions of intrinsically disordered proteins are both length- and position-dependent. These dependencies were then encoded in feature vectors to quantify the contribution of disorder in human protein function prediction using Support Vector Machine classifiers. The prediction accuracies of 26 GO categories relating to signalling and molecular recognition are improved using the disorder features. The most significant improvements were observed for kinase, phosphorylation, growth factor, and helicase categories. Furthermore, we provide predicted GO term assignments using these classifiers for a set of unannotated and orphan human proteins. In this study, the importance of capturing protein disorder information and its value in function prediction is demonstrated. The GO category classifiers generated can be used to provide more reliable predictions and further insights into the behaviour of orphan and unannotated proteins.
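The abstract says that length- and position-dependence of disordered regions was encoded in feature vectors, but it does not give the encoding. The sketch below is one possible illustration, not the authors' method: it counts disordered regions per (length bin, position bin) cell and flattens the counts into a vector. The bin edges, the three-way position split, and the names `disordered_regions` and `disorder_features` are all assumptions made for this example.

```python
from itertools import groupby

# Hypothetical bins; the abstract does not state the ones actually used.
LENGTH_BINS = [(1, 10), (11, 30), (31, None)]  # short, medium, long regions
N_POSITION_BINS = 3                             # N-terminal, middle, C-terminal thirds


def disordered_regions(mask):
    """Return (start, end) pairs, end exclusive, for runs of disordered residues.

    `mask` is a list of booleans, one per residue (True = predicted disordered).
    """
    regions, pos = [], 0
    for is_disordered, run in groupby(mask):
        length = len(list(run))
        if is_disordered:
            regions.append((pos, pos + length))
        pos += length
    return regions


def disorder_features(mask):
    """Count regions per (length bin, position bin) and flatten row by row."""
    n = len(mask)
    counts = [[0] * N_POSITION_BINS for _ in LENGTH_BINS]
    for start, end in disordered_regions(mask):
        length = end - start
        midpoint = (start + end - 1) / 2
        # Position bin is chosen by where the region's midpoint falls in the sequence.
        pos_bin = min(int(midpoint / n * N_POSITION_BINS), N_POSITION_BINS - 1)
        for i, (lo, hi) in enumerate(LENGTH_BINS):
            if length >= lo and (hi is None or length <= hi):
                counts[i][pos_bin] += 1
                break
    return [c for row in counts for c in row]
```

A vector of this kind could be concatenated with other sequence features and passed to any standard Support Vector Machine implementation, with one binary classifier per GO category as the abstract describes.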
Pages: 1567-1579
Page count: 13
Related papers (59 in total):
[1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J]. Altschul, SF; Madden, TL; Schaffer, AA; Zhang, JH; Zhang, Z; Miller, W; Lipman, DJ. Nucleic Acids Research, 1997, 25 (17): 3389-3402.
Affiliation: NHGRI, Lab Genet Dis Res, NIH, Bethesda, MD 20892 USA
[2] Gene Ontology: tool for the unification of biology [J]. Ashburner, M; Ball, CA; Blake, JA; Botstein, D; Butler, H; Cherry, JM; Davis, AP; Dolinski, K; Dwight, SS; Eppig, JT; Harris, MA; Hill, DP; Issel-Tarver, L; Kasarskis, A; Lewis, S; Matese, JC; Richardson, JE; Ringwald, M; Rubin, GM; Sherlock, G. Nature Genetics, 2000, 25 (1): 25-29.
Affiliation: Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
[3] Comparative genomics and disorder prediction identify biologically relevant SH3 protein interactions [J]. Beltrao, P; Serrano, L. PLoS Computational Biology, 2005, 1 (3): 202-211.
Affiliation: EMBL Struct & Computat Biol, Heidelberg, Germany
[4] Improved prediction of signal peptides: SignalP 3.0 [J]. Bendtsen, JD; Nielsen, H; von Heijne, G; Brunak, S. Journal of Molecular Biology, 2004, 340 (4): 783-795.
Affiliation: Tech Univ Denmark, Bioctr, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[5] Structural principles for the multispecificity of small GTP-binding proteins [J]. Biou, V; Cherfils, J. Biochemistry, 2004, 43 (22): 6833-6840.
Affiliation: CNRS, Lab Enzymol & Biochim Struct, F-91198 Gif Sur Yvette, France
[6] Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence [J]. Blom, N; Sicheritz-Pontén, T; Gupta, R; Gammeltoft, S; Brunak, S. Proteomics, 2004, 4 (6): 1633-1649.
Affiliation: Tech Univ Denmark, Bioctr DTU, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[7] The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology [J]. Camon, E; Magrane, M; Barrell, D; Lee, V; Dimmer, E; Maslen, J; Binns, D; Harte, N; Lopez, R; Apweiler, R. Nucleic Acids Research, 2004, 32: D262-D266.
Affiliation: EBI, Cambridge CB10 1SD, England
[8] Conservation of intrinsic disorder in protein domains and families: II. Functions of conserved disorder [J]. Chen, JW; Romero, P; Uversky, VN; Dunker, AK. Journal of Proteome Research, 2006, 5 (4): 888-898.
Affiliation: Indiana Univ, Sch Med, Dept Biochem & Mol Biol, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[9] Abundance of intrinsic disorder in protein associated with cardiovascular disease [J]. Cheng, Yugong; LeGall, Tanguy; Oldfield, Christopher J.; Dunker, A. Keith; Uversky, Vladimir N. Biochemistry, 2006, 45 (35): 10448-10460.
Affiliation: Indiana Univ, Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[10] Predicting enzyme class from protein structure without alignments [J]. Dobson, PD; Doig, AJ. Journal of Molecular Biology, 2005, 345 (1): 187-199.
Affiliation: Univ Manchester, Dept Biomol Sci, Manchester M60 1QD, Lancs, England