Determining nucleolar association from sequence by leveraging protein-protein interactions

被引:4
作者
Boden, Mikael [1 ,2 ,3 ]
Teasdale, Rohan D. [1 ,2 ]
机构
[1] Univ Queensland, Inst Mol Biosci, St Lucia, Qld 4072, Australia
[2] Univ Queensland, ARC Ctr Excellence Bioinformat, St Lucia, Qld 4072, Australia
[3] Univ Queensland, Sch Informat Technol & Elect Engn, St Lucia, Qld, Australia
关键词
proteins; secondary structure; sequences;
D O I
10.1089/cmb.2007.0163
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Controlled intra-nuclear organization of proteins is critical for sustaining correct function of the cell. Proteins and RNA are transported by passive diffusion and associate with compartments by virtue of diverse molecular interactions-presenting a challenging problem for data-driven model building. An increasing inventory of proteins with known intra-nuclear destination and proliferation of molecular interaction data motivate an integrative method, leveraging the existing evidence to build accurate models of intranuclear trafficking. Kernel canonical correlation analysis (KCCA) enables the construction of predictors based on genomic sequence data, but leverages other knowledge sources during training. The approach specifically involves the induction of protein sequence features and relations most pertinent to the recovery of nucleolar associated protein-protein interactions. With success rates of about 78%, the classification of nucleolar association from KCCA-induced features surpasses that of baseline approaches. We observe that the coalescence of protein-protein interaction data with sequence data enhances the prediction of highly interconnected, key ribosomal and RNA-related nucleolar proteins. For supplementary material, see www.itee.uq.edu.au/pprowler/nucleoli.
引用
收藏
页码:291 / 304
页数:14
相关论文
共 28 条
  • [1] Akaho S., 2001, P IMPS
  • [2] The Biomolecular Interaction Network Database and related tools 2005 update
    Alfarano, C
    Andrade, CE
    Anthony, K
    Bahroos, N
    Bajec, M
    Bantoft, K
    Betel, D
    Bobechko, B
    Boutilier, K
    Burgess, E
    Buzadzija, K
    Cavero, R
    D'Abreo, C
    Donaldson, I
    Dorairajoo, D
    Dumontier, MJ
    Dumontier, MR
    Earles, V
    Farrall, R
    Feldman, H
    Garderman, E
    Gong, Y
    Gonzaga, R
    Grytsan, V
    Gryz, E
    Gu, V
    Haldorsen, E
    Halupa, A
    Haw, R
    Hrvojic, A
    Hurrell, L
    Isserlin, R
    Jack, F
    Juma, F
    Khan, A
    Kon, T
    Konopinsky, S
    Le, V
    Lee, E
    Ling, S
    Magidin, M
    Moniakis, J
    Montojo, J
    Moore, S
    Muskat, B
    Ng, I
    Paraiso, JP
    Parker, B
    Pintilie, G
    Pirone, R
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D418 - D424
  • [3] Nucleolar proteome dynamics
    Andersen, JS
    Lam, YW
    Leung, AKL
    Ong, SE
    Lyon, CE
    Lamond, AI
    Mann, M
    [J]. NATURE, 2005, 433 (7021) : 77 - 83
  • [4] [Anonymous], ADV NEURAL INFORM PR
  • [5] [Anonymous], 1998, Encyclopedia of Biostatistics
  • [6] Bach F.R., 2002, J MACHINE LEARNING R, V3, P1
  • [7] Addressing protein localization within the nucleus
    Bickmore, WA
    Sutherland, HGE
    [J]. EMBO JOURNAL, 2002, 21 (06) : 1248 - 1254
  • [8] The contribution of nuclear compartmentalization to gene regulation
    Carmo-Fonseca, M
    [J]. CELL, 2002, 108 (04) : 513 - 521
  • [9] Dynamic complex formation during the yeast cell cycle
    de Lichtenberg, U
    Jensen, LJ
    Brunak, S
    Bork, P
    [J]. SCIENCE, 2005, 307 (5710) : 724 - 727
  • [10] The Nuclear Protein Database (NPD): sub-nuclear localisation and functional annotation of the nuclear proteome
    Dellaire, G
    Farrall, R
    Bickmore, WA
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 328 - 330