HUGE: a database for human KIAA proteins, a 2004 update integrating HUGEppi and ROUGE

Cited by: 68
Authors
Kikuno, R
Nagase, T
Nakayama, M
Koga, H
Okazaki, N
Nakajima, D
Ohara, O
Affiliations
[1] Kazusa DNA Res Inst, Chiba 2920818, Japan
[2] Chiba Ind Advancement Ctr, Mihama Ku, Chiba 2617126, Japan
[3] RIKEN Res Ctr Allergy & Immunol, Tsurumi Ku, Kanagawa 2300045, Japan
DOI
10.1093/nar/gkh035
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
We have been developing a Human Unidentified Gene-Encoded (HUGE) protein database (http://www.kazusa.or.jp/huge) to summarize results from sequence analysis of human novel large (>4 kb) cDNAs identified in the Kazusa cDNA sequencing project. At present, HUGE contains 2031 cDNA entries (KIAA cDNAs), for each of which a gene/protein characteristic table has been prepared. Since we have been shifting our research attention from the identification and cloning of novel cDNAs to the functional analysis of the proteins encoded by these cDNAs (KIAA proteins), we have not substantially increased the number of cDNA entries in HUGE for some time. Instead, we have manually curated 451 KIAA cDNAs in order to prepare a set of genetic resources to facilitate the functional analysis of KIAA proteins. In addition, we have updated the contents of the corresponding gene/protein characteristic tables in HUGE and have constructed two subsidiary databases, HUGEppi (http://www.kazusa.or.jp/huge/ppi) and ROUGE (http://www.kazusa.or.jp/rouge), to make available the results from our study of KIAA protein function. HUGEppi shows detailed information on protein-protein interactions detected between 84 pairs of KIAA proteins by yeast two-hybrid screening. ROUGE summarizes the results of computer-assisted analyses of ~1000 mouse homologues of human large cDNAs that we identified.
Pages: D502-D504
Number of pages: 3