Thesaurus as a complex network

Times cited: 43
Authors
Holanda, AD [1 ]
Pisa, IT [1 ]
Kinouchi, O [1 ]
Martinez, AS [1 ]
Ruiz, EES [1 ]
Affiliations
[1] Univ Sao Paulo, Fac Filosofia Ciencias & Letras Ribeirao Pret, BR-14040901 Ribeirao Preto, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil;
Keywords
complex networks; directed graphs; thesaurus;
DOI
10.1016/j.physa.2004.06.025
Chinese Library Classification
O4 [Physics];
Discipline classification code
0702;
Abstract
A thesaurus is one of many possible representations of term (or word) connectivity. The terms of a thesaurus are seen as the nodes, and their relationships as the links, of a directed graph. The directionality of the links retains all the thesaurus information and allows the measurement of several quantities. This has led to a new term classification according to the characteristics of the nodes, for example, nodes with no links in, no links out, etc. Using an electronically available thesaurus, we have obtained the incoming and outgoing link distributions. While the incoming link distribution follows a stretched exponential function, the lower bound for the outgoing link distribution has the same envelope as the scientific paper citation distribution proposed by Tsallis and Albuquerque (Eur. Phys. J. B 13 (2000) 777). However, a better fit is obtained by a simpler function, which is the solution of Riccati's differential equation. We conjecture that this differential equation is the continuous limit of a stochastic growth model of the thesaurus network. We also propose a new manner to arrange a thesaurus using the "inversion method". (C) 2004 Elsevier B.V. All rights reserved.
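The directed-graph reading described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the toy thesaurus, the function names, and the class labels `no_links_in` and `no_links_out` are hypothetical, chosen only to show how incoming and outgoing link counts and the node classes mentioned above could be tabulated.

```python
from collections import Counter

# Hypothetical toy thesaurus (not data from the paper):
# each entry term points to the terms listed under it.
toy_thesaurus = {
    "big": ["large", "huge"],
    "large": ["big", "huge"],
    "huge": ["big"],
    "tiny": ["small"],
}


def degree_tables(thesaurus):
    """Return (nodes, in_degree, out_degree) for the directed graph
    with one link from each entry term to each term listed under it."""
    nodes = set(thesaurus)
    in_deg = Counter()
    out_deg = Counter()
    for term, related in thesaurus.items():
        targets = set(related)          # ignore duplicate listings
        nodes |= targets
        out_deg[term] = len(targets)
        for target in targets:
            in_deg[target] += 1
    return nodes, in_deg, out_deg


def classify(nodes, in_deg, out_deg):
    """Group nodes by the kind of classes the abstract mentions."""
    return {
        "no_links_in": sorted(n for n in nodes if in_deg[n] == 0),
        "no_links_out": sorted(n for n in nodes if out_deg[n] == 0),
    }


def degree_histogram(nodes, deg):
    """Count how many nodes have each degree value."""
    return Counter(deg[n] for n in nodes)


nodes, in_deg, out_deg = degree_tables(toy_thesaurus)
classes = classify(nodes, in_deg, out_deg)
in_hist = degree_histogram(nodes, in_deg)
out_hist = degree_histogram(nodes, out_deg)
```

In this toy example "tiny" is listed under no other term and "small" is not itself an entry, so they fall into the two classes respectively. The histograms are the raw material for the incoming and outgoing link distributions; fitting a functional form to them is a separate step not sketched here.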
Pages: 530-536
Page count: 7
Related papers
20 in total
[1]  
[Anonymous], 1997, ELEMENTARY DIFFERENT
[2]  
[Anonymous], 1949, Human behaviour and the principle of least-effort
[3]  
BROUERS F, 2003, COMMUNICATION NOV
[4]   Giant strongly connected component of directed networks [J].
Dorogovtsev, SN ;
Mendes, JFF ;
Samukhin, AN .
PHYSICAL REVIEW E, 2001, 64 (02) :4
[5]  
GRIFFITHS TL, 2004, PROBABILISTIC APPROA
[6]   Deterministic walks in random networks: an application to thesaurus graphs [J].
Kinouchi, O ;
Martinez, AS ;
Lima, GF ;
Lourenço, GM ;
Risau-Gusman, S .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2002, 315 (3-4) :665-676
[7]   The potential of latent semantic analysis for machine grading of clinical case summaries [J].
Kintsch, W .
JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (01) :3-7
[8]   Stretched exponential distributions in nature and economy: "fat tails" with characteristic scales [J].
Laherrere, J ;
Sornette, D .
EUROPEAN PHYSICAL JOURNAL B, 1998, 2 (04) :525-539
[9]   A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge [J].
Landauer, TK ;
Dumais, ST .
PSYCHOLOGICAL REVIEW, 1997, 104 (02) :211-240
[10]   Deterministic walks in random media [J].
Lima, GF ;
Martinez, AS ;
Kinouchi, O .
PHYSICAL REVIEW LETTERS, 2001, 87 (01) :1-010603