Organizing open archives via lightweight ontologies to facilitate the use of heterogeneous collections

被引:5
作者
Alfredo Sanchez, J. [1 ]
Auxilio Medina, Maria [2 ]
Starostenko, Oleg [3 ]
Benitez, Antonio [2 ]
Lopez Dominguez, Eduardo [2 ]
机构
[1] Univ Americas Puebla, Cholula, Mexico
[2] Univ Politecn Puebla, Cholula, Mexico
[3] Univ Americas Puebla, Cholula, Mexico
来源
ASLIB PROCEEDINGS | 2012年 / 64卷 / 01期
关键词
Information integration; Ontologies; Open archives; Distributed collections; Clustering; Information management; archives;
D O I
10.1108/00012531211196701
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - This paper seeks to focus on the problems of integrating information from open, distributed scholarly collections, and on the opportunities these collections represent for research communities in developing countries. The paper aims to introduce OntOAIr, a semi-automatic method for constructing lightweight ontologies of documents in repositories such as those provided by the Open Archives Initiative (OAI). Design/methodology/approach - OntOAIr uses simplified document representations, a clustering algorithm, and ontological engineering techniques. Findings - The paper presents experimental results of the potential positive impact of ontologies and specifically of OntOAIr on the use of collections provided by OAT. Research limitations/implications - By applying OntOAIr, scholars who frequently spend many hours organizing OAI information spaces will obtain support that will allow them to speed up the entire research cycle and, expectedly, participate more fully in global research communities. Originality/value - The proposed method allows human and software agents to organize and retrieve groups of documents from multiple collections. Applications of OntOAIr include enhanced document retrieval. In this paper, the authors focus particularly on document retrieval applications.
引用
收藏
页码:46 / 66
页数:21
相关论文
共 32 条
[1]  
Aitken S., 2000, WORKSH APPL ONT PROB, P34
[2]  
[Anonymous], 2005, CLASSIFICATION CLUST
[3]  
Berry W.M., 2010, SURVEY TEXT MINING
[4]  
Borst WN., 1997, Construction of Engineering Ontologies for Knowledge Sharing and Reuse
[5]  
Brase J, 2003, IN HAND I S, P555
[6]  
Cui Z., 2000, P 33 HAW INT C SYST, P8
[7]  
Diederich J, 2007, LECT NOTES COMPUT SC, V4675, P1
[8]  
doccluster, 2007, CLUST C LIB DOC
[9]  
Fung B.C.M., 2005, P 3 SIAM INT C DAT M, P59
[10]  
Fung B.C.M., 2006, ENCY DATA WAREHOUSIN, VI, P555