An integrated metadata access infrastructure for a network of federated curated data repositories

被引:3
作者
Adeleke, Oluwalani [1 ]
Otoo, E.J. [1 ]
机构
[1] School of Computer Science, University of Witwatersrand, Johannesburg
来源
OCLC Systems and Services | 2014年 / 30卷 / 02期
关键词
Database management; Federated data repositories; Metadata access; Metadata access interface; Metadata dissemination; Metadata service;
D O I
10.1108/OCLC-08-2013-0032
中图分类号
学科分类号
摘要
Purpose – This paper aims to study integrated metadata access infrastructure for a network of federated curated data repositories. With the increase in collaborative initiatives among diverse scientific discipline, infrastructure should be in place to facilitate effective information sharing. Scientific data sharing involves provisioning, curation and dissemination of information about the various datasets for discovery and access by other peers, which is achieved using metadata services. The heterogeneous nature of various distributed dataset repositories has resulted in the use of heterogeneous metadata services. This poses some challenges in efficient dataset sharing and information retrieval. To allow for universal accessibility of these autonomous curated data repositories, it is important to establish cross-integration on these data repositories for information sharing.; Design/methodology/approach – The authors address this problem through provisioning of a universal metadata interface design that can be integrated with some popular metadata services such as integrated rule-oriented data system (iRODS), OpenDap/THREDDS and MERCURY. Given a network of federated heterogeneous distributed metadata services over autonomous curated data repositories, the authors present an implementation of a universal interface system that can probe and query different metadata databases to access the essential information provided for data discovery and enable data migration.; Findings – The authors present the architecture that integrates and allows communication between our interface and the various autonomous data repositories. The authors show how they can integrate their system with THREDDS and iRODS to accomplish data discovery and access operations without altering the implementations of the metadata services at their remote locations.; Originality/value – Their system provides unique architecture for information discovery and metadata searches which employs the application programming interfaces for the respective metadata services and communicates using the Zero C Internet communication engine (ICE) protocol. © Emerald Group Publishing Limited.
引用
收藏
页码:91 / 107
页数:16
相关论文
共 16 条
[1]  
Apps Ann, Guidelines for encoding bibliographic citation information in dublin core, metadata, (2005)
[2]  
Cornillon P., Gallagher J., Sgouros T., Opendap: Accessing data in a distributed, heterogeneous environment, Data Science Journal, 2, pp. 164-174, (2003)
[3]  
Deelman E., Singh G., Atkinson M., Chervenak A., Chue Hong N., Kesselman C., Patil S., Pearlma L., Su M.H., Grid-based metadata services, Proceedings of the 16th International Conference on Scientific and Statistical Database Management, pp. 393-402, (2004)
[4]  
DICE, Fact Sheet: IRODS Integrated Rule Oriented Data System Open Source Data Grid, Helping People Organize and Manage Large Collections of Distributed Digital Data, (2009)
[5]  
ESRI, Metadata and GIS: An Esri® white paper, (2002)
[6]  
FGDC, Federal geographic data committee, (2012)
[7]  
Fletcher I., Federated search: The options, Search Technologies Corp, (2013)
[8]  
Gallagher J., OPeNDAP/Hyrax interfaces, ocean observatories initiative (OOI) workshop UCSD, (2008)
[9]  
Jonathan D., Parallel I/O: Formats for scientific data management NetCDF4, HDF5, (2012)
[10]  
Law D., Arcgis for server101