Large scale distributed data repository: design of a molecular dynamics trajectory database

Cited by: 12
Authors
Feig, M
Abdullah, M
Johnsson, L
Pettitt, BM
Affiliations
[1] Univ Houston, Dept Chem, Houston, TX 77204 USA
[2] Univ Houston, Inst Mol Design, Houston, TX 77204 USA
[3] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
Keywords
distributed database; scientific data archive; data analysis
DOI
10.1016/S0167-739X(99)00039-4
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
The design of a molecular dynamics trajectory database is presented as an example of the organization of large-scale dynamic distributed repositories for scientific data. Large scientific datasets are usually interpreted through reduced data calculated by analysis functions. This allows a database architecture in which the analyzed datasets, which are kept in addition to the raw datasets, are transferred to a database user. A flexible user interface with a well-defined Application Program Interface (API), which allows for a wide array of analysis functions and the incorporation of user-defined functions, is a critical part of the database design. An analysis function is executed only when the requested analysis result is not available from an earlier request. A prototype implementation used to gain initial practical experience with performance and scalability is presented. © 1999 Elsevier Science B.V. All rights reserved.
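The abstract's rule that an analysis function runs only when its result is not already available from an earlier request can be illustrated with a small result cache. This is a minimal sketch, not the paper's implementation: the class `AnalysisStore`, its methods `register` and `request`, the example function `mean_value`, and the trajectory identifier `traj-1` are all hypothetical names introduced here, and a list of floats stands in for real trajectory frames.

```python
from typing import Callable, Dict, Hashable, Sequence, Tuple

# Hypothetical stand-ins: a "trajectory" is just a sequence of floats here.
Trajectory = Sequence[float]
AnalysisFn = Callable[..., float]
ResultKey = Tuple[str, str, Tuple[Tuple[str, Hashable], ...]]


class AnalysisStore:
    """Keeps raw datasets plus previously computed (reduced) analysis results."""

    def __init__(self, trajectories: Dict[str, Trajectory]) -> None:
        self._raw = dict(trajectories)               # raw datasets
        self._functions: Dict[str, AnalysisFn] = {}  # registered analysis functions
        self._results: Dict[ResultKey, float] = {}   # analyzed datasets
        self.executions = 0                          # how often a function actually ran

    def register(self, name: str, fn: AnalysisFn) -> None:
        """Add a built-in or user-defined analysis function under a name."""
        self._functions[name] = fn

    def request(self, traj_id: str, name: str, **params: Hashable) -> float:
        """Return a stored result if one exists; otherwise compute and store it."""
        key = (traj_id, name, tuple(sorted(params.items())))
        if key not in self._results:
            fn = self._functions[name]
            self._results[key] = fn(self._raw[traj_id], **params)
            self.executions += 1
        return self._results[key]


def mean_value(frames: Trajectory, start: int = 0) -> float:
    """Toy user-defined analysis: mean of the frames from index `start` on."""
    selected = frames[start:]
    return sum(selected) / len(selected)


store = AnalysisStore({"traj-1": [1.0, 2.0, 3.0, 4.0]})
store.register("mean", mean_value)

first = store.request("traj-1", "mean", start=2)   # computed on this call
second = store.request("traj-1", "mean", start=2)  # served from stored results
```

Keying the stored result on the dataset, the function name, and the parameters means that a request with different parameters is treated as a new analysis, while an identical repeat request only transfers the reduced data already on hand.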
Pages: 101-110
Page count: 10