ALLOCATING DATA AND OPERATIONS TO NODES IN DISTRIBUTED DATABASE DESIGN

Cited by: 46
Authors
MARCH, ST
RHO, S
Affiliation
[1] Information and Decision Science, Carlson School of Management, University of Minnesota, Minneapolis, MN 55455
Keywords
DISTRIBUTED DATABASE DESIGN; DATA PARTITIONING AND REPLICATION; DATA ALLOCATION; OPERATION ALLOCATION; PERFORMANCE MODELING AND ANALYSIS; GENETIC ALGORITHM;
DOI
10.1109/69.382299
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The allocation of data and operations to nodes in a computer communications network is a critical issue in distributed database design. An efficient distributed database design must trade off performance and cost among retrieval and update activities at the various nodes. It must consider the concurrency control mechanism used as well as capacity constraints at nodes and on links in the network. It must determine where data will be allocated, the degree of data replication, which copy of the data will be used for each retrieval activity, and where operations such as select, project, join, and union will be performed. We develop a comprehensive mathematical modeling approach for this problem. The approach first generates units of data (file fragments) to be allocated from a logical data model representation and a characterization of retrieval and update activities. Retrieval and update activities are then decomposed into relational operations on these fragments. Both fragments and operations on them are then allocated to nodes using a mathematical modeling approach. The mathematical model considers network communication, local processing, and data storage costs. A genetic algorithm is developed to solve this mathematical formulation.
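The idea of searching for an allocation with a genetic algorithm can be illustrated with a small sketch. It is not the authors' formulation: it assumes a much simpler, hypothetical cost model with no replication, no operation allocation, no capacity constraints, and no concurrency control. All names and numbers (`ACCESS`, `FRAGMENT_SIZE`, `STORAGE_COST`, `LINK_COST`, `genetic_allocate`) are made up for illustration.

```python
import random

# Hypothetical toy instance: 4 fragments, 3 nodes (all numbers are invented).
# ACCESS[f][n] = how often node n reads fragment f per unit time.
ACCESS = [
    [10, 0, 2],
    [0, 8, 1],
    [3, 3, 3],
    [0, 1, 9],
]
FRAGMENT_SIZE = [5, 2, 4, 3]     # units shipped per remote read
STORAGE_COST = [1.0, 1.5, 0.5]   # cost per unit stored at each node
LINK_COST = 2.0                  # cost per unit shipped between distinct nodes
NUM_NODES = len(STORAGE_COST)


def cost(allocation):
    """Storage plus communication cost; allocation[f] is the node holding fragment f."""
    total = 0.0
    for f, home in enumerate(allocation):
        total += FRAGMENT_SIZE[f] * STORAGE_COST[home]
        for n, freq in enumerate(ACCESS[f]):
            if n != home:  # remote reads pay the link cost
                total += freq * FRAGMENT_SIZE[f] * LINK_COST
    return total


def genetic_allocate(generations=50, pop_size=20, mutation_rate=0.1, seed=0):
    """Return (best_allocation, best_cost) found by a simple elitist GA."""
    rng = random.Random(seed)
    num_frags = len(ACCESS)
    population = [[rng.randrange(NUM_NODES) for _ in range(num_frags)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=cost)
        survivors = population[:pop_size // 2]  # keep the cheaper half
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            cut = rng.randrange(1, num_frags)   # one-point crossover
            child = a[:cut] + b[cut:]
            for i in range(num_frags):          # per-gene mutation
                if rng.random() < mutation_rate:
                    child[i] = rng.randrange(NUM_NODES)
            children.append(child)
        population = survivors + children
    best = min(population, key=cost)
    return best, cost(best)


if __name__ == "__main__":
    allocation, total = genetic_allocate()
    print("fragment -> node:", allocation, "cost:", total)
```

Because the cheaper half of each generation survives unchanged, the best cost never gets worse from one generation to the next. A GA of this kind is a heuristic, so it is not guaranteed to return the optimal allocation.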
Pages: 305-317
Page count: 13
Related papers
30 in total
[1]   DATA ALLOCATION IN DISTRIBUTED DATABASE-SYSTEMS [J].
APERS, PMG .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1988, 13 (03) :263-304
[2]   AN ALGORITHM FOR CONCURRENCY-CONTROL AND RECOVERY IN REPLICATED DISTRIBUTED DATABASES [J].
BERNSTEIN, PA ;
GOODMAN, N .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1984, 9 (04) :596-615
[3]  
BLANKINSHIP R, 1991, 17 INT C VER LARG DA, P389
[4]  
CARLIS JV, 1983, COMPUT PERFORM, V4, P198
[5]  
CARLIS JV, 1984, P INT C DATA ENG, P24
[6]  
Ceri S., 1984, DISTRIBUTED DATABASE
[7]   IMPLICATIONS OF CERTAIN ASSUMPTIONS IN DATABASE PERFORMANCE EVALUATION [J].
CHRISTODOULAKIS, S .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 1984, 9 (02) :163-186
[8]   ON OPTIMAL SITE ASSIGNMENT FOR RELATIONS IN THE DISTRIBUTED DATABASE ENVIRONMENT [J].
CORNELL, DW ;
YU, PS .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1989, 15 (08) :1004-1009
[9]  
GARDY D, 1989, ACM T DATABASE SYST, V14, P574, DOI 10.1145/76902.76907
[10]  
Goldberg DE, 1989, GENETIC ALGORITHMS S