Crystal fingerprint space - a novel paradigm for studying crystal-structure sets

被引:79
作者
Valle, Mario [1 ]
Oganov, Artem R. [2 ,3 ,4 ]
机构
[1] Swiss Natl Supercomp Ctr CSCS, Data Anal & Visualizat Grp, CH-6928 Manno, Switzerland
[2] SUNY Stony Brook, Dept Geosci, Dept Phys & Astron, Stony Brook, NY 11794 USA
[3] SUNY Stony Brook, New York Ctr Computat Sci, Stony Brook, NY 11794 USA
[4] Moscow MV Lomonosov State Univ, Dept Geol, Moscow 119992, Russia
来源
ACTA CRYSTALLOGRAPHICA SECTION A | 2010年 / 66卷
关键词
crystal fingerprints; USPEX; structure classification; fingerprint space; INTRINSIC DIMENSION; ALGORITHM;
D O I
10.1107/S0108767310026395
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The initial aim of the crystal fingerprint project was to solve a very specific problem: to classify and remove duplicate crystal structures from the results generated by the evolutionary crystal-structure predictor USPEX. These duplications decrease the genetic diversity of the population used by the evolutionary algorithm, potentially leading to stagnation and, after a certain time, reducing the likelihood of predicting essentially new structures. After solving the initial problem, the approach led to unexpected discoveries: unforeseen correlations, useful derived quantities and insight into the structure of the overall set of results. All of these were facilitated by the project's underlying idea: to transform the structure sets from the physical configuration space to an abstract, high-dimensional space called the fingerprint space. Here every structure is represented as a point whose coordinates (fingerprint) are computed from the crystal structure. Then the space's distance measure, interpreted as structure `closeness', enables grouping of structures into similarity classes. This model provides much flexibility and facilitates access to knowledge and algorithms from fields outside crystallography, e.g. pattern recognition and data mining. The current usage of the fingerprint-space model is revealing interesting properties that relate to chemical and crystallographic attributes of a structure set. For this reason, the mapping of structure sets to fingerprint space could become a new paradigm for studying crystal-structure ensembles and global chemical features of the energy landscape.
引用
收藏
页码:507 / 517
页数:11
相关论文
共 43 条
[1]  
Advanced Visual Systems, 2009, AVS EXPR
[2]  
[Anonymous], 2002, Principal components analysis
[3]  
[Anonymous], VISUALIZATION HDB
[4]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[5]  
[Anonymous], 2000, 5 ONL WORLD C SOFT C
[6]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[7]  
Borg I., 2005, Modern multidimensional scaling: theory and applications
[8]   CHARACTERIZING EXPERIMENTAL TIME-SERIES USING LOCAL INTRINSIC DIMENSION [J].
BUZUG, TM ;
VONSTAMM, J ;
PFISTER, G .
PHYSICS LETTERS A, 1995, 202 (2-3) :183-190
[9]   Data dimensionality estimation methods: a survey [J].
Camastra, F .
PATTERN RECOGNITION, 2003, 36 (12) :2945-2954
[10]   Intrinsic dimension estimation of data: An approach based on Grassberger-Procaccia's algorithm [J].
Camastra, F ;
Vinciarelli, A .
NEURAL PROCESSING LETTERS, 2001, 14 (01) :27-34