Environmental chemistry through intelligent atmospheric data analysis

被引:43
作者
Gross, Deborah S. [1 ]
Atlas, Robert [2 ]
Rzeszotarski, Jeffrey [2 ]
Turetsky, Emma [2 ]
Christensen, Janara [2 ]
Benzaid, Sami [2 ]
Olson, Jamie [2 ]
Smith, Thomas [2 ]
Steinberg, Leah [2 ]
Sulman, Jon [2 ]
Ritz, Anna [2 ]
Anderson, Benjamin [2 ]
Nelson, Catherine [2 ]
Musicant, David R. [2 ]
Chen, Lei [3 ]
Snyder, David C. [4 ]
Schauer, James J. [4 ]
机构
[1] Carleton Coll, Dept Chem, Northfield, MN 55057 USA
[2] Carleton Coll, Dept Comp Sci, Northfield, MN 55057 USA
[3] Univ Wisconsin, Dept Comp Sci, Madison, WI 53705 USA
[4] Univ Wisconsin, Environm Chem & Technol Program, Madison, WI 53705 USA
关键词
Mass spectrometry; Clustering; Aerosol particle; Data mining; Database design; Data and knowledge visualization; User interfaces; LASER MASS-SPECTROMETRY; PARTICLE ANALYSIS; CLUSTER-ANALYSIS; CLASSIFICATION; ALGORITHMS; MANAGEMENT; SOFTWARE; SYSTEM; ART-2A;
D O I
10.1016/j.envsoft.2009.12.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Here we present a new open-source software package designed to facilitate the analysis of atmospheric data, with emphasis on data mining applications applied to single-particle mass spectrometry data from aerosol particles. The software package, Enchilada (Environmental Chemistry through Intelligent Atmospheric Data Analysis), is designed to seamlessly handle large datasets, to allow for temporal aggregation of data from many instruments, and to integrate techniques such as clustering (K-means, K-medians, and Art-2a), labeling of peaks in mass spectra, and temporal correlations of multiple datasets from multiple instrument types. The software, which continues to be developed and improved, provides users with a single package to integrate data from multiple mass spectrometer systems (ATOFMS, PALMS, SPASS, Q-AMS) as well as any time-based data stream. A detailed description of the software and examples of analysis methods that are incorporated into it are described here. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:760 / 769
页数:10
相关论文
共 70 条
[1]  
Allen J. O., 2008, YAADA SOFTWARE TOOLK
[2]  
Anderson B.J., 2005, USER FRIENDLY CLUSTE
[3]  
ANDERSON BJ, 2006, 6 SIAM INT C DAT MIN
[4]  
[Anonymous], 2002, Principal components analysis
[5]  
[Anonymous], 1998, Learning from data-concepts, theory and methods
[6]  
[Anonymous], 2004, WILEY SER PROB STAT
[7]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[8]  
[Anonymous], 2009, MATLAB
[9]  
Badman ER, 2000, J MASS SPECTROM, V35, P659, DOI 10.1002/1096-9888(200006)35:6<659::AID-JMS5>3.3.CO
[10]  
2-M