Explorative data analysis techniques and unsupervised clustering methods to support clinical assessment of Chronic Obstructive Pulmonary Disease (COPD) phenotypes

Cited by: 39
Authors
Paoletti, Matteo [1]
Camiciottoli, Gianna [2]
Meoni, Eleonora [2]
Bigazzi, Francesca [2]
Cestelli, Lucia [2]
Pistolesi, Massimo [2]
Marchesi, Carlo [1]
Affiliations
[1] Univ Florence, Dept Comp Sci & Syst, I-50139 Florence, Italy
[2] Univ Florence, Dept Internal Med, Sect Resp Med, I-50139 Florence, Italy
Keywords
Explorative biomedical data analysis; Homogeneity analysis; Biomedical data clustering; Biomedical clustering; Multiple Correspondence Analysis; Principal Component Analysis; K-Harmonic Means; COPD; Chronic Obstructive Pulmonary Disease; Biomedical data mining; Attenuation
DOI
10.1016/j.jbi.2009.05.008
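Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
080201 [Mechanical manufacturing and automation]
Abstract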
Chronic Obstructive Pulmonary Disease (COPD) is the fourth leading cause of death worldwide and one of the major causes of chronic morbidity. Cigarette smoking is the most important risk factor for COPD. In these patients, airflow limitation is caused by a mixture of small airways disease and parenchymal destruction, the relative contributions of which vary from person to person. This twofold nature of the pathology has been studied in the past, and according to some authors each patient should be classified as presenting a predominantly bronchial or a predominantly emphysematous phenotype. In this study we applied various explorative analysis techniques (PCA, MCA, MDS) and a recent unsupervised clustering method (KHM) to a large dataset acquired from 415 COPD patients, to assess whether the data contain hidden structures corresponding to the different COPD phenotypes observed in clinical practice. To validate our methods, we compared the results obtained from the training set of 415 patients with lung density data acquired in a test set of 93 patients who underwent HRCT (High Resolution Computerized Tomography). (C) 2009 Elsevier Inc. All rights reserved.
Pages: 1013-1021
Page count: 9