Biomedical data integration: using XML to link clinical and research data sets

被引:7
作者
Berman, JJ [1 ]
Bhatia, K [1 ]
机构
[1] NCI, Resources Dev Branch, Canc Diag Program, Bethesda, MD 20892 USA
关键词
common data elements; data integration; interoperabiltiy; translational research; XML;
D O I
10.1586/14737159.5.3.329
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
Data integration occurs when a query proceeds through multiple data sets, thereby relating diverse data extracted from different data sources. Data integration is particularly important to biomedical researchers since data obtained from experiments on human tissue specimens have little applied value unless they can be combined with medical data (i.e., pathologic and clinical information). In the past, research data were correlated with medical data by manually retrieving, reading, assembling and abstracting patient charts, pathology reports, radiology reports and the results of special tests and procedures. Manual annotation of research data is impractical when experiments involve hundreds or thousands of tissue specimens resulting in large, complex data collections. The purpose of this paper is to review how XML (eXtensible Markup Language) provides the fundamental tools that support biomedical data integration. The article also discusses some of the most important challenges that block the widespread availability of annotated biomedical data sets.
引用
收藏
页码:329 / 336
页数:8
相关论文
共 45 条
[1]  
AHMED K, 2001, PROFESSIONAL XML MET
[2]   The human plasma proteome - History, character, and diagnostic prospects [J].
Anderson, NL ;
Anderson, NG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (11) :845-867
[3]  
[Anonymous], P 30 VLDB C TOR CAN
[4]   Combining laboratory data sets from multiple institutions using the logical observation identifier names and codes (LOINC) [J].
Baorto, DM ;
Cimino, JJ ;
Parvin, CA ;
Kahn, MG .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 1998, 51 (01) :29-37
[5]   Biomarker boom slowed by validation concerns [J].
Benowitz, S .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2004, 96 (18) :1356-1357
[6]   Pathology data integration with extensible Markup Language [J].
Berman, JJ .
HUMAN PATHOLOGY, 2005, 36 (02) :139-145
[7]   Tumor classification: molecular analysis meets Aristotle [J].
Berman, JJ .
BMC CANCER, 2004, 4 (1)
[8]   The tissue microarray data exchange specification: implementation by the Cooperative Prostate Cancer Tissue Resource [J].
Berman, JJ ;
Datta, M ;
Kajdacsy-Balla, A ;
Melamed, J ;
Orenstein, J ;
Dobbin, K ;
Patel, A ;
Dhir, R ;
Becich, MJ .
BMC BIOINFORMATICS, 2004, 5 (1)
[9]  
Berman JJ, 2004, ARCH PATHOL LAB MED, V128, P344
[10]   Racing to share pathology data [J].
Berman, JJ .
AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2004, 121 (02) :169-171