Multi-Task Joint Learning Model for Segmenting and Classifying Tongue Images Using a Deep Neural Network

被引:195
作者
Xu, Qiang [1 ]
Zeng, Yu [2 ]
Tang, Wenjun [3 ]
Peng, Wei [1 ]
Xia, Tingwei [1 ]
Li, Zongrun [1 ]
Teng, Fei [2 ]
Li, Weihong [1 ]
Guo, Jinhong [4 ]
机构
[1] Chengdu Univ Tradit Chinese Med, Sch Basic Med Sci, Chengdu 610075, Peoples R China
[2] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Peoples R China
[3] Hosp Chengdu Univ Tradit Chinese Med, Chengdu 610072, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Inst Med Equipment, Chengdu 611731, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Tongue; Task analysis; Image segmentation; Coatings; Image color analysis; Machine learning; Feature extraction; Tongue characterization; tongue classification; tongue segmentation; multi-task joint learning; deep learning; CHINESE-MEDICINE; SEGMENTATION;
D O I
10.1109/JBHI.2020.2986376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Automatic tongue image segmentation and tongue image classification are two crucial tongue characterization tasks in traditional Chinese medicine (TCM). Due to the complexity of tongue segmentation and fine-grained traits of tongue image classification, both tasks are challenging. Fortunately, from the perspective of computer vision, these two tasks are highly interrelated, making them compatible with the idea of Multi-Task Joint learning (MTL). By sharing the underlying parameters and adding two different task loss functions, an MTL method for segmenting and classifying tongue images is proposed in this paper. Moreover, two state-of-the-art deep neural network variants (UNET and Discriminative Filter Learning (DFL)) are fused into the MTL to perform these two tasks. To the best of our knowledge, our method is the first attempt to manage both tasks simultaneously with MTL. We conducted extensive experiments with the proposed method. The experimental results show that our joint method outperforms the existing tongue characterization methods. Besides, visualizations and ablation studies are provided to aid in understanding our approach, which suggest that our method is highly consistent with human perception.
引用
收藏
页码:2481 / 2489
页数:9
相关论文
共 43 条
[1]
[Anonymous], 1994, Differential Diagnosis of TCM Symptoms
[2]
Computerized tongue diagnosis based on Bayesian networks [J].
Bo, P ;
Zhang, D ;
Li, NM ;
Wang, KQ .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2004, 51 (10) :1803-1810
[3]
Chen L, 2015, IEEE INT C BIOINFORM, P990, DOI 10.1109/BIBM.2015.7359818
[4]
Cyranoski D, 2018, NATURE, V561, P448, DOI 10.1038/d41586-018-06782-7
[5]
BIRNet: Brain image registration using dual-supervised fully convolutional networks [J].
Fan, Jingfan ;
Cao, Xiaohuan ;
Yap, Pew-Thian ;
Shen, Dinggang .
MEDICAL IMAGE ANALYSIS, 2019, 54 :193-206
[6]
Fu SY, 2017, 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), P730, DOI 10.1109/ICBDA.2017.8078732
[7]
Guo JW, 2016, 2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), P1386, DOI 10.1109/CISP-BMEI.2016.7852933
[8]
Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]
Hou J, 2017, 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), P725, DOI 10.1109/ICBDA.2017.8078731
[10]
Tongue shape classification by geometric features [J].
Huang, Bo ;
Wu, Jinsong ;
Zhang, David ;
Li, Naimin .
INFORMATION SCIENCES, 2010, 180 (02) :312-324