本文已被:浏览 1796次 下载 1318次
投稿时间:2016-06-12 修订日期:2016-12-20
投稿时间:2016-06-12 修订日期:2016-12-20
中文摘要: 针对人体面部皮肤状态指标与中医体质类型之间的关联性进行科学、定量研究,从测试数据持续累积与知识发现深入推进的过程视角,尝试揭示人体内在中医体质与外观皮肤状态指标间的复杂动态演化规律。综合小样本条件下决策树的良好归纳特性及大样本条件下贝叶斯算法分类准确率高的优势。提出基于建模数据量会不断增多的趋势,构建可自适应修订决策树和模糊朴素贝叶斯融合分类算法的权重,以适用于测试数据从小到大积累过程中分类模型均具有较好分类特性及可解释性的应用要求。其中决策树采用最佳后剪枝方式,避免了常规决策树存在的过拟合弊端;朴素贝叶斯算法则通过定义指标归属区间的模糊隶属度来解决皮肤属性测试与分类中存在的随机性与模糊性。实证结果表明本文提出的分类模型的融合权重可动态调整且随着建模数据的增多分类精度会相应提高。目前对应151个建模数据的分类模型的分类准确率为86.7%,高于独立决策树、朴素贝叶斯的83.3%和80%,亦高于对照组80个建模数据对应分类准确率的76.7%。分析可得:此皮肤与体质动态分类模型通过有效利用参与建模的数据信息,能识别出人体面部外观皮肤状态指标与内在中医体质之间的复杂关联性,建立的分类模型具有较好的精度与可解释性,为基于数据驱动的中医理论的科学化、智能化发展进行了有益的探索。
Abstract:It is valuable to recognize the correlation between the skin state index of human face and the type of Traditional Chinese Medicine (TCM) constitution with scientific and quantitative research methods, and from the process perspective of the accumulation of data and the further advance of knowledge discovery in database, the complex dynamic evolution law of TCM constitution and the appearance skin state index would be revealed. A classification model was proposed that combined the good inductive properties of decision tree for small sample data and the high classification accuracy of Bayes algorithm for large sample data. The decision tree and fuzzy Naive Bayes algorithm were fused to optimize adaptively the weight of index under the trend of the test data accumulation from less to more with a better classification accuracy and interpretation performance. The post-pruning way was used to avoid the over fitting of the conventional decision tree. The fuzzy membership function was introduced to the Naive Bayes by defining interval boundary to solve the randomness and fuzziness in testing and classification of skin state index. The results showed that the fusion weights of the proposed classification model could be adjusted dynamically and the classification accuracy would be increased with the increase of modeling data. The classification accuracy of the classification model with 151 data is 86.7%, higher than 83.3% of independent decision tree, 80% of Naive Bayes,and 76.7% of the matched group with 80 data. The dynamic classification model of facial skin and TCM constitution could effectively identify the complex relationship between the facial skin index and the internal type of TCM constitution by utilizing the data information involved in modeling. The classification model has good accuracy and interpretability, which is a beneficial exploration for the scientific and intelligent development of TCM theory based on data-driven.
文章编号:201600575 中图分类号: 文献标志码:
基金项目:北京市教育委员会科技发展计划重点项目(KZ201510011011);北京工商大学促进人才培养综合改革项目(19005428069/007);北京工商大学研究生创新基金
作者简介:
引用文本:
张慧妍,李爽,王小艺,王立,于家斌,许继平,董银卯,孟宏.基于关联性的动态分类模型—以皮肤与体质为例[J].工程科学与技术,2017,49(3):137-143.
ZHANG Huiyan,LI Shuang,WANG Xiaoyi,WANG Li,YU Jiabin,XU Jiping,DONG Yinmao,MENG Hong.Dynamic Classification Model Based on Correlation Recognition—An Example of Skin and Traditional Chinese Medicine Constitution[J].Advanced Engineering Sciences,2017,49(3):137-143.
引用文本:
张慧妍,李爽,王小艺,王立,于家斌,许继平,董银卯,孟宏.基于关联性的动态分类模型—以皮肤与体质为例[J].工程科学与技术,2017,49(3):137-143.
ZHANG Huiyan,LI Shuang,WANG Xiaoyi,WANG Li,YU Jiabin,XU Jiping,DONG Yinmao,MENG Hong.Dynamic Classification Model Based on Correlation Recognition—An Example of Skin and Traditional Chinese Medicine Constitution[J].Advanced Engineering Sciences,2017,49(3):137-143.