|
- 2016
基于真实世界临床数据的失眠病判别分析DOI: 10.3969/j.issn.0253-2778.2016.10.011 Keywords: 结构化, 半监督学习, 标签传播, 中医, 疾病判别, 失眠Key words: structurization semi-supervised learning label propagation algorithm TCM disease identification insomnia Abstract: 基于真实世界中医医疗数据集,提出了针对性的中医非结构化转结构化的数据预处理方法,并在监督分类模型和半监督分类模型上对得到的症状特征进行了实验验证.在真实医疗数据集上进行实验,发现无论是监督分类算法还是半监督分类算法在所提出的数据预处理模型上都得到了较优的分类效果,并且发现标签传播算法不仅在分类器稳定性上取得了较大的优势,在带标注数据较少时,仍能取得较好的实验结果.Abstract:A new data preprocessing method based on the real-world medical database was proposed, which can change unstructured data into structured data. Supervised algorithms and semi-supervised algorithms were utilized to verify the effectiveness of the clinical features which were obtained through our data preprocessing method. From the experimental results on the real world dataset, it is found that both supervised classification and semi-supervised algorithms can get a better result based on the clinical symptom features trained from our data preprocessing method. And it is found that the label propagation algorithm not only achieves a great stability on the real Chinese medicine database when compared with classical classification algorithm, but also obtains good results when the ratio is low.
|