|
基于“DeepSeek + 医疗”在线评论的情感分析
|
Abstract:
近年来,人工智能在医疗方面的应用越来越广泛,2025年1月掀起了国产大模型DeepSeek的热潮,全国多家医院陆续接入了DeepSeek技术,如西安国际医学中心医院,该技术的接入可以提升诊断效率及就医体验。随着DeepSeek大模型在医疗领域广泛部署,其应用效果引发关注,但针对用户情感态度与使用意愿的系统研究仍然缺乏。为此,本研究采用Python爬虫从抖音平台采集4000余条评论,经数据清洗和分词处理,使用TF-IDF提取文本特征,构建Transformer情感分析模型,并运用LDA模型提取评论主题。结果表明,评论情感以正面为主,多数用户认可“DeepSeek + 医疗”的诊断准确性与便利性,但少量负面评论涉及隐私、安全及AI可靠性顾虑。LDA分析识别出智能医疗、就医服务、医疗变革、诊疗方案四个主要主题。据此提出提高诊断精准度、优化用户界面和加强隐私保护等建议,以提升用户信任度和使用意愿。
In recent years, the integration of artificial intelligence (AI) into the healthcare sector has advanced rapidly. In January 2025, the large domestic language model DeepSeek garnered widespread attention, with numerous hospitals across China—including Xi’an International Medical Center Hospital—successively adopting this technology. The implementation of DeepSeek has demonstrated the potential to enhance diagnostic efficiency and improve the overall patient experience. Despite its growing deployment, there remains a paucity of systematic research on user sentiment and adoption intentions related to this technology. To address this research gap, this study employed Python-based web scraping techniques to collect over 4,000 user comments from the TikTok platform. Following data cleaning and Chinese word segmentation, textual features were extracted using the Term Frequency–Inverse Document Frequency (TF-IDF) method. A transformer-based sentiment analysis model was then constructed to assess public sentiment, and a Latent Dirichlet Allocation (LDA) model was utilized to identify underlying thematic structures within the comments. Findings reveal that public sentiment is predominantly positive, with the majority of users expressing approval of the diagnostic accuracy and convenience offered by the “DeepSeek + healthcare” model. Nonetheless, a minority of negative comments reflect concerns pertaining to data privacy, system security, and the overall reliability of AI technologies. The LDA topic modeling identified four principal themes: intelligent healthcare, medical service delivery, healthcare transformation, and diagnostic strategies. Based on these insights, the study proposes targeted recommendations to enhance diagnostic precision, improve user interface design, and strengthen data privacy safeguards, thereby fostering greater user trust and increasing the willingness to adopt AI-driven healthcare solutions.
[1] | 闾海荣, 江瑞, 张学工, 等. DeepSeek与医学大语言模型: 技术创新与医疗服务模式重构[J]. 医学信息学杂志, 2025, 46(2): 1-7+13. |
[2] | 冯海, 马甲林, 许林杰, 等. 融合标签嵌入和知识感知的多标签文本分类方法[J]. 南京大学学报(自然科学), 2023, 59(2): 273-281. |
[3] | 黄春梅, 王松磊. 基于词袋模型和TF-IDF的短文本分类研究[J]. 软件工程, 2020, 23(3): 1-3. |
[4] | 常冬雨. 高校智能财务审计系统构建研究[J]. 财会讯, 2021(17): 147-151. |
[5] | 朱益平, 慕钰, 孙逸宁. ChatGPT大语言模型的评论情感分类预测与主题识别研究[J/OL]. 情报科学, 1-35. https://kns.cnki.net/kcms2/article/abstract?v=SyD34uTtguCt6KGLBC0lIL8cZh7WpFeRL1YR7Hl6hTuqiePKVA7rnV771S9C1U36uEp8TzgVhVemrqTKKV6_UYhrepmU0cqqzgPBBdBxRaX2vjcZOza7ACsMnPXq29Sce4plTgSEf2F286OXj_LvAmc0LXGGzo9mxzo84shBYZ_AoQV58ksoNf6AhyW9zohZyTU-1vVBjCU=&uniplatform=NZKPT, 2025-03-11. |
[6] | 马景楠, 毛春. ChatGPT用户接受和使用意愿影响因素研究——基于TAM和TTF整合视角[J]. 科技传播, 2024, 16(17): 118-124. |
[7] | Blei, D.M., Ng, A.Y. and Jordan, M.L. (2003) Latent Dirichlet Allocation. The Journal of Machine Learning Research, 3, 993-1002. |
[8] | 高慧颖, 刘嘉唯, 杨淑昕. 基于改进LDA的在线医疗评论主题挖掘[J]. 北京理工大学学报, 2019, 39(4): 427-434. |
[9] | 孙辛欣, 孙亚楠, 赵宇翔, 等. AI医疗语音助手声音特征对老年人感知可信度和使用意愿的影响研究[J/OL]. 数据分析与知识发现, 1-18. https://kns.cnki.net/kcms2/article/abstract?v=a4fp6zKrpgZcKeb1ySGwmI8Kfk9bDzuqrnWZqtZMnjkPyYQBzTBDiXNiDA4X2T42k0266FSV5DrrJoeuwHLG2q2mgfmdyr0Gn9KC_1ZQqix8jtxSkCCgHicey57uXs7x_nGxbtaPaXDZEoD3IMD9pnRn7opvBMEMTDVqjWqKT_aIZhibRtcWqA==&uniplatform=NZKPT&language=CHS, 2025-03-11. |
[10] | 张海, 刘畅, 王东波, 等. ChatGPT用户使用意愿影响因素研究[J]. 情报理论与实践, 2023, 46(4): 15-22. |