|
电解加工知识本体中领域术语提取的研究与应用
|
Abstract:
为提高电解加工工艺知识本体中的概念提取的完整性,本文中构建了一种半自动化领域术语提取模型,该模型结合统计分析和数据挖掘的思想设计了N-Word算法,进行领域术语中词组的提取,3-Word构词性能最佳。为了提高领域术语的准确性,基于互信息(MI)和绝对词频对领域术语过滤得到2137个术语,进一步对术语修正和同义词合并处理,最终得到标准化的领域概念1894个。此模型满足对电解加工领域术语的提取,提高术语的领域覆盖度,保证本体构建的准确性。
In order to improve the integrity of concept extraction in ECM process knowledge ontology, this paper constructs a semi-automatic domain term extraction model, which combines the idea of statistical analysis and data mining to design N-Word algorithm to extract phrases in domain terms. 3-Word has the best word formation performance. In order to improve the accuracy of domain terms, 2137 terms were filtered based on mutual information (MI) and absolute word frequency, and 1894 standardized domain concepts were finally obtained through further term modification and synonym merging. This model can extract terms in the field of electrochemical machining, improve the domain coverage of terms, and ensure the accuracy of ontology construction.
[1] | 任飞亮, 沈继坤, 孙宾宾. 从文本中构建领域本体技术综述[J]. 计算机学报, 2019, 42(3): 654-675. |
[2] | 白宁超, 唐聃, 王亚强. 基于主动学习的传统中医症状本体构建方法研究综述[J]. 电子技术与软件工程, 2016, 13(7): 162-163+222. |
[3] | 余丰民, 林彦汝. 基于关键词词频统计的学科研究热点漂移程度模型构建及实证分析[J]. 情报理论与实践, 2020, 43(2):100-105. |
[4] | 陈辰, 王璐, 郝晓雪. 基于词频统计与语义关联的京津冀协同发展研究热点与前沿监测研究[J]. 河北科技图苑, 2018, 31(1): 31-37. |
[5] | Orhan, U. and Tulu, C.N. (2021) A Novel Embedding Approach to Learn Word Vectors by Weighting Semantic Relations: SemSpace. Expert Systems with Applications, 180, 115-146. https://doi.org/10.1016/j.eswa.2021.115146 |
[6] | Srinath, A.N., López, L.P., Fashandi, S., et al. (2022) Thermal Management System Architecture for Hydrogen-Powered Propulsion Technologies: Practices, Thematic Clusters, System Architectures, Future Challenges, and Opportunities. Energies, 15, 304-314. https://doi.org/10.3390/en15010304 |
[7] | 肖宇, 邓正宏. 信噪比约束下基于互信息的雷达波形设计[J]. 系统工程与电子技术, 2021, 43(7): 1775-1780. |
[8] | 程玉胜, 宋帆, 王一宾. 基于专家特征的条件互信息多标记特征选择算法[J]. 计算机应用, 2020, 40(2): 503-509. |
[9] | 张曼婷. 基于互信息的不完备信息系统属性约简算法研究[D]: [硕士学位论文]. 西安: 西安科技大学, 2020. |
[10] | Mohammadi, S.J., Fashandi, S.A.M., Jafari, S. and Nikolaidis, T. (2021) A Scientometric Analysis and Critical Review of Gas Turbine Aero-Engines Control: From Whittle Engine to More-Electric Propulsion. Measurement and Control, 54, 935-966. https://doi.org/10.1177/0020294020956675 |