|
基于改进徒步优化算法的变精度邻域粗糙集特征选择研究
|
Abstract:
特征选择是数据降维的有效方法,粗糙集理论是处理不确定性信息的有效工具,在粗糙集理论中,变精度邻域粗糙集是处理连续型数值信息系统的有效工具,在分类任务中,变精度邻域粗糙集可以为属性集提供重要性度量。徒步优化算法是群智能优化算法中的一种,可以寻找组合最优解,本文针对徒步优化算法无法动态平衡局部与全局寻优能力,容易陷入局部最优的问题,提出改进徒步优化算法,并提出基于改进徒步优化算法的特征选择算法。为了验证所提算法的有效性,在8个UCI数据集上进行了基于改进徒步优化算法的特征选择算法的对比实验,通过实验验证了所提出算法的有效性和可行性。
Feature selection is an effective method for data dimensionality reduction, and rough set theory is an effective tool for dealing with uncertainty information. In rough set theory, variable precision neighborhood rough set is an effective tool for dealing with continuous numerical information systems. Hiking optimization algorithm is one of the swarm intelligence optimization algorithms, which can find the combined optimal solution. In this paper, aiming at the problem that the hiking optimization algorithm cannot dynamically balance the local and global optimization ability, and is easy to fall into the local optimum, an improved hiking optimization algorithm is proposed. A feature selection algorithm based on improved hiking optimization algorithm was proposed. In order to verify the effectiveness of the proposed algorithm, comparative experiments of feature selection algorithm based on improved hiking optimization algorithm are carried out on 8 UCI data sets, and the effectiveness and feasibility of the proposed algorithm are verified by experiments.
[1] | Pawlak, Z. (1982) Rough Sets. International Journal of Computer & Information Sciences, 11, 341-356. https://doi.org/10.1007/bf01001956 |
[2] | Hu, Q., Yu, D., Liu, J. and Wu, C. (2008) Neighborhood Rough Set Based Heterogeneous Feature Subset Selection. Information Sciences, 178, 3577-3594. https://doi.org/10.1016/j.ins.2008.05.024 |
[3] | Oladejo, S.O., Ekwe, S.O. and Mirjalili, S. (2024) The Hiking Optimization Algorithm: A Novel Human-Based Metaheuristic Approach. Knowledge-Based Systems, 296, Article ID: 111880. https://doi.org/10.1016/j.knosys.2024.111880 |
[4] | Hua, Z., Zhou, Y. and Huang, H. (2019) Cosine-Transform-Based Chaotic System for Image Encryption. Information Sciences, 480, 403-419. https://doi.org/10.1016/j.ins.2018.12.048 |
[5] | Krithikadatta, J. (2014) Normal Distribution. Journal of Conservative Dentistry, 17, 96-97. https://doi.org/10.4103/0972-0707.124171 |
[6] | McDonald, J.B. and Xu, Y.J. (1995) A Generalization of the β Distribution with Applications. Journal of Econometrics, 66, 133-152. https://doi.org/10.1016/0304-4076(94)01612-4 |
[7] | Heidari, A.A., Mirjalili, S., Faris, H., Aljarah, I., Mafarja, M. and Chen, H. (2019) Harris Hawks Optimization: Algorithm and Applications. Future Generation Computer Systems, 97, 849-872. https://doi.org/10.1016/j.future.2019.02.028 |
[8] | Xu, Y., Yang, Z., Li, X., Kang, H. and Yang, X. (2020) Dynamic Opposite Learning Enhanced Teaching-Learning-Based Optimization. Knowledge-Based Systems, 188, Article ID: 104966. https://doi.org/10.1016/j.knosys.2019.104966 |
[9] | Wang, X., Yang, J., Teng, X., Xia, W. and Jensen, R. (2007) Feature Selection Based on Rough Sets and Particle Swarm Optimization. Pattern Recognition Letters, 28, 459-471. https://doi.org/10.1016/j.patrec.2006.09.003 |
[10] | Chen, Y., Zhu, Q. and Xu, H. (2015) Finding Rough Set Reducts with Fish Swarm Algorithm. Knowledge-Based Systems, 81, 22-29. https://doi.org/10.1016/j.knosys.2015.02.002 |
[11] | 王生武. 粗糙集中基于鲸鱼算法和模糊决策的特征选择方法研究[D]: [硕士学位论文]. 成都: 西南交通大学, 2020. https://doi.org/10.27414/d.cnki.gxnju.2020.000780 |