Analyzing the Impact of Feature Selection on Crop Yield Prediction

DOI: 10.4236/jcc.2024.128017, PP. 278-291

Keywords: Crop Yield, Feature Selection, Hybrid, Embedded, Analysis

Abstract:

In the agriculture sector, machine learning is widely used by researchers for crop yield prediction. However, identifying the most critical features in a dataset is difficult. Feature selection techniques remove extraneous and noisy features from the original feature set, so the model focuses only on the important features of the data, which reduces execution time and improves efficiency. The aim of this study is to determine the relevant feature subsets that achieve high predictive performance, using feature selection techniques such as filter, wrapper, and embedded methods. In this work, a rank-based feature selection technique, a weighted feature selection technique, and a hybrid feature selection technique are applied to agricultural data. The optimal feature set returned by each technique is used for yield prediction with Linear Regression, Random Forest, and Decision Tree Regressor models. The prediction accuracy of these three models is analyzed using different evaluation parameters. This study helps increase predictive accuracy with the minimum number of features.
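The abstract does not say how its rank-based technique scores features, so the following is only a minimal sketch of one common filter-style approach: rank each feature by the absolute Pearson correlation with the target and keep the top k. The column names, the synthetic data, and the helper names (`pearson`, `rank_features`, `select_top_k`) are all hypothetical and are not taken from the paper.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def rank_features(columns, target):
    """Rank feature names by |correlation| with the target, strongest first."""
    scores = {name: abs(pearson(values, target)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)


def select_top_k(columns, target, k):
    """Keep only the k highest-ranked feature columns."""
    keep = rank_features(columns, target)[:k]
    return {name: columns[name] for name in keep}


# Synthetic, made-up data (an assumption for illustration only):
# yield depends on rainfall and fertilizer; the other two columns are noise.
rng = random.Random(0)
n_rows = 200
columns = {
    "rainfall": [rng.uniform(0, 1) for _ in range(n_rows)],
    "fertilizer": [rng.uniform(0, 1) for _ in range(n_rows)],
    "noise_a": [rng.uniform(0, 1) for _ in range(n_rows)],
    "noise_b": [rng.uniform(0, 1) for _ in range(n_rows)],
}
crop_yield = [
    3.0 * r + 2.0 * f + rng.gauss(0, 0.1)
    for r, f in zip(columns["rainfall"], columns["fertilizer"])
]

selected = select_top_k(columns, crop_yield, k=2)
print(sorted(selected))
```

In a fuller pipeline, the reduced column set would then be passed to each regressor, and the evaluation scores compared against those obtained with all features. A correlation filter captures only linear, one-feature-at-a-time relevance, which is one reason wrapper, embedded, and hybrid methods are studied alongside it.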

