全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

抗癌药物活性的预测和优化分析——以乳腺癌为例
Prediction and Optimization of Anticancer Drug Activity—A Case Study of Breast Cancer

DOI: 10.12677/SA.2022.116139, PP. 1338-1347

Keywords: 随机森林,XGBoost,粒子群算法,优化问题
Random Forest
, XGBoost, Particle Swarm Optimization Algorithm, Optimization Problem

Full-Text   Cite this paper   Add to My Lib

Abstract:

本文利用机器学习方法,探究了统计模型在制药领域的应用价值。以治疗乳腺癌的候选药物为例,在研发过程中,主要考虑如下两种指标:1) 找出能够抑制ERα活性的化合物,尽量降低ERα活性;2) 药物应当具备ADMET性质。本文采用数据挖掘技术和基本统计方法,首先基于随机森林算法从504个变量中筛选出20个主要变量,利用XGBoost算法建立化合物结构(即分子描述符)与ERα生物活性之间的非线性关系,预测ERα生物活性,结果显示用该方法预测的准确率为92.74%。再用粒子群算法建立优化模型,在保证化合物具有较好的生物活性和ADMET性质的前提下,计算出分子描述符的具体数值,该研究结果为药物设计提供了一定的参考依据。
This paper uses machine learning methods to explore the application value of statistical models in the pharmaceutical field. Taking drug candidates for the treatment of breast cancer as an example, in the research and development process, the following two indicators are mainly considered: 1) find out the compounds that can inhibit ERα activity and reduce ERα activity as much as possible; 2) The drug should have ADMET properties. In this paper, data mining technology and basic statistical methods are used, firstly, 20 main variables are selected from 504 variables based on random forest algorithm, and the nonlinear relationship between compound structure (i.e. molecular descriptor) and ERα biological activity is established by XGBoost algorithm to predict ERα biological activity, and the results show that the accuracy of prediction by this method is 92.74%. Then, the particle swarm algorithm is used to establish an optimization model, and the specific value of the molecular descriptor is calculated under the premise of ensuring that the compound has good biological activity and ADMET properties, and the results of this study provide a certain reference for drug design.

References

[1]  Saba, A., Muhammad, S., Khera, R.A., et al. (2022) Identification of Halogen-Based Derivatives as Potent Inhibitors of Estrogen Receptor Alpha of Breast Cancer: An In-Silico Investigation. Journal of Computational Biophysics and Chemistry, 21, 181-205.
https://doi.org/10.1142/S2737416522500090
[2]  Bentzon, N., Düring, M., During, R.B., et al. (2008) Prognostic Effect of Estrogen Receptor Status across Age in Primary Breast Cancer. International Journal of Cancer, 122, 1089-1094.
https://doi.org/10.1002/ijc.22892
[3]  沈杰. 药物ADMET理论预测方法开发和靶向雌激素受体的药物设计研究[D]: [博士学位论文]. 上海: 华东理工大学, 2011.
[4]  Zang, Q., Rotroff, D.M. and Judson, R.S. (2013) Binary Classification of a Large Collection of Environmental Chemicals from Estrogen Receptor Assays by Quantitative Structure-Activity Relationship and Machine Learning Methods. Journal of Chemical Information and Modeling, 53, 3244-3261.
https://doi.org/10.1021/ci400527b
[5]  Zekri, A., Harkati, D., Kenouche, S., et al. (2020) QSAR Modeling, Docking, ADME and Reactivity of Indazole Derivatives as Antagonizes of Estrogen Receptor Alpha (ER-α) Positive in Breast Cancer. Journal of Molecular Structure, 1217, Article ID: 128442.
https://doi.org/10.1016/j.molstruc.2020.128442
[6]  Chang, K., Liu, S., Yan, H., et al. (2021) Quantitative Structure-Activity Relationship Modeling of Estrogen Receptor Alpha Bioactivity Based on Multiple Algorithms. 2021 4th International Conference on Algorithms, Computing and Artificial Intelligence, 1-6.
https://doi.org/10.1145/3508546.3508572
[7]  Babiker, S., Nasir, O., Alotaibi, S.H., et al. (2020) Prospective Breast Cancer Risk Factors Prediction in Saudi Women. Saudi Journal of Biological Sciences, 27, 1624-1631.
https://doi.org/10.1016/j.sjbs.2020.02.012
[8]  Chen, T.Q. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. 22nd SIGKDD Conference on Knowledge Discovery and Data Mining, 785-794.
https://doi.org/10.1145/2939672.2939785

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133