OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

PLOS ONE 2013

Drug Repositioning by Kernel-Based Integration of Molecular Structure, Molecular Activity, and Phenotype Data

DOI: 10.1371/journal.pone.0078518

Yongcui Wang, Shilong Chen, Naiyang Deng, Yong Wang

Full-Text Cite this paper Add to My Lib

Abstract:

Computational inference of novel therapeutic values for existing drugs, i.e., drug repositioning, offers the great prospect for faster and low-risk drug development. Previous researches have indicated that chemical structures, target proteins, and side-effects could provide rich information in drug similarity assessment and further disease similarity. However, each single data source is important in its own way and data integration holds the great promise to reposition drug more accurately. Here, we propose a new method for drug repositioning, PreDR (Predict Drug Repositioning), to integrate molecular structure, molecular activity, and phenotype data. Specifically, we characterize drug by profiling in chemical structure, target protein, and side-effects space, and define a kernel function to correlate drugs with diseases. Then we train a support vector machine (SVM) to computationally predict novel drug-disease interactions. PreDR is validated on a well-established drug-disease network with 1,933 interactions among 593 drugs and 313 diseases. By cross-validation, we find that chemical structure, drug target, and side-effects information are all predictive for drug-disease relationships. More experimentally observed drug-disease interactions can be revealed by integrating these three data sources. Comparison with existing methods demonstrates that PreDR is competitive both in accuracy and coverage. Follow-up database search and pathway analysis indicate that our new predictions are worthy of further experimental validation. Particularly several novel predictions are supported by clinical trials databases and this shows the significant prospects of PreDR in future drug treatment. In conclusion, our new method, PreDR, can serve as a useful tool in drug discovery to efficiently identify novel drug-disease interactions. In addition, our heterogeneous data integration framework can be applied to other problems.

References

[1]	DiMasi JA, Hansen RW, Grabowski HG (2003) The price of innovation: new estimates of drug development costs. J Health Econ 22: 151–185.
[2]	Kinnings SL, Liu N, Buchmeier N, Tonge PJ, Xie L, et al. (2009) Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis. PLoS Comput Biol 5: e1000423.
[3]	Li J, Zhu X, Chen JY (2009) Building disease-specific drug-protein connectivity maps from molecular interaction networks and PubMed abstracts. PLoS Comput Biol 5: e1000450.
[4]	Kotelnikova E, Yuryev A, Mazo I, Daraselia N (2010) Computational approaches for drug repositioning and combination therapy design. J Bioinform Comput Biol 8: 593–606.
[5]	Hu G, Agarwal P (2009) Human disease-drug network based on genomic expression profiles. PLoS One 4: e6536.
[6]	Chiang AP, Butte AJ (2009) Systematic evaluation of drug-disease relationships to identify leads for novel drug uses. Clin Pharmacol Ther 86: 507–510.
[7]	Wu ZK, Wang Y, Chen L (2013) Network-based drug repositioning. Mol. BioSyst 9(6): 1268–1281.
[8]	Gottlieb A, Stein GY, Ruppin E, Sharan R (2011) PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol Syst Biol 7: 496.
[9]	Vapnik V (1995) The nature of statistical learning theory. Springer, New York.
[10]	Vapnik V (1998) Statistical learning theory. Wiley.
[11]	Sch？lkopf B, Tsuda K, Vert JP (2004) Support vector machine applications in computational biology. MIT Press, Cambridge, MA, pp: 71–92.
[12]	van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JA (2006) A text-mining analysis of the human phenome. Eur J Hum Genet 14: 535–542.
[13]	Hofmann T, Sch？lkopf B, Smola AJ (2008) Kernel methods in machine learning. Ann Stat 36: 1171–1220.
[14]	Pauwels E, Stoven V, Yamanishi Y (2011) Predicting drug side-effect profiles: a chemical fragmentbased approach. BMC Bioinformatics 12: 169.
[15]	Yamanishi Y, Kotera M, Kanehisa M, Goto S (2010) Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework. Bioinformatics 26: i246–i254.
[16]	Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, et al. (2006) From genomics to chemical genomics: new developments in kegg. Nucleic Acids Res 34: D354–D357.
[17]	Schomburg I, Chang A, Ebeling C, Gremse M, Heldt C, et al. (2004) Brenda, the enzyme database: updates and major new developments. Nucleic Acids Res 32: D431–D433.
[18]	Günther S, Kuhn M, Dunkel M, Campillos M, Senger C, et al. (2008) Supertarget and matador: resources for exploring drug–target relationships. Nucleic Acids Res 36: D919–D922.
[19]	Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, et al. (2008) Drugbank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 36: D901–D906.
[20]	Yamanishi Y, Araki M, Gutteridge A, Honda W, Kanehisa M (2008) Prediction of drug-target interaction networks from the integration of chemical and genomic spaces. Bioinformatics 24: i232–i240.
[21]	Zhao SW, Li S (2010) Network-based relating pharmacological and genomic spaces for drug target identification. PLoS ONE 5(7): e11764.
[22]	Wang YC, Yang ZX, Wang Y, Deng NY (2010) Computationally probing drug-protein interactions via support vector machine. Lett Drug Des Discov 7: 370–378.
[23]	Wang YC, Chen SL, Deng NY, Wang Y (2013) Network predicting drug’s anatomical therapeutic chemical code. Bioinformatics 29(10): 1317–1324.
[24]	Smith TF, Waterman M (1981) Identification of common molecular subsequences. J Mol Biol 147: 195–197.
[25]	Campillos M, Kuhn M, Gavin AC, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321: 263–266.
[26]	Yang L, Agarwal P (2011) Systematic drug repositioning based on clinical side-effects. PLoS ONE 6(12): e28025.
[27]	Duran-Frigola M, Aloy P (2012) Recycling side-effects into clinical markers for drug repositioning. Genome Med 4: 3.
[28]	Basilico J, Hofmann T (2004) A joint framework for collaborative and content filtering. 27th Annual International ACM SIGIR Conference.
[29]	Ben-Hur A, Noble WS (2005) Kernel methods for predicting protein-protein interactions. Bioin- formatics 21 Suppl 1i38–i46.
[30]	Oyama S, Manning CD (2004) Using feature conjunctions across examples for learning pairwise classifiers. In European Conference on Machine Learning 2004, pp: 322–333.
[31]	Hue M, Vert JP (2010) On learning with kernels for unordered pairs. ICML, pp: 463–470.
[32]	Wu G, Chang EY (2003) Class-boundary alignment for imbalanced dataset learning. In ICML 2003 Workshop on Learning from Imbalanced Data Sets.
[33]	Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA (2002) Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 30: 52–55.
[34]	Chen B, Wild D, Guha R (2009) PubChem as a Source of Polypharmacology. (J Chem Inform Model 49(9): 2044–2055.
[35]	Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P (2010) A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 6: 343.
[36]	Chang CC, Lin CJ (2011) LIBSVM : a library for support vector machines. ACM TIST 2(27): 1–27.
[37]	Gribskov M, Robinson NL (1996) Use of receiver operating characteristic (roc) analysis to evaluate sequence matching. Comput Chem 20: 25–33.
[38]	Raghavan VV, Bollmann P, Jung GS (1989) A critical investigation of recall and precision as measures of retrieval system performance. ACM TMIS 7(3): 205–229.
[39]	Davis J, Goadrich M (2006) The relationship between Precision-Recall and ROC curves. ICML ’06 Proceedings of the 23rd international conference on Machine learning, pages 233–240. ACM.
[40]	Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, et al. (2009) Human protein reference database-2009 update. Nucleic Acids Res 37: D767–D772.
[41]	Brown KR, Jurisica I (2005) Online predicted human interaction database. Bioinformatics 21: 2076–2082.
[42]	Bader GD, Betel D, Hogue CW (2003) BIND: the biomolecular interaction network database. Nucleic Acids Res 31: 248–250.
[43]	Wu X, Jiang R, Zhang MQ, Li S (2008) Network-based global inference of human disease genes. Mol Syst Biol 4: 189.
[44]	Fraser HB, Plotkin JB (2007) Using protein complexes to predict phenotypic effects of gene mutation. Genome Biol 8: R252.
[45]	McGary KL, Lee I, Marcotte EM (2007) Broad network-based predictability of S. cerevisiae gene loss-of-function phenotypes. Genome Biol 8: R258.
[46]	Whisstock JC, Lesk AM (2003) Prediction of protein function from protein sequence and structure. Q Rev Biophys 36: 307–340.
[47]	Dobson PD, Cai YD, Stapley BJ, Doig AJ (2004) Prediction of protein function in the absence of significant sequence similarity. Curr Med Chem 11: 2135–2142.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133