全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

DNA Sequence Classification by Convolutional Neural Network

DOI: 10.4236/jbise.2016.95021, PP. 280-286

Keywords: DNA Sequence Classification, Deep Learning, Convolutional Neural Network

Full-Text   Cite this paper   Add to My Lib

Abstract:

In recent years, a deep learning model called convolutional neural network with an ability of extracting features of high-level abstraction from minimum preprocessing data has been widely used. In this research, we proposed a new approach in classifying DNA sequences using the convolutional neural network while considering these sequences as text data. We used one-hot vectors to represent sequences as input to the model; therefore, it conserves the essential position information of each nucleotide in sequences. Using 12 DNA sequence datasets, we evaluated our proposed model and achieved significant improvements in all of these datasets. This result has shown a potential of using convolutional neural network for DNA sequence to solve other sequence problems in bioinformatics.

References

[1]  Eickholt, J. and Cheng, J. (2013) DNdisorder: Predicting Protein Disorder Using Boosting and Deep Networks. BMC Bioinformatics, 14, 88-98.
http://dx.doi.org/10.1186/1471-2105-14-88
[2]  Leung, M.K.K., Xiong, H.Y., Lee, L.J. and Frey, B.J. (2014) Deep Learning of the Tissue-Regulated Splicing Code. Bioinformatics, 30, i121-i129.
http://dx.doi.org/10.1093/bioinformatics/btu277
[3]  Lee, H., Grosse, R., Ranganath, R. and Ng, Y.A. (2009) Convolutional Deep Belief Networks for Scalable Unsupervised Learning of Hierarchical. Proceeding of the 26th Annual International Conference on Machine Learning, Montreal, 14-18 June 2009, 609-616.
http://dx.doi.org/10.1145/1553374.1553453
[4]  Mikolov, T., Chen, K., Greg, C. and Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781.
[5]  Johnson, R. and Zhang, T. (2015) Effective Use of Word Order for Text Categorization with Convolutional Neural Networks. Proceeding of Human Language Technologies: The 2015 Annual Conference of the North American, Denver Colorado, 31 May-5 June 2015, 103-112.
http://dx.doi.org/10.3115/v1/n15-1011
[6]  Pokholok, D.K., Harbison, C.T., Levine, S., Cole, M., Hannett, N.M., Lee, T.I., Bell, G.W., Walker, K., Rolfe, P.A., Herbolsheimer, E., Zeitlinger, J., Lewitter, F., Gifford, D.K. and Young, R.A. (2005) Genome-Wide Map of Nucleosome Acetylation and Methylation in Yeast. Cell, 122, 517-527.
http://dx.doi.org/10.1016/j.cell.2005.06.026
[7]  Higashihara, M., Rebolledo-Mendez, J.D., Yamada, Y. and Satou, K. (2008) Application of a Feature Selection Method to Nucleosome Data: Accuracy Improvement and Comparison with Other Methods. WSEAS Transactions on Biology and Biomedicine, 5, 153-162.
[8]  Li, J. and Wong, L. (2003) Using Rules to Analyse Bio-Medical Data: A Comparison between C4.5 and PCL. Proceedings of Advances in Web-Age Information Management 4th International Conference, Chengdu, 17-19 August 2003, 254-265.
http://dx.doi.org/10.1007/978-3-540-45160-0_25
[9]  Towell, G., Shavlik, J. and Noordewier, M. (1990) Refinement of Approximate Domain Theories by Knowledge-Based Artificial Neural Networks. Proceedings of the 8th National Conference on Artificial Intelligence, Boston, 29 July-3 August 1990, 861-866.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133