全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
电子学报  2013 

基于Memetic优化的智能DNA序列数据压缩算法

DOI: 10.3969/j.issn.0372-2112.2013.03.016, PP. 513-518

Keywords: DNA序列数据压缩,生物信息学,近似重复矢量,Memetic算法

Full-Text   Cite this paper   Add to My Lib

Abstract:

提出近似重复矢量(ApproximateRepeatVector,ARV)模型用于DNA序列冗余片段的描述.通过将数据生物信息学特征引入压缩预处理,并使用ARV矢量构造编码码本,提出了非对称DNA序列压缩算法BioLZMA-2.算法引入基于粒子群优化的Memetic改进方法CLIPSO-MA用于压缩码本的智能优化设计,有效提升了编码性能.在标准测试序列上的实验结果表明,BioLZMA-2可获得比现有DNA序列数据压缩方法更高的压缩率.

References

[1]  Srinivasa K G,Jagadish M,et al.Efficient compression of non-repetitive DNA sequences using dynamic programming [A].Proceeding of International Conference on Advanced Computing and Communications [C].Mangalore:ADCOM,2006.569-574.
[2]  Matsumoto T,Sadakane K,et al.Biological sequence compression algorithms [A].Proceeding of Genome Informatics Workshop [C].Tokyo:CIW,2000.43-52.
[3]  Nordin A,Yazid M,et al.A guided dynamic programming approach for searching a set of similar DNA sequences [A].Proceeding of International Conference on the Applications of Digital Information and Web Technologies [C].London:IEEE,2009.512-517.
[4]  周家锐,纪震,等.基于自适应智能单粒子优化算法的Gabor人脸识别方法 [A].全国模式识别学术会议 [C].重庆:CCPR,2010.1-5. Zhou J R,Ji Z,et al.Face recognition using Gabor wavelet and self-adaptive intelligent single particle optimizer.Proceeding of Chinese Conference on Pattern Recognition.Chongqing:CCPR,2010.1-5.(in Chinese)
[5]  Wu S,Manber U.Fast text searching:allowing errors[J].Communications of the ACM,1992,35(10):83-91.
[6]  Osborne M.Predicting DNA Sequences Using a Backoff Language Model [DB/OL].http://www.cogsci.ed.ac.uk/~osborne/dna-backoff.ps.gz,2009-05-15.
[7]  Galperin M Y,Cochrane G R.Petabyte-scale innovations at the European nucleotide archive[J].Nucleic Acids Research,2009,37:D1-D4.
[8]  Chen X,Kwong S,et al.A compression algorithm for DNA sequences and its applications in genome comparison [A].Proceeding of the 10th Workshop on Genome Informatics [C].Tokyo:GIW,1999.51-61.
[9]  Chen X,Li M,et al.DNACompress:fast and effective DNA sequence compression[J].Bioinformatics,2002,18(12):1696-1698.
[10]  Korodi G,Tabus I.An efficient normalized maximum likelihood algorithm for DNA sequence compression[J].ACM Transactions on Information Systems,2005,23(1):3-34.
[11]  林毅申,林丕源,等.基于字典的DNA序列压缩算法研究及应用[J].计算机应用研究,2007,24(6):265-267. Lin Y S,Lin P Y,et al.Research and mplementation of dictionary-based DNA compression algorithm[J].Application Research of Computers,2007,24(6):265-267.(in Chinese)
[12]  纪震,周家锐,等.基于生物信息学特征的DNA序列数据压缩算法[J].电子学报,2011,39(5):991-995. Ji Z,Zhou J R,et al.Bioinformatics features based DNA sequence data compression algorithm[J].Acta Electronica Sinica,2011,39(5):991-995.(in Chinese)
[13]  王玉,饶妮妮,等.基于小波变换技术预测DNA序列的编码区[J].电子学报,2007,35(1):141-144. Wang Y,Rao N N,et al.Predicting protein coding regions of DNA sequences based on wavelet translation technique[J].Acta Electronica Sinica,2007,35(1):141-144.(in Chinese)
[14]  Liang J J,Qin A K.Comprehensive learning particle swarm optimizer for global optimization of multimodal functions[J].IEEE Transactions on Evolutionary Computation,2006,10(3):281-295.
[15]  Benson D A,Karsch-Mizrachi I,et al.GenBank[J].Nucleic Acids Research,2008,36:D25-D30.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133