|
- 2015
基于半监督学习的即时语音通信隐藏检测
|
Abstract:
传统即时通信隐藏检测方法主要采用基于监督学习的检测方式,导致部署前需大量复杂的人工预处理,同时训练数据集与测试数据集分布的差异会影响检测的准确率。针对以上问题,该文首先重点针对即时语音通信隐蔽信道提出了一种全新的半监督混合式检测模型,该模型不存在人工挑选与标注训练数据集的过程,解决检测操作人工预处理复杂和适用性差的问题;然后设计了基于自学习的多准则融合模块,用于自行生成伪标注数据集,其可信度和代表度共同决定了即时语音通信隐藏检测系统的性能,且不存在语音通信隐藏检测中训练与测试集分布失配的情况;最后针对即时语音通信中常见的低码率语音流载体进行实验分析,在失配状况下基于有监督的检测方法以及无监督检测方法相比,其准确率具有明显优势;当训练样本与测试样本的分布不匹配时,该方法相比有监督的检测方法所受的影响更小。同时,实验显示该方法可以适用于多种编码检测过程。
Abstract:Existing instant voice communication steganalysis schemes are mainly based on supervised learning classifiers. These kinds of methods need large amounts of pre-processing and training and their accuracy can be easily destroyed by differences between the distribution of the training and testing data sets. This paper describes a semi-supervised hybrid detection model to improve detection which removes the manually annotated training data set, so this model is simpler and gives better detection scopes. This paper also describes a self-learning, multi-criteria fusion module which can automatically generate pseudo-labelled sets and combines the confidence and representative levels to judge the performance of instant voice communication steganalysis. There is no distribution mismatch between the testing data and the training data in this method. Tests with common low bit-rate speech coding carriers show that this method is more accurate than the un-supervised method and the supervised method in mismatched conditions. When the distributions of the training and testing data sets differ, this method is less affected than the supervised method. The tests also show that this method can be deployed on different kinds of speech codecs.
[1] | Mazurczyk W, Karas M, Szczypiorski K. SkyDe:A Skype-based steganographic method[J]. International Journal of Computers Communications & Control, 2013, 8(3):432-443. |
[2] | Kopiczko P, Mazurczyk W, Szczypiorski K. Stegtorrent:a steganographic method for the p2p file sharing service[C]//Proceedings of 2013 Security and Privacy Workshops(SPW). San Francisco, CA, USA:IEEE Press, 2013:151-157. |
[3] | HUANG Yongfeng, TANG Shanyu, ZHANG Yuan. Detection of covert voice-over Internet protocol communications using sliding window-based steganalysis[J]. IET Communications, 2011, 5(7):929-936. |
[4] | HUANG Yongfeng, LIU Chenghao, TANG Shanyu, et al. Streganography integration into a low-bit rate speech codec[J]. IEEE Transactions on Information Forensics and Security, 2012, 7(6):1865-1876. |
[5] | 两年前废掉了短信现又瞄准了语音通信[EB/OL].[2015-07-28]. http://ec.ctiforum.com/jishu/qiye/qiyetong- xinjishu/jishitongxin/jishudongtai/433729.html. Two years ago WeChat gave up SMS, now is aiming at voice communication[EB/OL].[2015-07-28]. http://ec.ctiforum.com/jishu/qiye/qiyetongxinjishu/jishitongxin/jishudongtai/433729.html.(in Chinese) |
[6] | LI Songbin, TAO Huaizhou, HUANG Yongfeng. Detection of quantization index modulation steganography in G.723.1 bit stream based on quantization index sequence analysis[J]. Journal of Zhejiang University SCIENCE C, 2012, 13(8):624-634. |
[7] | TIAN Hui, LIU Jin, LI Songbin. Improving security of quantization-index-modulation steganography in low bit-rate speech streams[J]. Multimedia Systems, 2014, 20(2):143-154. |
[8] | HUANG Yongfeng, TANG Shanyu, BAO Chunlan, et al. Steganalysis of compressed speech to detect covert voice over Internet protocol channels[J]. IET Information Security, 2011, 5(1):26-32. |
[9] | BAO Chunlan, HUANG Yongfeng, ZHU Chunyi. Steganalysis of compressed speech[C]//Proceedings of Multiconference on Computational Engineering in Systems Applications. Beijing, China:IEEE Press, 2006:5-10. |
[10] | 看极端组织ISIS如何展开网络"恐怖营销"[EB/OL].[2015-07-28]. http://www.techweb.com.cn/news/2014-09-01/2070887.shtml. See the extreme organization ISIS how to expand the network "terrorism marketing"[EB/OL].[2015-07-28]. http://www.techweb.com.cn/news/2014-09-01/2070887.shtml.(in Chinese) |
[11] | LIU Qingzhong, SUNG AH, QIAN Mengyu. Temporal derivative-based spectrum and mel-cepstrum audio steganalysis[J]. IEEE Transactions on Information Forensics and Security, 2009, 4(3):359-368. |
[12] | Ko?al OH, YürüklüE, Avcibas I. Chaotic-type features for speech steganalysis[J]. IEEE Transactions on Information Forensics and Security, 2008, 3(4):651-661. |
[13] | Kraetzer C, Dittmann J. Pros and cons of Mel-cepstrum based audio steganalysis using SVM classification[J]. Information Hiding Lecture Notes on Computer Science, 2007, 4567:359-377. |
[14] | Janicki A, Mazurczyk W, Szczypiorski K. Steganalysis of transcoding steganography[J]. Ann. Telecommun. 2014, 69(7-8):449-460. |
[15] | Mazurczyk W. VoIP steganography and its detection:A survey[J]. ACM Computing Surveys, 2013, 46(2):Article No. 20. |