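Acta Electronica Sinica (电子学报), 2012

Object Retrieval Method Based on a Randomized Visual Dictionary Group and Contextual Semantic Information

pp. 2472-2480

ZHAO Yong-wei, GUO Zhi-gang, LI Bi-cheng, GAO Hao-lin, CHEN Gang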

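Keywords: object retrieval, contextual semantic information, exact Euclidean locality sensitive hashing (E2LSH), randomized visual dictionary group, K-L divergence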


Abstract:

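The traditional Bag of Visual Words (BoVW) method suffers from low time efficiency, high memory consumption, and the synonymy and ambiguity of visual words. Moreover, when the information contained in the target region is incorrect or insufficient to express the user's retrieval intent, it fails to produce satisfactory results. To address these problems, this paper proposes an object retrieval method based on a randomized visual dictionary group and contextual semantic information. First, the method uses Exact Euclidean Locality Sensitive Hashing (E2LSH) to cluster local feature points, generating a group of randomized visual dictionaries that supports dynamic expansion. Next, it builds an object model containing contextual semantic information from the query object and the visual units surrounding it. Finally, it introduces the Kullback-Leibler (K-L) divergence as the similarity measure to complete object retrieval. Experimental results show that the new method improves the distinguishability of the target object and effectively improves retrieval performance.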
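
The two computational steps named in the abstract, hashing local features into visual words and comparing word distributions with the K-L divergence, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the standard p-stable form of an E2LSH hash, h(v) = floor((a·v + b) / w) with Gaussian a and uniform b, treats each hash bucket as one visual word, and uses simple additive smoothing to avoid zero probabilities. All function names and the parameters `k`, `w`, `num_dicts`, and `eps` are hypothetical choices for illustration; the abstract does not specify them, and the contextual object model is not shown.

```python
import math
import random
from collections import Counter


def make_e2lsh_hash(dim, k, w, rng):
    """Build one hash g(v) = (h_1(v), ..., h_k(v)) with
    h_i(v) = floor((a_i . v + b_i) / w), a_i ~ N(0, 1), b_i ~ U[0, w]."""
    params = [
        ([rng.gauss(0.0, 1.0) for _ in range(dim)], rng.uniform(0.0, w))
        for _ in range(k)
    ]

    def g(v):
        return tuple(
            math.floor((sum(a_j * v_j for a_j, v_j in zip(a, v)) + b) / w)
            for a, b in params
        )

    return g


def build_dictionary_group(dim, num_dicts, k, w, seed=0):
    """A group of independent hashes; each one plays the role of one
    randomized visual dictionary (assumption: bucket key = visual word)."""
    rng = random.Random(seed)
    return [make_e2lsh_hash(dim, k, w, rng) for _ in range(num_dicts)]


def word_histogram(features, g):
    """Count how many feature vectors fall into each bucket of g."""
    return Counter(g(v) for v in features)


def kl_divergence(p_counts, q_counts, eps=1e-6):
    """KL(P || Q) over the union of observed words, with additive
    smoothing (an assumption) so unseen words keep nonzero probability."""
    vocab = set(p_counts) | set(q_counts)
    p_total = sum(p_counts.values()) + eps * len(vocab)
    q_total = sum(q_counts.values()) + eps * len(vocab)
    kl = 0.0
    for word in vocab:
        p = (p_counts.get(word, 0) + eps) / p_total
        q = (q_counts.get(word, 0) + eps) / q_total
        kl += p * math.log(p / q)
    return kl
```

In this sketch a query region and a candidate region would each be turned into one histogram per dictionary, and a smaller divergence would indicate a closer match. Because a new image only needs to be hashed with the existing functions, buckets can be added without re-clustering, which is one plausible reading of "supports dynamic expansion".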

