|
基于深度学习的多模态服装风格检索
|
Abstract:
近年来虽然深度学习在服装检索领域有了很多不错的成果,但是研究者们对服装风格研究却很少。消费者往往通过自己喜欢的风格来检索搭配的服装,或者是消费者更愿意检索到与自己穿衣风格相似的服装。现有的服装风格研究者只是将服装风格进行分类,通过用户输入图像为消费者识别喜欢的风格,然而这样的检索结果只能返回与该图像风格相似的服装,而不能与输入的图像完成搭配。因此本文从服装风格的整体兼容性出发,将每件服装单品视作单词,按照Word2vec中的单词相似性概念,分别提出了基于文本的风格检索模型以及基于图像的风格检索模型。最后将获取到的两种模态信息进行特征融合,提出了一个多模态风格检索模型。实验结果表明,在Polyvore多模态数据集上,按照前人研究者的服装风格相似性评判标准,多模态融合的服装风格检索方法比单模态风格检索以及其他多模态混合风格检索方法所获取的结果列表的平均相似度更佳。
Although deep learning has achieved many good results in the field of Fashion retrieval in recent years, researchers have little research on Fashion style. Consumers often search the matching clothing through their favorite style, or consumers are more willing to search the clothing similar to their own style. The existing Fashion style researchers only classify the Fashion style and identify the favorite style for consumers through the user’s input image. However, such retrieval results can only return the clothing similar to the image style, but can not match with the input image. Therefore, starting from the overall compatibility of Fashion style, this paper regards each piece of clothing as a word, and proposes text-based style retrieval model and image-based style retrieval model respectively according to the concept of word similarity in Word2vec. Finally, a multimodal style retrieval model is proposed based on feature fusion of the two modal information obtained. The experimental results show that on the Polyvore multimodal data set, according to the previous researchers’ Fashion style similarity evaluation criteria, the multimodal fusion Fashion style retrieval method has better average similarity than the single modal style retrieval and other multimodal hybrid style retrieval methods.
[1] | Gharaei, N.Y., Dadkhah, C. and Daryoush, L. (2021) Content-Based Clothing Recommender System Using Deep Neural Network. 2021 26th International Computer Conference, Computer Society of Iran (CSICC), Tehran, 3-4 March 2021, 1-6. |
[2] | 李扬, 黄荣, 董爱华. 基于改进Bilinear-CNN的服装图像风格识别[J]. 东华大学学报(自然科学版), 2021, 47(3): 90-95. |
[3] | Yang, S.-Y., Zhong, Y.-Q. and Wang, X. (2021) Clothing Recommendation with Style Recog-nition. 2021 14th Textile Bioengineering and Informatics Symposium Proceedings (TBIS 2021), Rube, 6-9 July 2021, 589-596. |
[4] | Hu, M.Y. and Zhong, Y.Q. (2020) Convolutional Neural Networks for Fashion Style Classification. Proceedings of 2020 SAISTA, Beijing, 12-14 August 2020, 208-212. |
[5] | Zhu, M., Jiang, N., Feng, X., et al. (2022) Style Analysis of Clothing from Fashion Shows Based on Deep Learning. 2022 5th International Conference on Data Science and Information Technology (DSIT), Shanghai, 22-24 July 2022, 1-6. https://doi.org/10.1109/DSIT55514.2022.9943872 |
[6] | Tautkute, I., Moejko, A., Stokowiec, W., et al. (2017) What Looks Good with My Sofa: Multimodal Search Engine for Interior Design. 2017 Federated Conference on Com-puter Science and Information Systems (FedCSIS), Prague, 3-6 September 2017, 1275-1282 |
[7] | Kenter, T., Borisov, A. and De Rijke, M. (2016) Siamese Cbow: Optimizing Word Embeddings Forsentence Representations. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 1, 941-951.
https://doi.org/10.18653/v1/P16-1089 |
[8] | Han, X., Wu, Z., Jiang, Y.G., et al. (2017) Learning Fashion Compati-bility with Bidirectional LSTMs. Proceedings of the 25th ACM international conference on Multimedia, Silicon Valley, 23-27 October 2017, 1078-1086. |
[9] | Tautkute, I., Trzcinski, T., Skorupa, A.P., et al. (2019) Deep Style: Multimodal Search Engine for Fashion and Interior Design. IEEE Access, 7, 84613-84628. https://doi.org/10.1109/ACCESS.2019.2923552 |
[10] | 苏卓, 柯司博, 王若梅, 等. 深度多模态融合服装风格检索[J]. 中国图象图形学报, 2021, 26(4): 857-871. |