%0 Journal Article
%T An Analysis of OpenSeeD for Video Semantic Labeling
%A Jenny Zhu
%J Journal of Computer and Communications
%P 59-71
%@ 2327-5227
%D 2025
%I Scientific Research Publishing
%R 10.4236/jcc.2025.131005
%X Semantic segmentation is a core task in computer vision that allows AI models to interact with and understand their surrounding environment. Much as humans subconsciously segment the scenes they see, this ability is crucial for scene understanding. However, a challenge many semantic learning models face is a lack of data. Existing video datasets are limited to short, low-resolution videos that are not representative of real-world examples. Thus, one of our key contributions is a customized semantic segmentation version of the Walking Tours Dataset that features hour-long, high-resolution, real-world data from tours of different cities. Additionally, we evaluate the performance of the open-vocabulary semantic model OpenSeeD on our custom dataset and discuss future implications.
%K Semantic Segmentation
%K Detection
%K Labeling
%K OpenSeeD
%K Open-Vocabulary
%K Walking Tours Dataset
%K Videos
%U http://www.scirp.org/journal/PaperInformation.aspx?PaperID=140362