|
计算机科学 2007
Multi-modal Analysis of Sports Video for Semantics
|
Abstract:
A semantic structure of sports video, exampled with soccer, and corresponding framework for semantics analysis are proposed. Video is parsed into pure video stream and audio stream. Video is segmented into shots according to low/physical features, and then into syntactic shots with the help of specific middle level contents. Audio can be extracted meaningful middle contents, e.g. excited speech of commenter. According to rules of soccer broadcasting, semantics of highlights can be analyzed based on syntactic contents from video and audio streams.