A saliency-based method for generating video summaries is presented, which exploits coupled audiovisual information from both media streams. Efficient and advanced speech and image processing algorithms to detect key frames that are acoustically and visually salient are used. Promising results are shown from experiments on a movie database.
IEEE Int'l Workshop on Multimedia Signal Processing , October 2007.
[ Bibtex ] [ PDF ]