Proceedings:
Intelligent Integration and Use of Text, Image, Video, and Audio Corpora
Volume
Issue:
Papers from the 1997 AAAI Spring Symposium
Track:
Contents
Downloads:
Abstract:
In theory, speech recognition technology can make any spoken words in video or audio media usable for text indexing, search and retrieval. This article describes the News-on- Demand application created within the Informedia TM Digital Video Library project and discusses how speech recognition is used in transcript creation from video, alignment with closed-captioned transcripts, audio paragraph segmentation and a spoken query interface. Speech recognition accuracy varies dramatically depending on the quality and type of data used. Informal information retrieval tests show that reasonable recall and precision can be obtained with only moderate speech recognition accuracy.
Spring
Papers from the 1997 AAAI Spring Symposium