Automatic Content-Based Retrieval of Broadcast News

ACM Multimedia 95 - Electronic Proceedings
November 5-9, 1995
San Francisco, California

Automatic Content-Based Retrieval of Broadcast News

Martin G. Brown
: Olivetti Research Limited, 24a Trumpington Street; Cambridge, CB2 1QA, UK; Voice number mgb@cam-orl.co.uk
Jonathan T. Foote: Cambridge University Engineering Department,; Cambridge, CB2 1PZ, United Kingdom; +44 1223 332 800 jtf@eng.cam.ac.uk
Gareth J. F. Jones: Cambridge University Engineering Department,; Cambridge, CB2 1PZ, United Kingdom; +44 1223 332 800 gjfj@eng.cam.ac.uk
Karen Sparck Jones: Cambridge University Computer Laboratory; Cambridge, CB2 3QG United Kingdom; +44 1223 332 654 ksj@cl.cam.ac.uk
Steve J. Young: Cambridge University Engineering Department,; Cambridge, CB2 1PZ, United Kingdom; +44 1223 332 654 sjy@eng.cam.ac.uk

This paper presents current work on a video retrieval project at Cambridge University and Olivetti Research Limited (ORL). We show that statistical methods developed for text retrieval are also effective for retrieving and browsing multimedia documents. These methods allow rapid retrieval of news broadcasts by information content determined from teletext subtitles. Information retrieval results for experiments performed on a large archive of news broadcasts are presented. This is made possible by the ORL Medusa system, which allows practical recording, storage, and playback of tens of gigabytes of multimedia data. This work is a step towards practical retrieval of multimedia documents, where the information content is determined from speech recognition performed on the audio soundtrack. We describe the project background, the ORL Medusa multimedia system, and retrieval application, as well as the news broadcast corpus and methods of browsing the retrieved news stories.

1. Introduction
2. Medusa: Multimedia on an ATM Network
- 2.1. The Medusa software environment
- 2.2. The Multimedia Repository
3. The News Broadcast Archive
- 3.1. Long-term broadcast storage
4. News Broadcast Retrieval
- 4.1. Information Retrieval
  - 4.1.1. Match score computation
  - 4.1.2. Broadcast Segmentation
- 4.2. IR Experiments
5. The News Broadcast Retrieval User Interface
- 5.1. A Video Browser
6. Future Work and Conclusions
7. Acknowledgements
References

1. Introduction

Recent years have seen a rapid increase in the availability and use of multimedia applications. These systems can generate large amounts of audio and video data which can be expensive to store and unwieldy to access. The Video Mail Retrieval (VMR) project at Cambridge University and Olivetti Research Limited (ORL), Cambridge, UK, is addressing these problems by developing systems to retrieve stored video material using the spoken audio soundtrack [1,16]. Specifically, the project focuses on the content-based location, retrieval, and playback of potentially relevant data. The primary goal of the VMR project is to develop a video mail retrieval application for the Medusa multimedia environment developed at ORL.

Previous work on the VMR project demonstrated practical retrieval of audio messages using speech recognition for content identification [8,4]. Because of the limited number of available audio messages, a much larger archive of television news broadcasts (along with accompanying subtitle transcriptions) is currently being collected. This will serve as a testbed for new methods of storing and accessing large amounts of audio/video data. The enormous potential size of the news broadcast archive dramatically illustrates the need for ways of automatically finding and retrieving information from the archive. Quantitative experiments demonstrate that Information Retrieval (IR) methods developed for searching text archives can accurately retrieve multimedia data, given suitable subtitle transcriptions. In addition, the same techniques can be used to rapidly locate interesting areas within an individual news broadcast.

Although large multimedia archives will be more common in the future, today they require a specialised and high-performance hardware infrastructure. The work presented here relies on the the Medusa system developed at ORL, which includes distributed, high-capacity multimedia repositories. This paper begins with an overview of the ORL Medusa technology. Subsequent sections describe the collection and storage of a BBC television broadcast news archive, a retrieval methodology for location of potentially relevant sections in response to users' requests, and a graphical user interface for content-based retrieval and browsing of news broadcasts.