Automatic Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives

Automatic Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives” by W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and W.-J. Zhu. IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing, July 2004, pp. 420-435 (16 pages).

Abstract

The MALACH project has the goal of developing the technologies needed to facilitate access to large collections of spontaneous speech. Its aim is to dramatically improve the state of the art in key Automatic Speech Recognition (ASR), Natural Language Processing (NLP) technologies for use in large-scale retrieval systems. The project leverages a unique collection of oral history interviews with survivors of the Holocaust that has been assembled and extensively annotated by the Survivors of the Shoah Visual History Foundation. This paper describes the collection, 116,000 hours of interviews in 32 languages, and the way in which system requirements have been discerned through user studies. It discusses ASR methods for very difficult speech (heavily accented, emotional, and elderly spontaneous speech), including transcription to create training data and methods for language modeling and speaker adaptation. Results are presented for for English and Czech. NLP results are presented for named entity tagging, topic segmentation, and supervised topic classification, and the architecture of an integrated search system that uses these results is described.

BibTeX entry:

@article{malach_sptrans03,
   author = {W. Byrne and D. Doermann and M. Franz and S. Gustman and J.
	Haji{\vc} and D. Oard and M. Picheny and J. Psutka and B.
	Ramabhadran and D. Soergel and T. Ward and W.-J. Zhu},
   title = {Automatic Recognition of Spontaneous Speech for Access to
	Multilingual Oral History Archives},
   journal = {IEEE Transactions on Speech and Audio Processing, Special
	Issue on Spontaneous Speech Processing},
   pages = {420--435 (16 pages)},
   month = jul,
   year = {2004},
   url = {http://dx.doi.org/10.1109/TSA.2004.828702}
}

Back to Bill Byrne publications.