CASS: A Phonetically Transcribed Corpus Of Mandarin Spontaneous Speech

Download: PDF.

“CASS: A Phonetically Transcribed Corpus Of Mandarin Spontaneous Speech” by A. LI, F. ZHENG, W. Byrne, P. Fung, T. Kamm, Y. LIU, Z. SONG, U. Ruhi, V. Venkataramani, and X. CHEN. In Proc. of the International Conference on Spoken Language Processing, 2000.

Abstract

A collection of Chinese spoken language has been collected and phonetically annotated to capture spontaneous speech and language effects. The Chinese Annotated Spontaneous Speech (CASS) corpus contains phonetically transcribed spontaneous speech. This corpus was created to begin to collect samples of most of the phonetic variations in Mandarin spontaneous speech due to pronunciation effects, including allophonic changes, phoneme reduction, phoneme deletion and insertion, as well as duration changes. It is intended for use in pronunciation modeling for improved automatic speech recognition and will be used at the 2000 Johns Hopkins University Language Engineering Workshop by the project on Pronunciation Modeling of Mandarin Casual Speech.

Download: PDF.

BibTeX entry:

@inproceedings{cass_icslp00,
   author = {A. LI and F. ZHENG and W. Byrne and P. Fung and T. Kamm and
	Y. LIU and Z. SONG and U. Ruhi and V. Venkataramani and X. CHEN},
   title = {{CASS}: A Phonetically Transcribed Corpus Of {M}andarin
	Spontaneous Speech},
   booktitle = {Proc. of the International Conference on Spoken Language
	Processing},
   pages = {(4 pages)},
   year = {2000}
}

Back to Bill Byrne publications.