|
|
Speech and Language Processing - Module 4F11Module LecturersProf. Phil Woodland pcw@eng.cam.ac.uk and Dr. Bill Byrne bill.byrne@eng.cam.ac.ukSyllabusIntroduction & Applications. Speech production mechanisms, types of speech sound, source-filter model, applications of speech and text processing. Lecture notes available in [lect1.pdf]. Links for the demos shown in lecture 1 are available here . FFT based methods. All-pole filter models, calculation of LP coefficients. LP Spectrum. Cepstral analysis. Front-end analysis for speech recognition (MFCCs). Lecture notes available in [lect23_2up.pdf] Statistical speech recognition, task complexity. Hidden Markov models. Continuous density HMM parameter estimation, Baum-Welch algorithm, Viterbi algorithm, Gaussian mixture models for HMMs. Lecture notes available in [lect45.pdf] Large vocabulary speech recogntion, continuous speech training, limitations of word models, context dependent phones, parameter tying, WSJ performance. Lecture notes available in [lect6.pdf] Perplexity, N-gram language models, discounting, interpolation. Continuous speech recognition. Pruning. Integrating context dependent HMMs and N-gram language models. Efficient realization of probabilistic models for sequence processing. Transduction, composition, determinization, minimum-cost search. WFSTs in ASR search and other language processing applications. Statistical pattern processing approaches to translation. Automatic evaluation of translation quality. Parallel text as training data. Models of word and phrase alignment in translation. Model estimation procedures. Phrase-based translation systems. Implementation via WFSTs. Introduction to TTS. Formant-based synthesis and concatenative synthesis. PSOLA. Examples papersThere will be two examples papers and two examples classes for the course. The solutions to the examples papers will be available on-line (after examples classes).Examples Paper 1 available in [4F11egPaper1.pdf]. The examples class for Examples Paper 1 will take place on 17th February 2012 at 12 noon
Exam FormatAssessment by 1.5 hour exam: 3 questions from 4.
Course Books |