Matt Shannon
Position: PhD student in statistical speech synthesis
Email: sms46 AT cam ac uk
Supervisor: Dr Bill Byrne
Research
Acoustic modelling for statistical speech synthesis. For the first part of my PhD I worked on the EMIME (Effective Multilingual Interaction in Mobile Environments) project on personalized speech synthesis.
Software
-
armspeech
Flexible Python framework for probabilistic modelling of speech with a focus on autoregressive models. Allows the use of more complicated autoregressive-style output distributions than the HTS implementation below. Includes example experiments.
-
Autoregressive HMM for HTS
An implementation of the autoregressive HMM built on top of HTS (HMM-based Speech Synthesis System). It supports embedded re-estimation, decision tree clustering and synthesis using the autoregressive HMM. A usage example is available (autoregressive HMM version of the HTS demo).
Publications
-
M. Shannon, H. Zen and W. Byrne (2011)
The Effect of Using Normalized Models in Statistical Speech Synthesis
Proc. Interspeech 2011
[ postprint | bib | slides ]
-
M. Shannon and W. Byrne (2010)
Autoregressive clustering for HMM speech synthesis
Proc. Interspeech 2010
[ postprint | bib | poster ]
-
M. Shannon and W. Byrne (2009)
Autoregressive HMMs for speech synthesis
Proc. Interspeech 2009
[ postprint | bib | slides ]
-
M. Shannon and W. Byrne (2009)
A formulation of the autoregressive HMM for speech synthesis
Technical Report CUED/F-INFENG/TR.629
[ pdf | bib ]
-
M. Shannon and M.J.F. Gales (2008)
Sampling Methods for Instantaneous Speaker Adaptation
MPhil thesis, University of Cambridge, UK
[ final with corrections | final | bib | slides ]
Talks
-
M. Shannon (Mar 2011)
The effect of normalization -- a case study in speech synthesis
Machine Learning RCC, University of Cambridge, UK
[ slides | talks.cam ]
-
M. Shannon and H. Zen (Jan 2011)
Modelling trajectories in statistical speech synthesis
Cambridge statistical speech synthesis (SSS) seminar series
University of Cambridge, UK
[ my slides | Heiga's slides | talks.cam ]
-
M. Shannon and S. Bratières (May 2010)
Topics in Statistical Machine Translation
Machine Learning RCC, University of Cambridge, UK
[ talks.cam ]
-
M. Shannon and W. Byrne (May 2010)
Autoregressive HMMs for speech synthesis
EMIME workshop, Cambridge, UK
[ slides ]
-
M. Shannon (Jan 2009)
A Hierarchical Bayesian Language Model based on Pitman-Yor Processes
Machine Learning RCC, University of Cambridge, UK
[ talks.cam ]
Teaching
- Demonstrator for the CSTIT MPhil course, 2008-9 and 2009-10
Contact Information
Baker Building, BE5-02
Engineering Department
Trumpington Street, Cambridge
CB2 1PZ, United Kingdom
