Examples of various speech synthesis systems
| P. Taylor. Analysis and synthesis of intonation using the tilt model. Journal of the Acoustical Society of America, 107(3):1697-1714, 2000. [ .pdf ] |
| P. Taylor, A. W. Black, and R. Caley. Heterogeneous relation graphs as a mechanism for representing linguistic information. Speech Communication, 33:153-174, 2001. |
| K. Richmond, S. King, and P. Taylor. Modelling
the uncertainty in recovering articulation from acoustics. Computer
Speech and Language, 17:153-172, 2003. [ bib | Abstract ] |
| P. A. Taylor. Concept-to-speech by phonological structure
matching. Philosophical Transactions of the Royal Society, Series A,
2000. [ bib | .ps | .pdf ] |
| Simon King and Paul Taylor. Detection of phonological features
in continuous speech using neural networks. Computer Speech and Language,
14(4):333-353, 2000. [ bib | .ps | .pdf | Abstract ] |
| Paul A. Taylor, S. King, S. D. Isard, and H. Wright.
Intonation and dialogue context as constraints for speech
recognition.
Language and Speech, 41(3):493-512, 1998. [ bib | .ps | .pdf ] |
| Andreas Stolcke, N. Coccaro, R. Bates, P. Taylor,
C. Van Ess-Dykema, K. Ries, Elizabeth Shriberg, D. Jurafsky,
R. Martin, and M. Meteer. Dialog act modeling for automatic tagging
and recognition of conversational speech. Computational Linguistics,
26(3), 2000. [ bib | .ps | .pdf ] |
| Paul Taylor and Alan Black. Assigning phrase breaks from part
of speech sequences. Computer Speech and Language, 12:99-117, 1998. [ bib | .ps | .pdf ] |
| Elizabeth Shriberg, R. Bates, P. Taylor, A. Stolcke,
K. Ries, D. Jurafsky, N. Coccaro, R. Martin, M. Meteer,
and C. Van Ess-Dykema. Can prosody aid the automatic classification
of dialog acts in conversational speech? Language and Speech,
41(3-4), 1998. [ bib | .ps | .pdf ] |
| Paul A. Taylor and Amy Isard. SSML: A speech synthesis
markup language. Speech Communication, 21:123-133, 1997. [ bib | .ps | .pdf ] |
| Paul A. Taylor. The rise/fall/connection model of intonation.
Speech Communication, 15:169-186, 1995. [ bib | .ps | .pdf ] |
| P. Taylor. Grapheme-to-phoneme conversion using hidden Markov models.
In Proc. Interspeech, 2005. |
| J. Vepa, S. King, and P. Taylor. New objective
distance measures for spectral discontinuities in concatenative speech
synthesis. In Proc. IEEE 2002 workshop on speech synthesis, Santa
Monica, USA, September 2002. [ bib | .pdf ] |
| J. Vepa, S. King, and P. Taylor. Objective
distance measures for spectral discontinuities in concatenative speech
synthesis. In Proc. ICSLP, Denver, USA, September 2002. [ bib | .pdf ] |
| J. Frankel, K. Richmond, S. King, and P. Taylor.
An automatic speech recognition system using neural networks and linear
dynamic models to recover and model articulatory traces. In Proc. ICSLP,
2000. [ .pdf ] |
| Edmilson Morais, Paul Taylor, and Fabio Violaro. Concatenative
text-to-speech synthesis based on prototype waveform interpolation (a time
frequency approach). In Proc. ICSLP 2000, Beijing, China, 2000. [ bib | .ps | .pdf ] |
| S. King, P. Taylor, J. Frankel, and K. Richmond.
Speech recognition via phonetically-featured syllables. In PHONUS,
volume 5, pages 15-34, Institute of Phonetics, University of the
Saarland, 2000. [ bib | .ps | .pdf | Abstract ] |
| Janet Hitzeman, Alan W. Black, Paul Taylor, Chris Mellish,
and Jon Oberlander. An annotation scheme for concept-to-speech synthesis.
In Proceedings of the European Workshop on Natural Language Generation,
pages 59-66, Toulouse, France, 1999. [ bib | .ps | .pdf ] |
| Paul Taylor and Alan W Black. Speech synthesis by phonological
structure matching. In Eurospeech99, Budapest, Hungary, 1999. [ bib | .ps | .pdf ] |
| Kurt E. Dusterhoff, Alan W. Black, and Paul A. Taylor.
Using decision trees within the tilt intonation model to predict f0
contours. In Eurospeech 99, Budapest, 1999. [ bib | .ps | .pdf ] |
| Simon King, Todd Stephenson, Stephen Isard, Paul Taylor, and Alex
Strachan. Speech recognition via phonetically featured syllables.
In Proc. ICSLP '98, pages 1031-1034, Sydney, Australia, December
1998. [ bib | .ps | .pdf | Abstract ] |
| Andreas Stolcke, E. Shriberg, R. Bates, P. Taylor,
K. Ries, D. Jurafsky, N. Coccaro, R. Martin, M. Meteer,
and C. Van Ess-Dykema. Dialog act modelling for conversational
speech. In AAAI Spring Symposium on Applying Machine Learning to Discourse
Processing, 1998. [ bib | .ps | .pdf ] |
| Janet Hitzeman, Alan W. Black, Paul Taylor, Chris Mellish,
and Jon Oberlander. On the use of automatically generated discourse-level
information in a concept-to-speech synthesis system. In ICSLP98,
volume 6, pages 2763-2768, Sydney, Australia, 1998. [ bib | .ps | .pdf ] |
| Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan Black,
and Kevin Lenzo. Sable: a standard for TTS markup. In ICSLP98,
volume 5, pages 1719-1724, Sydney, Australia, 1998. [ bib | .ps | .pdf ] |
| Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan Black,
and Kevin Lenzo. Sable: a standard for TTS markup. In Third
ESCA workshop on speech synthesis, pages 27-30, Jenolan Caves, Blue
Mountains, Australia, 1998. [ bib | .ps | .pdf ] |
| Paul A. Taylor. The Tilt intonation model. In ICSLP98,
Sydney, 1998. [ bib | .ps | .pdf ] |
| Paul A Taylor, Alan Black, and Richard Caley. The architecture
of the festival speech synthesis system. In The Third ESCA Workshop
in Speech Synthesis, pages 147-151, Jenolan Caves, Australia, 1998. [ bib | .ps | .pdf ] |
| R. Sproat, Paul A. Taylor, M. Tanenblatt, and Amy
Isard. A markup language for text-to-speech synthesis. In Eurospeech
97, 1997. [ bib | .ps | .pdf ] |
| Alan W. Black and Paul A. Taylor. Assigning phrase
breaks from part-of-speech sequences. In Eurospeech97, volume 2,
pages 995-998, Rhodes, Greece, 1997. [ bib | .ps | .pdf ] |
| Dan Jurafsky, A. Stolcke, E. Shriberg, R. Bates,
P. Taylor, K. Ries, N. Coccaro, R. Martin, M. Meteer,
and C. Van Ess-Dykema. Automatic detection of discourse structure
for speech recognition and understanding. In 1997 IEEE Workshop on
Speech Recognition and Understanding, Santa Barbara, 1997. [ bib | .ps | .pdf ] |
| Alan W. Black and Paul A. Taylor. Automatically
clustering similar units for unit selection in speech synthesis. In
Eurospeech97, volume 2, pages 601-604, Rhodes, Greece, 1997. [ bib | .ps | .pdf ] |
| Helen Wright and Paul A. Taylor. Modelling intonational
structure using hidden Markov models. In ESCA workshop on Intonation:
Theory Models and Applications, Athens, Greece, 1997. [ bib | .ps | .pdf ] |
| Alan W. Black and Paul A. Taylor. The Festival Speech
Synthesis System: System documentation. Technical Report HCRC/TR-83, Human
Communication Research Centre, University of Edinburgh, Scotland, UK, 1997.
Available at http://www.cstr.ed.ac.uk/projects/festival.html. [ bib ] |
| Paul A. Taylor, Simon King, Stephen Isard, Helen Wright, and
Jacqueline Kowtko. Using intonation to constrain language models in
speech recognition. In Proc. Eurospeech'97, Rhodes, 1997. [ bib | .pdf | Abstract ] |
| Paul A. Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King,
and Jacqueline Kowtko. Using prosodic information to constrain language
models for spoken dialogue. In Proc. ICSLP '96, Philadelphia,
1996. [ bib | .ps | .pdf ] |
| Stephen D. Isard, S. King, P. A. Taylor, and J. Kowtko.
Prosodic information in a speech recognition system. In IEEE workshop
on speech recognition, Snowbird, Utah, 1995. [ bib ] |
| Stephen Isard, Simon King, Paul A. Taylor, and Jacqueline Kowtko.
Prosodic information in a speech recognition system intended for dialogue.
In IEEE Workshop in speech recognition, Snowbird, Utah, 1995. [ bib | Abstract ] |
| Paul A. Taylor and Amy Isard. SSML: A speech synthesis
markup language. In 2nd Speak! Workshop: Speech Generation in Multimodal
Information Systems and Practical Applications, Darmstadt, 1995. [ bib ] |
| Paul A. Taylor. Using neural networks to locate pitch
accents. In Proc. Eurospeech '95, Madrid, 1995. [ bib | .ps | .pdf ] |
| Eric Sanders and Paul A. Taylor. Using statistical models
to predict phrase boundaries for speech synthesis. In Proc. Eurospeech
'95, Madrid, 1995. [ bib | .ps | .pdf ] |
| Alan W. Black and Paul A. Taylor. A framework for
generating prosody from high level linguistic descriptions. In Spring
meeting, Acoustical Society of Japan, 1994. [ bib ] |
| Alan W. Black and Paul A. Taylor. Assigning intonation
elements and prosodic phrasing for English speech synthesis from high level
linguistic input. In ICSLP94, volume 2, pages 715-718, Yokohama,
Japan, 1994. [ bib | .ps | .pdf ] |
| Alan W. Black and Paul A. Taylor. CHATR: A generic
speech synthesis system. In COLING '94, volume 2, pages 983-986,
Kyoto, Japan, 1994. [ bib | .ps | .pdf ] |
| Paul A. Taylor and Alan W. Black. Synthesizing conversational
intonation from a linguistically rich input. In Second ESCA/IEEE Workshop
on Speech Synthesis, New York, 1994. [ bib | .ps | .pdf ] |
| Paul A. Taylor. Automatic recognition of intonation from
F0 contours using the rise/fall/connection model. In Proc. Eurospeech
'93, Berlin, 1993. [ bib | .ps | .pdf ] |
| Paul A. Taylor. Synthesizing intonation using the RFC model.
In Proc. ESCA workshop on prosody, Lund, Sweden, 1993. [ bib | .ps | .pdf ] |
| Paul A. Taylor and S. D. Isard. A new model of intonation
for use with speech recognition and synthesis. In International Conference
on Spoken Language Processing, Banff, Canada, 1992. [ bib | .ps | .pdf ] |
| Paul A. Taylor. A phonetic model of English intonation.
PhD thesis, University of Edinburgh, 1992. [ bib | .ps | .pdf ] |
| Paul A. Taylor, I. A. Nairn, A. M. Sutherland, and
M. A. Jack. A real time speech synthesis system. In Proc.
Eurospeech '91, Genova, Italy, 1991. [ bib ] |
| Paul A. Taylor, I. A. Nairn, A. M. Sutherland, and
M. A. Jack. A real time speech synthesis system. In IEEE
symposium, 1991. [ bib ] |
| Paul A. Taylor, I. A. Nairn, A. M. Sutherland, and
M. A. Jack. An interactive synthetic speech generation system.
In IEEE Colloquium on Systems and Applications of Man-Machine Interaction
Using Speech I/O, London, 1991. [ bib ] |
| Paul A. Taylor and Stephen D. Isard. Automatic diphone
segmentation. In Proc. Eurospeech '91, Genova, Italy, 1991. [ bib ] |
| Paul A. Taylor and Stephen D. Isard. Automatic diphone
segmentation using hidden Markov models. In SST-90, Third International
Australian Conference in Speech Science and Technology, Melbourne,
Australia, 1990. [ bib ] |