[Univ of Cambridge] [Dept of Engineering]
 

 

Steve Young

 



 

Recent Publications (Google Scholar Profile)

2021


S. Young (2021). "Hey Cyba: the Inner Workings of a Conversational Agent." Cambridge University Press, Info

2020


TH. Wen and S. Young (2020). "Recurrent neural network language generation for spoken dialogue systems." Computer Speech and Language, 63, Sept 2020.

B. Thomson, D. Vandyke, G. Frazzingaro, S. Delgado, T. Gunter, T. Voice, T. Helgason, S. Young, D. O Seaghdha, D. Kaplan (2020). "Optimizing dialogue policy decisions for digital assistants using implicit feedback." US Patent 10810274.

2019


B. Thomson, A. Johannsen, D. O Seaghdha, F. Flego, L. Simonelli, S. Young, T. Voice, T. Helgason (2019). "Hierarchical belief states for digital assistants." US Patent 10482874.

2018


S. Ultes, P. Budzianowski, I. Casanueva, L. Rojas-Barahona, B. Tseng, Y. Wu, S. Young and M. Gasic (2018). "Addressing Objects and Their Relations: The Conversational Entity Dialogue Model." Sigdial 18, Melbourne, Australia. PDF

P-H. Su, M. Gasic and S. Young (2018). "Reward Estimation for Dialogue Policy Optimisation." Computer Speech and Language, 51(1):24-43. PDF

LM. Rojas-Barahona, S. Ultes, P. Budzianowski, I. Casanueva, M. Gasic, B-H. Tseng, S. Young (2018). "Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems." arXiv:1806.05484.

2017


I. Casanueva, P. Budzianowski, P.-H. Su, N. Mrksic, T.-H. Wen, S. Ultes, L. Rojas-Barahona, S. Young and M. Gasic (2017). "A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management." arxiv:1711.11023v1. PDF

P.-H. Su, P. Budzianowski, S. Ultes, M. Gasic and S. Young (2017). "Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management." SigDial 17, Saarbrucken, Germany. PDF

S. Ultes, P. Budzianowski, I. Casanueva, N. Mrksic, L. Rojas-Barahona, P.-H. Su, T.-H. Wen, M. Gasic and S. Young (2017). "Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning." Sigdial 17, Saarbrucken, Germany. PDF

S. Ultes, P. Budzianowski, I. Casanueva, N. Mrksic, L. Rojas-Barahona, P.-H. Su, T.-H. Wen, M. Gasic and S. Young (2017). "Domain-independent User Satisfaction Reward Estimation for Dialogue Policy Learning." Interspeech 2017, Stockholm. PDF

N. Mrksic, D. O Seaghdha, T.-H. Wen, B. Thomson and S. Young (2017). "Neural Belief Tracker: Data-Driven Dialogue State Tracking." ACL 2017, Vancouver, Canada. PDF

I. Vulic, N. Mrksic, R. Reichart, D. O Seaghdha, S. Young and A. Korhonen (2017). "Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules." ACL 2017, Vancouver, Canada. PDF

N. Mrksic, I. Vulic, D. O Seaghdha, I. Leviant, R. Reichart, M. Gasic, A. Korhonen and S. Young (2017). "Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints." Trans. ACL, 5:309-324. PDF

T.-H. Wen, Y. Miao, P. Blunsom, and S. Young (2017). "Latent Intention Dialogue Models." ICML 2017, Sydney, Australia. PDF

T.-H. Wen, D. Vandyke, N. Mrksic, M. Gasic, L. Rojas-Barahona, P.-H. Su, S. Ultes and S. Young (2017). "A Network-based End-to-End Trainable Task-oriented Dialogue System." EACL 2017, Valencia, Spain. PDF

M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, S. Ultes, D. Vandyke, T.-H. Wen and S. Young (2017). "Dialogue manager domain adaptation using Gaussian process reinforcement learning." Computer Speech and Language, 45(5):552-569. PDF

2016


L. Rojas-Barahona, M. Gasic, N. Mrksic, P.-H. Su, S. Ultes, T.-H. Wen and S. Young (2016). "Exploiting Sentence and Context Representations in Deep Neural Models for Spoken Language Understanding." Coling, Osaka, Japan. PDF

T.-H. Wen, M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, S. Ultes, D. Vandyke and S. Young (2016). "Conditional Generation and Snapshot Learning in Neural Dialogue Systems." EMNLP 2016. Austin, Tx. PDF

P.-H. Su, M. Gasic, N. Mrksic, L. Rojas-Barahona, S. Ultes, D. Vandyke, T.-H. Wen and S. Young (2016). "Continuously Learning Neural Dialogue Management." arXiv:1606.02689v1. PDF

D. Litman, S. Young, M. Gales, K. Knill, K. Ottewell, R. van Dalen and D. Vandyke (2016). "Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English." Sigdial. Los Angeles, CA. PDF

P-H. Su, M. Gasic, N. Mrksic, L. Rojas-Barahona, S. Ultes, D. Vandyke, T-H. Wen, and S. Young (2016). "On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems." ACL 2016, Berlin. [Best student paper] PDF

T.-H. Wen, M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, D. Vandyke and S. Young (2016). "Multi-domain Neural Network Language Generation for Spoken Dialogue Systems." NAACL HLT 2016, San Diego. PDF

N. Mrksic, D. O Seaghdha, B. Thomson, M. Gasic, L. Rojas-Barahona, P.-H. Su, D. Vandyke, T.-H. Wen and S. Young (2016). "Counter-fitting Word Vectors to Linguistic Constraints." NAACL HLT 2016, San Diego. PDF

2015


M. Gasic, N. Mrksic, P.-H. Su, D. Vandyke, T.-H. Wen and S. Young (2015). "Policy Committee for Adaptation in Multi-domain Spoken Dialogue Systems." IEEE ASRU 2015, Scottsdale, AZ. PDF

D. Vandyke, P.-H. Su, M. Gasic, N. Mrksic, T.-H. Wen and S. Young (2015). "Multi-Domain Dialogue Success Classifiers for Policy Training." IEEE ASRU 2015, Scottsdale, AZ. PDF

T.-H. Wen, M. Gasic, N. Mrksic, L. Rojas-Barahona, P.-H. Su, D. Vandyke and S. Young (2015). "Toward Multi-domain Language Generation using Recurrent Neural Networks." NIPS 2015 Workshop on Machine Learning for Spoken Language Understanding and Interaction, Montreal. PDF

T-H. Wen, M. Gasic, N. Mrksic, P-H. Su, D. Vandyke and S. Young (2015). "Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems." EMNLP 2015, Lisbon, Portugal. [Best paper] PDF

T-H. Wen, M. Gasic, D. Kim, N. Mrksic, P-H. Su, D. Vandyke and S. Young (2015). "Stochastic Language Generation in Dialogue using Recurrent Neural Networks with Convolutional Sentence Reranking." Sigdial 2015, Prague, Cz. [Best paper] PDF

P-H. Su, D. Vandyke, M. Gasic, N. Mrksic, T-H. Wen, and S. Young (2015). "Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems." Sigdial 2015, Prague, Cz. PDF

Z. Wang, Y. Stylianou, T-H. Wen, P-H. Su and S. Young (2015). "Learning Domain-Independent Dialogue Policies via Ontology Parameterisation." Sigdial 2015, Prague, Cz. PDF

N. Mrksic, D. O'Seaghdha, B. Thomson, M. Gasic, P-H. Su, D. Vandyke, T-H. Wen and S. Young (2015). "Multi-domain Dialog State Tracking using Recurrent Neural Networks." ACL 2015, Beijing. PDF

P-H. Su, D. Vandyke, M. Gasic, D. Kim, N. Mrksic, T-H. Wen and S. Young (2015). "Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems." Interspeech 2015, Dresden. PDF

M. Gasic, D. Kim, P. Tsiakoulis and S. Young (2015). "Distributed Dialogue Policies for Multi-Domain Statistical Dialogue Management." IEEE ICASSP 15, Brisbane, Australia. PDF(original) PDF(updated)

2014


M. Henderson, B. Thomson and S. Young (2014). "Robust Dialog State Tracking using Delexicalised Recurrent Neural Networks and Unsupervised Adaptation." IEEE SLT 2014, Lake Tahoe, NV. PDF

D. Kim, M. Henderson, M. Gasic, P. Tsiakoulis and S. Young (2014). "The Use of Discriminative Belief Tracking in POMDP-based Dialogue Systems." IEEE SLT 2014, Lake Tahoe, NV. PDF

M. Gasic, D. Kim, P. Tsiakoulis, C. Breslin, M. Henderson, M. Szummer, B. Thomson, and S. Young (2014). "Incremental on-line adaptation of POMDP-based dialogue managers to extended domains." Interspeech 2014, Singapore. PDF

D. Kim, C. Breslin, P. Tsiakoulis, M. Gasic, M. Henderson, and S. Young (2014). "Inverse Reinforcement Learning for Micro-Turn Management." Interspeech 2014, Singapore. PDF

P. Tsiakoulis, C. Breslin, M. Gasic, M. Henderson, D. Kim, and S. Young (2014). "Dialogue Context Sensitive Speech Synthesis using Factorized Decision Trees." Interspeech 2014, Singapore. PDF

F. Mairesse, S. Young (2014). "Stochastic Language Generation in Dialogue using Factored Language Models." Computational Linguistics, 40(4):763-799. PDF

M. Gasic and S. Young (2014). "Gaussian processes for POMDP-based dialogue manager optimization." IEEE Trans. Audio, Speech and Language Processing, 22(1):28-40. PDF

M. Henderson, B. Thomson and S. Young (2014). "Word-Based Dialog State Tracking with Recurrent Neural Networks." SigDial 2014, Philadelphia, PA. PDF

S. Young, C. Breslin, M. Gasic, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis and E. Tzirkel Hancock (2014). "Evaluation of Statistical POMDP-based Dialogue Systems in Noisy Environments." International Workshop Spoken Dialogue Systems (IWSDS 2014), Napa, CA. PDF

2013


M. Gasic, C. Breslin, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis and S. Young (2013). "POMDP-based dialogue manager adaptation to extended domains." SigDial 13, Metz, France. [Best paper] PDF

M. Henderson, B. Thomson and S. Young (2013). "Deep Neural Network Approach for the Dialog State Tracking Challenge." SigDial 13, Metz, France. PDF

M. Gasic, C. Breslin, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis and S. Young (2013). "On-line Policy Optimisation of Bayesian Spoken Dialogue Systems via Human Interaction." Int Conf Acoustics Speech and Signal Processing (ICASSP), Vancouver, Canada. PDF

C. Breslin, M. Gasic, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis, K. Yu and S. Young (2013). "Continuous ASR for Flexible Incremental Dialogue." Int Conf Acoustics Speech and Signal Processing (ICASSP), Vancouver, Canada. PDF

S. Young, M. Gasic, B. Thomson and J. Williams (2013). "POMDP-based Statistical Spoken Dialogue Systems: a Review." Proc IEEE, 101(5):1160-1179 PDF

S. Young (2013). "Talking to Machines" Royal Academy of Engineering Ingenia, 54:40-46 PDF

2012


M. Gasic, F. Jurcicek, B. Thomson and S. Young (2012). "Optimisation for POMDP-based Spoken Dialogue Systems" in Data-driven Methods for Adaptive Spoken Dialogue Systems, Ed. O. Lemon and O. Pietquin, Springer ISBN 978-1-4614-4803-7. PDF

M. Henderson, M. Gasic, B. Thomson, P. Tsiakoulis, K. Yu and S. Young (2012). "Discriminative Spoken Language Understanding Using Word Confusion Networks." IEEE SLT 2012, Miami, FL. PDF

M. Gasic, M. Henderson, B. Thomson, P. Tsiakoulis and S. Young (2012). "Policy optimisation of POMDP-based dialogue systems without state space compression." IEEE SLT 2012, Miami, FL. PDF

B. Thomson, M. Henderson, M. Gasic, P. Tsiakoulis and S. Young (2012). "N-Best error simulation for training spoken dialogue systems." IEEE SLT 2012, Miami, FL. PDF

M. Gasic, P. Tsiakoulis, M. Henderson, B. Thomson, K. Yu, E. Tzirkel and S. Young (2012). "The effect of cognitive load on a statistical dialogue system." SigDial 2012, Seoul, S. Korea. PDF

P. Tsiakoulis, M. Gasic, M. Henderson, J. Planells-Lerma, J Prombonas, B. Thomson, K. Yu, S. Young and E. Tzirkel (2012). "Statistical Methods for Building Robust Spoken Dialogue Systems in an Automobile" in Advances in Human Aspects of Road and Rail Transportation, Ed. NA. Stanton, CRC Press, ISBN 978-1-4398-7123-2. PDF

F. Jurcicek, B. Thomson and S. Young (2012). "Reinforcement learning for parameter estimation in statistical spoken dialogue systems." Computer Speech and Language, 26(3):127-228 PDF

2011


M. Gasic, F. Jurcicek, B. Thomson, K. Yu and S. Young (2011). "On-line policy optimisation of spoken dialogue systems via live interaction with human subjects." ASRU 2011, Hawaii. PDF

F. Jurcicek, S. Keizer, M. Gasic, F. Mairesse, B. Thomson, K. Yu, S. Young (2011). "Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk." Interspeech, Florence, Italy. PDF

L. Daubigney, M. Gasic, S. Chandramohan, M. Geist, O. Pietquin and S. Young (2011). "Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system." Interspeech, Florence, Italy. PDF

A.W Black, S. Burger, A. Conkie, H. Hastie, S. Keizer, O. Lemon, N. Merigaud, G. Parent, G. Schubiner, B. Thomson, J.D Williams, K. Yu, S. Young, M. Eskenazi (2011). "Spoken Dialog Challenge 2010: Comparison of Live and Control Test Results." SigDial 2011, Portland, Oregon. PDF

K. Yu, H. Zen, F. Mairesse and S. Young (2011). "Context Adaptive Training with Factorized Decision Trees for HMM based Statistical Parametric Speech Synthesis." Speech Communication, 53(6):914-923. PDF

M. Gasic and S. Young (2011). "Effective Handling of Dialogue State in the Hidden Information State POMDP-based Dialogue Manager." ACM Transactions on Speech and Language Processing, 7(3) PDF

F. Jurcicek, B. Thomson and S. Young (2011). "Natural Actor and Belief Critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs." ACM Transactions on Speech and Language Processing, 7(3) PDF

K. Yu and S. Young (2011). "Continuous F0 Modelling for HMM based Statistical Parametric Speech Synthesis." IEEE Trans. Audio, Speech and Language Processing, 19(5):1071-1079. PDF

K. Yu and S. Young (2011). "Joint Modelling of Voicing Label and Continuous F0 for HMM-based Speech Synthesis." Int Conf Acoustics Speech and Signal Processing (ICASSP), Prague, CZ. PDF

2010


B. Thomson, F. Jurcicek, M. Gasic, S. Keizer, F. Mairesse, K. Yu and S. Young (2010). "Parameter learning for POMDP spoken dialogue models." SLT 2010, Berkeley, CA. [Best paper] PDF

B. Thomson, K. Yu, S. Keizer, M. Gasic, F. Jurcicek, F. Mairesse and S. Young (2010). "Bayesian Update of State for the Let's Go Spoken Dialogue Challenge." SLT 2010, Berkeley, CA. PDF

S. Young (2010). "Cognitive User Interfaces." IEEE Signal Processing Magazine, 27(3):128-140. PDF

B. Thomson and S. Young (2010). "Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems." Computer Speech and Language, 24(4):562-588. [CSL 2015 Best paper Award] PDF

S. Young (2010). "Still Talking to Machines (Cognitively Speaking)." Interspeech 2010, Chiba, Japan. PDF

F. Jurcicek, B. Thomson, S. Keizer, F. Mairesse, M. Gasic, K. Yu and S. Young (2010). "Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems." Interspeech 2010, Chiba, Japan. [Best paper] PDF

F. Lefevre, F. Mairesse and S. Young (2010). "Cross-Lingual Spoken Language Understanding from Unaligned Data using Discriminative Classification Models and Machine Translation." Interspeech 2010, Chiba, Japan. PDF

K. Yu, H. Zen, F. Mairesse and S. Young (2010). "Context Adaptive Training with Factorized Decision Trees for HMM-Based Speech Synthesis." Interspeech 2010, Chiba, Japan. [Best paper] PDF

S. Keizer, M. Gasic, F. Jurcicek, F. Mairesse, B. Thomson, K. Yu and S. Young (2010). "Parameter estimation for agenda-based user simulation." SigDial 2010, Tokyo, Japan. PDF

M. Gasic, F. Jurcicek, S. Keizer, F. Mairesse, B. Thomson, K. Yu and S. Young (2010). "Gaussian Processes for Fast Policy Optimisation of a POMDP Dialogue Manager for a Real-world Task." SigDial 2010, Tokyo, Japan. PDF

K. Yu, B. Thomson and S. Young (2010). "From Discontinuous To Continuous F0 Modelling In HMM-based Speech Synthesis." 7th ISCA Workshop on Speech Synthesis, Kyoto, Japan. PDF

K. Yu, F. Mairesse and S. Young (2010). "Word-level Emphasis Modelling in HMM-based Speech Synthesis." Int Conf Acoustics Speech and Signal Processing (ICASSP), Dallas, TX. PDF

S. Young, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson and K. Yu (2010). "The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management." Computer Speech and Language, 24(2): 150-174. [CSL 2013 Best paper Award] PDF

F. Mairesse, M. Gasic, F. Jurcicek, S. Keizer, J. Prombonas, B. Thomson, K. Yu and S. Young (2010). "Phrase-based Statistical Language Generation using Graphical Models and Active Learning." ACL 2010, Uppsala, Sweden. PDF

2009


S. Young (2009). "CUED Standard Dialogue Acts." Internal Report, Dialogue Systems Group, Cambridge University. PDF

M. Gasic, F. Lefevre, F. Jurcicek, S. Keizer, F. Mairesse, B. Thomson, K. Yu and S. Young (2009). "Back-off Action Selection in Summary Space-Based POMDP-based Dialogue Systems." ASRU 2009, Merano, Italy. PDF

F. Lefevre, M. Gasic, F. Jurcicek, S. Keizer, F. Mairesse, B. Thomson, K. Yu and S. Young (2009). "k-Nearest Neighbor Monte-Carlo Control Algorithm for POMDP-based Dialogue Systems." SigDial 2009, London, UK. PDF

F. Jurcicek, M. Gasic, S. Keizer, F. Mairesse, B. Thomson and S. Young (2009). "Transformation-based Learning for Semantic Parsing." Interspeech 2009, Brighton, UK. PDF

J. Schatzmann and S. Young (2009). "The Hidden Agenda User Simulation Model." IEEE Trans. Audio, Speech and Language Processing, 17(4):733-747. PDF

T. Toda and S. Young (2009). "Trajectory Training Considering Global Variance for HMM-based Speech Synthesis." Int Conf Acoustics Speech and Signal Processing (ICASSP), Taipei, Taiwan. PDF

K. Yu, T. Toda, M. Gasic, S. Keizer, F. Mairesse, B. Thomson and S. Young (2009). "Probabilistic Modelling of F0 in Unvoiced Regions in HMM-based Speech Synthesis." Int Conf Acoustics Speech and Signal Processing (ICASSP), Taipei, Taiwan. PDF

F. Mairesse, M. Gasic, F. Jurcicek, S. Keizer, B. Thomson, K. Yu and S. Young (2009). "Spoken Language Understanding from Unaligned Data using Discriminative Classification Models." Int Conf Acoustics Speech and Signal Processing (ICASSP), Taipei, Taiwan. PDF

Z. Inanoglu and S. Young (2009). "Data-driven Emotion Conversion in Spoken English." Speech Communication, 51(3): 268-283. [ISCA-EUSIPCO 2011 Best paper Award] PDF

2008


B. Thomson, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, K. Yu, S. Young (2008). "User study of the Bayesian Update of Dialogue State approach to dialogue management." Interspeech 2008, Brisbane, Australia. PDF

S. Keizer, M. Gasic, F. Mairesse, B. Thomson, K. Yu, S. Young (2008). "Modelling User Behaviour in the HIS-POMDP Dialogue Manager." IEEE Workshop on Spoken Language Technology (SLT08), Goa, India. PDF

B. Thomson, K. Yu, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, S. Young (2008). "Evaluating Semantic-level Confidence Scores with Multiple Hypotheses." Interspeech 2008, Brisbane, Australia. PDF

A. Del Pozo and S. Young (2008). "The Linear Transformation of LF Glottal Waveforms for Voice Conversion." Interspeech 2008, Brisbane, Australia. PDF

Z. Inanoglu and S. Young (2008). "Emotion Conversion using F0 Segment Selection." Interspeech 2008, Brisbane, Australia. PDF

M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson, S. Young (2008). "Training and Evaluation of the HIS POMDP Dialogue System in Noise." Sigdial 2008, Columbus, Ohio. PDF

B. Thomson, J. Schatzmann, and S. Young (2008). "Bayesian Update of Dialogue State for Robust Dialogue Systems." Int Conf Acoustics Speech and Signal Processing ICASSP, Las Vegas. PDF

A. Del Pozo and S. Young (2008). "Repairing Tracheoesophageal Speech Duration." Fourth Speech Prosody Conference, Campinas, Brazil. PDF

2007


M. Gales and S. Young (2007). "The Application of Hidden Markov Models in Speech Recognition." Foundations and Trends in Signal Processing 1(3): 195-304. PDF

J. Schatzmann, B. Thomson, and S. Young (2007). "Error Simulation for Training Statistical Dialogue Systems." ASRU 07, Kyoto, Japan. PDF

J. Schatzmann, B. Thomson and S. Young. (2007). "Statistical User Simulation with a Hidden Agenda." SigDIAL, Antwerp. PDF

Z. Inanoglu and S. Young (2007). "A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality." Interspeech 2007, Antwerp. PDF

J. Williams and S. Young. (2007). "Scaling POMDPs for Spoken Dialog Management." IEEE Audio, Speech and Language Processing, 15(7): 2116-2129. PDF

S. Young (2007). "HMMs and Related Speech Recognition Technologies." Springer Handbook of Speech Processing. J. Benesty, M. Sondhi and Y. Huang, Springer. PDF

J. Schatzmann, B. Thomson, K. Weilhammer, H. Ye, and S. Young (2007). "Agenda-Based User Simulation for Bootstrapping a POMDP Dialogue System." HLT/NAACL 2007, Rochester. PDF

B. Thomson, J. Schatzmann, K. Weilhammer, H. Ye, and S. Young. (2007). "Training a real-world POMDP-based Dialog System." HLT/NAACL Workshop "Bridging the Gap: Academic and Industrial Research in Dialog Technologies", Rochester. PDF

S. Young, J. Schatzmann, K. Weilhammer and H. Ye. (2007). "The Hidden Information State Approach to Dialog Management." ICASSP 2007, Honolulu, Hawaii. PDF

J. Williams and S. Young (2007). "Partially Observable Markov Decision Processes for Spoken Dialog Systems." Computer Speech and Language 21(2):231-422. [CSL 2010 Best paper Award] PDF

2006


S. Young (2006). "Using POMDPs for Dialog Management." IEEE/ACL Workshop on Spoken Language Technology (SLT 2006), Aruba. PDF

K. Weilhammer, M. Stuttle, et al. (2006). "Bootstrapping Language Models for Dialogue Systems." ICSLP 2006, Pittsburgh, PA. PDF

H. Ye and S. Young (2006). "A Clustering Approach to Semantic Decoding." ICSLP 2006, Pittsburgh, PA. PDF

J. Schatzmann, K. Weilhammer, M. Stuttle, S. Young. (2006). "A Survey of Statistical User Simulation Techniques for Reinforcement-Learning of Dialogue Management Strategies." Knowledge Engineering Review 21(2): 97-126. PDF

H. Ye and S. Young (2006). "Quality-enhanced Voice Morphing using Maximum Likelihood Transformations." IEEE Audio, Speech and Language Processing 14(4): 1301-1312. PDF

A. Del Pozo and S. Young (2006). "Continuous Tracheoesophageal Speech Repair." European Signal Processing Conference - EUSIPCO2006, Florence, Italy. PDF

J. Williams and S. Young (2006). "Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI)." AAAI Workshop on Statistical and Empirical Approaches for Spoken Dialogue Systems, Boston. PDF

J. Williams, P. Poupart, S. Young (2006). "Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management" Recent Trends in Discourse and Dialogue. Eds L. Dybkjaer and W. Minker, Springer. [expanded version of 2005 SIGDial paper] PDF

2005


S. Young, J. Williams, J. Schatzmann, M. Stuttle, K. Weilhammer. (2005). "The Hidden Information State Approach to Dialogue Management", CUED Technical Report CUED/F-INFENG/TR.544. PDF

J. Schatzmann, K. Weilhammer, M. Stuttle and S. Young. (2005). "Effects of the User Model on Simulation-based Learning of Dialogue Strategies." IEEE Workshop Automatic Speech Recognition and Understanding (ASRU05), Cancun, Mexico. PDF

Z. Inanoglu and S. Young (2005). "Intonation Modelling and Adaptation for Emotional Prosody Generation." 1st Intnl Conf on Affective Computing and Intelligent Interaction, ACII 2005, Beijing, Springer-Verlag GmbH. PDF

J. Williams and S. Young (2005). "Scaling up POMDPs for Dialogue Management: the Summary POMDP Method." IEEE workshop on Automatic Speech Recognition and Understanding (ASRU2005), Cancun, Mexico. PDF

V. Seneviratne and S. Young (2005). "The Hidden Vector State Language Model". Interspeech 2005, Lisbon, Portugal. PDF

H. Ye and S. Young (2005). "Improving Speech Recognition Performance of Beginners in Spoken Conversational Interaction for Language Learning." Interspeech 2005, Lisbon, Portugal. PDF

J. Schatzmann, K. Georgila, and S. Young. (2005). "Quantitative Evaluation of User Simulation Techniques for Spoken Dialogue Systems." SIGDial, Lisbon. PDF

J. Williams, P. Poupart, and S. Young (2005). "Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management." SIGDial, Lisbon.
PDF

J. Williams, P. Poupart, and S. Young (2005). "Factored Partially Observable Markov Decision Processes for Dialogue Management." 4th Workshop on Knowledge and Reasoning in Practical Dialogue Systems, Edinburgh.
PDF

Y. He and S. Young (2005). "Spoken Language Understanding using the Hidden Vector State Model." Speech Communication 48(3-4): 262-275.
PDF

J. Williams, P. Poupart and S. Young (2005). "Using Factored Partially Observable Markov Decision Processes with Continuous Observations for Dialog Management." Tech Report CUED/F-INFENG/TR.520, Cambridge University Engineering Dept.
PDF

Y. He and S. Young (2005). "Semantic Processing using the Hidden Vector State Model." Computer Speech and Language 19(1): 85-106.
PDF

2004


H. Ye and S. Young (2004). "Voice Conversion for Unknown Speakers." ICSLP 2004, Jeju, Korea.
Gzipped Postscript

H. Ye and S. Young (2004). "High Quality Voice Morphing." Int Conference Acoustics Speech and Signal Processing, Montreal, Canada.
Gzipped Postscript

J. Williams and S. Young (2004). "Characterising Task-oriented Dialog using a Simulated ASR Channel." ICSLP 2004, Jeju, Korea.
PDF

M. Stuttle, J. Williams and S. Young (2004). "A Framework for Dialog Systems Data Collection using a Simulated ASR Channel." ICSLP 2004, Jeju, Korea.
PDF

Y. He and S. Young (2004). "Robustness Issues in a Data-Driven Spoken Language Understanding System." HLT/NAACL04 Workshop on Spoken Language Understanding for Conversational Systems, Boston, MA.
PDF

2003


S. Young (2003). "The Hidden Vector State Language Model." Tech Report CUED/F-INFENG/TR.467, Cambridge University.
PDF

H. Ye and S. Young (2003). "Perceptually Weighted Linear Transformations for Voice Conversion." Eurospeech 2003, Geneva.
Gzipped Postscript

J. Williams and S. Young (2003). "Using Wizard-of-Oz Simulations to Bootstrap Reinforcement Learning-based Dialog Management Systems." 4th SIGdial Workshop on Discourse and Dialogue, Sapporo, Japan.
PDF

Y. He and S. Young (2003). "A Data-Driven Spoken Language Understanding System." IEEE Workshop on Automatic Speech Recognition and Understanding, US Virgin Islands.
PDF

Y. He and S. Young (2003). "Hidden Vector State Model for Hierarchical Semantic Parsing." Proceedings Int Conf Acoustics Speech and Signal Processing, Hong Kong.
PDF

2002


S. Young (2002). "The Statistical Approach to the Design of Spoken Dialogue Systems." Tech Report CUED/F-INFENG/TR.433, Cambridge University Engineering Department.
Gzipped Postscript

S. Young (2002). "Talking to Machines (Statistically Speaking)." Int Conf Spoken Language Processing, Denver, Colorado.
Gzipped Postscript

K. Scheffler and S. Young (2002). "Automatic Learning of Dialogue Strategy using Dialogue Simulation and Reinforcement Learning." HLT 2002, San Diego, USA.
Gzipped Postscript

H. Nock and S. Young (2002). "Modelling Asynchrony in Automatic Speech Recognition using Loosely Coupled HMMs." Cognitive Science 26(3): 283-301.
Gzipped Postscript

2001


S. Young (2001). "Statistical Modelling in Continuous Speech Recognition." Proc Int Conf on Uncertainty in Artificial Intelligence, Seattle.
Gzipped Postscript

A. Tuerk and S. Young (2001). "Polynomial Softmax Functions for Pattern Classification." Tech Report CUED/F-INFENG/TR402, Cambridge University Engineering Dept.
Gzipped Postscript

A. Tuerk and S. Young (2001). "Indicator Variable Dependent Output Probability Modelling via Continuous Posterior Functions." Int Conf Acoustics Speech and Signal Processing (ICASSP), Salt Lake City, Utah.
PDF

K. Scheffler and S. Young (2001). "Corpus-based Dialogue Simulation for Automatic Strategy Learning and Evaluation." Proc NAACL-2001 Workshop on Adaptation in Dialogue Systems, Pittsburgh, USA.
Gzipped Postscript

H. Nock and S. Young (2001). "A Comparison of Exact and Approximate Algorithms for Decoding and Training Loosely Coupled HMMs." Proc Inst Acoustics Workshop on Innovation in Speech Processing, Stratford-upon-Avon.
Gzipped Postscript

C. Blackburn and S. Young (2001). "Enhanced Speech Recognition using an Articulatory Production Model Trained on X-ray Data." Computer Speech and Language 15(3): 195-215.

2000


S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev and P. Woodland (2000). "The HTK Book Version 3.0." Cambridge, England, Cambridge University.

S. Young (2000). "Probabilistic Methods in Spoken Dialogue Systems." Philosophical Trans Royal Society (Series A) 358(1769): 1389-1402. Gzipped Postscript

S. Witt and S. Young (2000). "Phone-level Pronunciation Scoring and Assessment for Interactive Language Learning." Speech Communication 30(2/3): 95-108. PDF

K. Scheffler and S. Young (2000). "Probabilistic simulation of human-machine dialogues." Proc IEEE ICASSP, Istanbul, Turkey. Gzipped Postscript

H. Nock and S. Young (2000). "Loosely Coupled HMMs for ASR." Proc Int Conf Speech and Language Processing (ICSLP), Beijing, China. Gzipped Postscript

G. Moore and S. Young (2000). "Class-based language model adaptation using mixtures of word-class weights." Proc Int Conf Speech and Language Proc(ICSLP), Beijing, China.

C. Blackburn and S. Young (2000). "A Self-Learning Predictive Model of Articulatory Movements during Speech Production." J Acoustical Society of America 107(3): 1659-1670.

1999


S. Young (1999). "Acoustic Modelling for Large Vocabulary Continuous Speech Recognition." Computational Models of Speech Pattern Processing: Proc NATO Advance Study Institute. K. Ponting, Springer-Verlag: 18-38. Gzipped Postscript

S. Witt and S. Young (1999). "Off-Line Acoustic Modelling of Non-Native Accents." Proc Eurospeech, Budapest, Hungary.

A. Tuerk and S. Young (1999). "Modelling Speaking Rate Using a Between Frame Distance Metric." Proc Eurospeech, Budapest, Hungary.

K. Scheffler and S. Young (1999). "Simulation of human-machine dialogues." Tech Report CUED/F-INFENG/TR 355, Cambridge University Engineering Dept. Gzipped Postscript

K. Knill and S. Young (1999). "Low-Cost Implementation of Open Set Keyword-spotting." Computer Speech and Language 13(3): 243-266.

M. Gales, K. Knill and S. Young (1999). "State-based Gaussian Selection in Large Vocabulary Continuous Speech Recognition using HMMs." IEEE Trans Speech and Audio Processing 7(2): 152-161.

1998


S. Young and L. Chase (1998). "Speech Recognition Evaluation: A Review of the US CSR and LVCSR Programmes." Computer Speech and Language 12(4): 263-279.

P. Woodland, T. Hain, S. Johnson, T. Niesler, A. Tuerk and S. Young (1998). "Experiments in Broadcast News Transcription." Proc IEEE ICASSP, Seattle.

P. Woodland, T. Hain, S. Johnson, T. Niesler, A. Tuerk, E. Whittaker and S. Young (1998). "The 1997 HTK Broadcast News Transcription System." DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, Morgan Kaufmann.

S. Witt and S. Young (1998). "Bilingual Model Combination for Non-Native Speech Recognition." Proc Institute of Acoustics Conf Speech and Hearing, Windermere, England.

H. Nock and S. Young (1998). "Detecting and Improving Poor Pronunciations for Multiwords." Proc ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Rolduc, Netherlands.

T. Hain, S. Johnson, A. Tuerk, P. Woodland and S. Young (1998). "Segment Generation and Clustering in the HTK Broadcast News Transcription System." Proc DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, Morgan-Kaufmann.

1997


S. Young, M. Brown, J. Foote, G. Jones and K. Sparck Jones (1997). "Acoustic Indexing for Multimedia Retrieval and Browsing." Proc IEEE ICASSP, Munich, Germany. Gzipped Postscript

S. Young and G. Bloothooft, Eds. (1997). "Corpus-Based Methods in Language and Speech Processing." Text, Speech and Language Technology. Dordrecht, Netherlands, Kluwer.

S. Young, M. Adda-Decker, X. Aubert, C. Dugast, J.-L. Gauvain, D. Kershaw, L. Lamel, D. v. Leeuwen, D. Pye, A. Robinson, H. Steeneken and P. Woodland (1997). "Multilingual Large Vocabulary Speech Recognition." Computer Speech and Language 11(1): 73-89.

P. Woodland, M. Gales, D. Pye and S. Young (1997). "Broadcast News Transcription using HTK." Proc IEEE ICASSP, Munich, Germany.

P. Woodland, M. Gales, D. Pye and S. Young (1997). "The Development of the 1996 HTK Broadcast News Transcription System." Proc DARPA Speech Recognition Workshop, Chantilly, Virginia, Morgan Kaufmann.

S. Witt and S. Young (1997). "Pronunciation Teaching Based on Automatic Speech Recognition." Proc. International Conf. on Language Teaching, Language Technology, Groningen, Netherlands.

S. Witt and S. Young (1997). "Computer-assisted Pronunciation Teaching based on Automatic Speech Recognition." Proc Conf. Language Teaching and Language Technology, Univ Groningen, the Netherlands.

S. Witt and S. Young (1997). "Language Learning Based on Non-Native Speech Recognition." Proc Eurospeech, Rhodes, Greece.

V. Valtchev, J. Odell, P. Woodland and S. Young (1997). "MMIE Training of Large Vocabulary Recognition Systems." Speech Communication 22: 303-314. [Best paper]

H.-H. Shih and S. Young (1997). "A Study on the Portability of a Grammatical Inference System." Proc 10th ROCLING (Research on Comp Linguistics) Int Conf, Academia Sinica, Taipei, Taiwan.

H. Nock, M. Gales and S. Young (1997). "A Comparative Study of Methods for Phonetic Decision-Tree State Clustering." Proc Eurospeech, Rhodes, Greece.

G. Jones, J. Foote, K. Sparck Jones and S. Young (1997). "The Video Mail Retrieval Project." Intelligent Multimedia Information Retrieval, Ed. M. Maybury, MIT Press: 191-214.

J. Foote, S. Young, G. Jones and K. Sparck Jones (1997). "Unconstrained Keyword Spotting using Phone Lattices with Application to Spoken Document Retrieval." Computer Speech and Language 11(3): 207-224.

1996


S. Young (1996). "Large Vocabulary Continuous Speech Recognition." IEEE Signal Processing Magazine 13(5): 45-57. Gzipped Postscript

V. Valtchev, P. Woodland and S. Young (1996). "Lattice-based Discriminative Training for Large Vocabulary Speech Recognition." Proc IEEE ICASSP, Atlanta.

V. Valtchev, P. Woodland and S. Young (1996). "Discriminative Optimisation of Large Vocabulary Systems." Proc Int Conf Spoken Language Processing (ICSLP), Philadelphia.

K. Sparck Jones, G. Jones, J. Foote and S. Young (1996). "Experiments in Spoken Document Retrieval." Information Processing and Management 32(4): 399-417.

K. Knill and S. Young (1996). "Fast Implementation Methods for Viterbi-based Word-Spotting." Proc IEEE ICASSP, Atlanta.

K. Knill, M. Gales and S. Young (1996). "Use of Gaussian Selection in Large Vocabulary Continuous Speech Recognition using HMMs." Proc Int Conf Spoken Language Processing (ICSLP), Philadelphia.

G. Jones, J. Foote, K. Sparck Jones and S. Young (1996). "Video Mail Retrieval using Voice: an Overview of the Stage 2 System." Proc Final Workshop on Multimedia Information Retrieval (MIRO), Glasgow, Scotland, Springer-Verlag.

G. Jones, J. Foote, K. Sparck Jones and S. Young (1996). "Robust Talker-Independent Audio Document Retrieval." Proc IEEE ICASSP, Atlanta.

G. Jones, J. Foote, K. Sparck Jones and S. Young (1996). "Retrieving Spoken Documents by Combining Multiple Index Sources." Proc ACM SIG Information Retrieval, Zurich.

M. Gales and S. Young (1996). "Robust Continuous Speech Recognition using Parallel Model Combination." IEEE Trans Speech and Audio Processing 4(5): 352-359.

M. Brown, J. Foote, G. Jones, K. Sparck Jones and S. Young (1996). "Open-Vocabulary Speech Indexing for Voice and Video Mail Retrieval." Proc 4th ACM Int Multimedia Conf, Boston.

C. Blackburn and S. Young (1996). "A Self-Learning Speech Synthesis System." Proc ESCA 4th Tutorial and Workshop on Speech Production Modelling.

C. Blackburn and S. Young (1996). "Pseudo-articulatory Speech Synthesis for Recognition using Automatic Feature Extraction from X-Ray Data." Proc Int Conf Spoken Language Processing (ICSLP), Philadelphia.

1995


P. Woodland, C. Leggetter, J. Odell, V. Valtchev and S. Young (1995). "The 1994 HTK Large Vocabulary Speech Recognition System." Proc ICASSP, Detroit.

P. Woodland, C. Leggetter, J. Odell, V. Valtchev and S. Young (1995). "The Development of the 1994 HTK Large Vocabulary Speech Recognition System." Proc Spoken Language Technology Workshop, Morgan Kaufmann Publishers Inc, Austin, Texas.

K. Sparck Jones, J. Foote, G. Jones and S. Young (1995). "Spoken Document Retrieval." Proc 1st Annual Symposium on Document Analysis and Information Retrieval, Las Vegas.

H.-H. Shih, S. Young and N. Waegner (1995). "An Inference Approach to Grammar Construction." Computer Speech and Language 9(3): 235-256.

D. Pye, P. Woodland and S. Young (1995). "Large Vocabulary Multilingual Speech Recognition using HTK." Proc Eurospeech, Madrid.

G. Jones, J. Foote, K. Sparck Jones and S. Young (1995). "Video Mail Retrieval." Proc ICASSP, Detroit.

M. Gales and S. Young (1995). "A Fast and Flexible Implementation of Parallel Model Combination." Proc ICASSP, Detroit.

M. Gales and S. Young (1995). "Robust Speech Recognition in Additive and Convolutional Noise using Parallel Model Combination." Computer Speech and Language 9(4): 289-308.

M. Gales and S. Young (1995). "The Application of Parallel Model Combination to a Large Vocabulary Dictation Task." Proc Eurospeech, Madrid.

J. Foote, G. Jones, K. Sparck Jones and S. Young (1995). "Talker-Independent Keyword Spotting for Information Retrieval." Proc Eurospeech 95, Madrid.

J. Foote, M. Brown, G. Jones, K. Sparck Jones and S. Young (1995). "Video Mail Retrieval by Voice." Proc IMMI-1, Edinburgh.

M. Brown, J. Foote, G. Jones, K. Sparck Jones and S. Young (1995). "Automatic Content-Based Retrieval of Broadcast News." Proc ACM Multimedia 95, San Francisco.

C. Blackburn and S. Young (1995). "Towards Improved Speech Recognition using a Speech Production Model." Proc Eurospeech, Madrid.

C. Blackburn and S. Young (1995). "Learning New Articulator Trajectories for a Speech Production Model using Artificial Neural Networks." Proc ICCN, Perth, Australia.

C. Blackburn and S. Young (1995). "A Novel Self-Organising Speech Production System using Pseudo-Articulators." Proc 13th Int Congress of Phonetic Sciences, Stockholm, Sweden.