CUED-RNNLM Toolkit

CUED-RNNLM Toolkit

Introduction

CUED-RNNLM toolkit present an implementation of RNNLM training (on GPU), and efficient evaluation(on CPU). We have recently released CUED-RNNLM v1.0, which support LSTM, GRU, Highway structure, as well as more flexible deep structure.

more recently, we released CUED-RNNLM v1.1, which support su-RNNLMs.

Code and example for lattice rescoring on HTK and Kaldi lattice are available as well, please feel free to send me email (chenxie95@gmail.com) if you observed any issues.

We strongly recommend you to use CUED-RNNLM v1.0 and CUED-RNNLM v1.1 (for su-RNNLM), and the old version (CUED-RNNLM v0.1) won't be maintained

CUED-RNNLM.v1.1 - CUED-RNNLM.v1.0 - CUED-RNNLM.v0.1 - Contact - Reference - License

CUED-RNNLM v1.1

cued-rnnlm code

cued-rnnlm evalation on CPU

HTK HLRescore

HTK Lattice Rescore example (including su-RNNLM lattice rescoring)

Highlights

CUED-RNNLM v1.0 usage

cued-rnnlm code

cued-rnnlm evalation on CPU

HTK HLRescore, HTK Lattice Rescore example

Kaldi Patch (support to rescore Kaldi lattice directly), Kaldi Lattice Rescore example

CUED RNNLM Toolkit Document v1.0

Recipe for training and evaluation (PPL and N-best rescoring)

Highlights

CUED-RNNLM v0.1

cued-rnnlm v0.1 (both linux and windows supported)

cued-rnnlm v0.1a (modify gradient clipping for NCE training with shared noise sample(under testing))

tools to evaluate cued-rnnlm on CPU

HTK 3.4.1 patch(RNNLM latrescore supported in HLRescore) HTKLib change file list HTKTools change file list

Example of RNNLM lattice rescore

Kaldi lattice to HTK lattice Convert Tools HTK and Kaidi Lattices

External Link

HTK 3.5 supports RNNLMs generated by CUED-RNNLM toolkit for lattice rescoring now!

Highlights

Document

CUED RNNLM Toolkit

AMI Data and Recipe

Recipe

Log file for Google 1 Billion Corpus

log file

Usage

rnnlm is invoked by typing the commend line

Contact

Xie Chen - chenxie95 at gmail.com

References

Acknowledgement

This work is supported by Toshiba Research Europe Ltd, Cambridge Research Lab. It was also supported by EPSRC grant EP/I031022/1 (Natural Speech Technology).

We would like to thank Andreas Stolcke for porting it into windows (MS Visual C) and valuable advices.

We would like to thank Ricky Chan for providing the Kaldi lattice rescoring code for CUED-RNNLM v0.1

License

This toolkit is freely available under the BSD license and copyright from the RNNLM toolkit.