Simple and Efficient Model Filtering in Statistical Machine Translation

Simple and Efficient Model Filtering in Statistical Machine Translation” by J. Pino, A. Waite, and W. Byrne. The Prague Bulletin of Mathematical Linguistics, no. 98, 2012, pp. 5-24 (20 pages). Published online 6 September 2012.

Abstract

Data availability and distributed computing techniques have allowed statistical machine translation (SMT) researchers to build larger models. However, decoders need to be able to retrieve information efficiently from these models to be able to translate an input sentence or a set of input sentences. We introduce an easy to implement and general purpose solution to tackle this problem: we store SMT models as a set of key-value pairs in an HFile. We apply this strategy to two specific tasks: test set hierarchical phrase-based rule filtering and n-gram count filtering for language model lattice rescoring. We compare our approach to alternative strategies and show that its trade offs in terms of speed, memory and simplicity are competitive.

BibTeX entry:

@article{pbm12,
   author = {J. Pino and A. Waite and W. Byrne},
   title = {Simple and Efficient Model Filtering in Statistical Machine
	Translation},
   journal = {The Prague Bulletin of Mathematical Linguistics},
   number = {98},
   pages = {5--24 (20 pages)},
   year = {2012},
   note = {Published online 6 September 2012},
   url = {https://ufal.mff.cuni.cz/pbml/98/art-pino-waite-byrne.pdf}
}

Back to Bill Byrne publications.