The HTK Rich Audio Transcription project is funded by the DARPA
Effective, Affordable, Reusable Speech-to-Text (EARS) programme for five
years, starting in May 2002. The aim of the project is to advance the
state of the art significantly while tackling the hardest
speech recognition challenges, including the transcription of broadcast
news and telephone conversations. A wide range of research areas will
be pursued, aimed both at improving the word error rate of conventional
speech recognition systems and at developing an enriched output format
with additional acoustic and linguistic metadata.
To generate enriched transcriptions that
contain the identity of the speaker, acoustic environment, channel
conditions and some linguistic mark-up, such as the location
of sentence-like boundaries or disfluent speech.
Task 3: Public HTK Development
To develop and enhance the core HTK software toolkit
available via the HTK Website.