Conference paper
Boosting Gaussian mixtures in an LVCSR system
Abstract
In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.
Related
Conference paper
The IBM mandarin broadcast speech transcription system
Conference paper
The IBM 2006 GALE arabic ASR system
Conference paper