George Saon
ICASSP 2006
In this paper we describe some recent improvements to the performance of the Aurora 2 noisy digits speech recognition system for the matched training and test condition. The algorithms that we used pertain to discriminant acoustic modeling based on the Maximum Mutual Information (MMI) criterion, non-linear speaker/channel adaptation through probability distribution function matching. In addition, we revisited our last year's baseline system and improved its performance through cross-word context dependent modeling and Gaussian mixture components selection using the Bayesian Information Criterion (BIC). The aggregated result is 93.3% word accuracy for the multi-condition training data scenario.
George Saon
ICASSP 2006
George Saon, Samuel Thomas, et al.
INTERSPEECH 2013
Michael Picheny, Zoltan Tuske, et al.
INTERSPEECH 2019
George Saon, Tom Sercu, et al.
INTERSPEECH 2016