Publications

124 results for George Saon

A Non-autoregressive Model for Joint STT and TTS
- - Vishal Sunder
  - Brian Kingsbury
  - et al.
- 2025
- ICASSP 2025
Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASR
- - Takashi Fukuda
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
LLM based Text Generation for Improved Low-resource Speech Recognition Models
- - Tohru Nagano
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition
- - Xiaodong Cui
  - A.F.M. Saif
  - et al.
- 2024
- IEEE/ACM TASLP
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
- - Ashish Mittal
  - Sunita Sarawagi
  - et al.
- 2023
- EMNLP 2023
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition
- - Xiaodong Cui
  - George Saon
  - et al.
- 2023
- INTERSPEECH 2023
Multi-Speaker Data Augmentation for Improved end-to-end Automatic Speech Recognition
- - Samuel Thomas
  - Hong-Kwang J. Kuo
  - et al.
- 2023
- ICASSP 2023
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
- - Jiatong Shi
  - George Saon
  - et al.
- 2022
- INTERSPEECH 2022
Global RNN Transducer Models For Multi-dialect Speech Recognition
- - Takashi Fukuda
  - Samuel Thomas
  - et al.
- 2022
- INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
- - Xiaodong Cui
  - George Saon
  - et al.
- 2022
- INTERSPEECH 2022