Publications

134 results for Brian Kingsbury

VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
- - Jiatong Shi
  - George Saon
  - et al.
- 2022
- INTERSPEECH 2022
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
- - Andrea Fasoli
  - Chia-Yu Chen
  - et al.
- 2022
- INTERSPEECH 2022
Global RNN Transducer Models For Multi-dialect Speech Recognition
- - Takashi Fukuda
  - Samuel Thomas
  - et al.
- 2022
- INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
- - Xiaodong Cui
  - George Saon
  - et al.
- 2022
- INTERSPEECH 2022
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval
- - Nina Shvetsova
  - Brian Chen
  - et al.
- 2022
- CVPR 2022
Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
- - Samuel Thomas
  - Brian Kingsbury
  - et al.
- 2022
- ICASSP 2022
A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
- - Zvi Kons
  - Aharon Satt
  - et al.
- 2022
- ICASSP 2022
Decentralized Bilevel Optimization for Personalized Client Learning
- - Songtao Lu
  - Xiaodong Cui
  - et al.
- 2022
- ICASSP 2022
Integrating dialog history into end-to-end spoken language understanding systems
- - Jatin Ganhotra
  - Samuel Thomas
  - et al.
- 2021
- INTERSPEECH 2021
Cascaded multilingual audio-visual learning from videos
- - Andrew Rouditchenko
  - Angie Boggust
  - et al.
- 2021
- INTERSPEECH 2021