Exploring the limits of decoder-only models trained on public speech recognition corporaAnkit GuptaGeorge Saonet al.2024INTERSPEECH 2024
M2 ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective OptimizationA SaifLisha Chenet al.2024INTERSPEECH 2024
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel OptimizationA F M SaifXiaodong Cuiet al.2024ICASSP 2024
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding SystemsSamuel ThomasJeff Kuoet al.2022ICASSP 2022
Towards End-to-end Integration of Dialog History For Improved Spoken Language UnderstandingVishal SunderSamuel Thomaset al.2022ICASSP 2022
Improving End-to-End Models for Set Prediction in Spoken Language UnderstandingJeff KuoZoltan Tuskeet al.2022ICASSP 2022
High-Dimensional Smoothed Entropy Estimation via Dimensionality ReductionYuancheng YuKristjan Greenewaldet al.2023ISIT 2023
Understanding Unequal Gender Classification Accuracy from Face ImagesVidya MuthukumarTejaswini Pedapatiet al.2018arXiv