Improving ASR Robustness in Noisy Condition Through VAD IntegrationSashi NovitasariTakashi Fukudaet al.2022INTERSPEECH 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent SystemsVishal SunderEric Fosler-Lussieret al.2022INTERSPEECH 2022
Extending RNN-T-based speech recognition systems with emotion and language classificationZvi KonsHagai Aronowitzet al.2022INTERSPEECH 2022
Global RNN Transducer Models For Multi-dialect Speech RecognitionTakashi FukudaSamuel Thomaset al.2022INTERSPEECH 2022
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech SynthesisRaul FernandezDavid Hawset al.2022INTERSPEECH 2022
CONTENTVEC: An Improved Self-Supervised Speech Representation by Disentangling SpeakersKaizhi QianYang Zhanget al.2022ICML 2022
Towards End-to-end Integration of Dialog History For Improved Spoken Language UnderstandingVishal SunderSamuel Thomaset al.2022ICASSP 2022
Decentralized Bilevel Optimization for Personalized Client LearningSongtao LuXiaodong Cuiet al.2022ICASSP 2022
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding SystemsSamuel ThomasJeff Kuoet al.2022ICASSP 2022
A new data augmentation method for intent classification enhancement and its application on spoken conversation datasetsZvi KonsAharon Sattet al.2022ICASSP 2022