Exploring the Limits of Conformer CTC-Encoder for Speech Emotion Recognition using Large Language ModelsEdmilson Da Silva MoraisHagai Aronowitzet al.2025INTERSPEECH 2025
Low Bitrate High-Quality RVQGAN-based Discrete Speech TokenizerSlava ShechtmanAvihu Dekel2024INTERSPEECH 2024
Exploring the Benefits of Tokenization of Discrete Acoustic UnitsAvihu DekelRaul Fernandez2024INTERSPEECH 2024
Speak While You Think: Streaming Speech Synthesis During Text GenerationAvihu DekelSlava Shechtmanet al.2024ICASSP 2024