Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer. Slava Shechtman, Avihu Dekel. 2024. INTERSPEECH 2024.
Large Scale Generative AI Text Applied to Sports and Music. Aaron Baughman, Eduardo Morales, et al. 2024. KDD 2024.
Harnessing Remote Speech Tasks for Early ALS Biomarker Identification. Carla Agurto Rios, Michele Merler, et al. 2024. ICDH 2024.
Exploring Chronic Pain Experiences: Leveraging Text and Audio Analysis to Infer Well-Being Metrics. Carla Agurto Rios, Michele Merler, et al. 2024. ICDH 2024.
Remotely-captured, free-text responses track with patient health states in chronic pain. Jenna Reinen, Carla Agurto Rios, et al. 2024. ICDH 2024.
Large Language Models as a Tool for Cognitive Stimulation: Chatbot Book Clubs for Seniors. Hannah Zhou, Emily Chen, et al. 2024. ICDH 2024.
Using Large Language Models to Understand Suicidality in a Social Media–Based Taxonomy of Mental Health Disorders: Linguistic Analysis of Reddit Posts. Brian Bauer, Raquel Norel, et al. 2024. JMIR Mental Health.
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. Yuchen Hu, Chen Chen, et al. 2024. ICLR 2024.
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. Chen Chen, Ruizhe Li, et al. 2024. ICLR 2024.
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems. Takuma Udagawa, Masayuki Suzuki, et al. 2024. ICASSP 2024.