Publications

Beyond the Clinic: Leveraging Speech Acoustics and Phonetics for Cognitive Monitoring in PKU
- - Kely Norel
  - Carla Agurto Rios
  - et al.
- 2025
- ICDH 2025
Comprehensive Layer-Wise Analysis of SSL Models for Audio Deepfake Detection
- - Yassine Elkheir
  - Younes Samih
  - et al.
- 2025
- NAACL 2025
A Non-autoregressive Model for Joint STT and TTS
- - Vishal Sunder
  - Brian Kingsbury
  - et al.
- 2025
- ICASSP 2025
LLM based Text Generation for Improved Low-resource Speech Recognition Models
- - Tohru Nagano
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASR
- - Takashi Fukuda
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
Beyond neuropsychological tests: AI speech analysis in PKU
- - Susan E. Waisbren
  - Raquel Norel
  - et al.
- 2024
- J. Inherit. Metab. Dis.
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
- - Yuchen Hu
  - Chen Chen
  - et al.
- 2024
- NeurIPS 2024
Robust ASR Error Correction with Conservative Data Filtering
- - Takuma Udagawa
  - Masayuki Suzuki
  - et al.
- 2024
- EMNLP 2024
Exploring the limits of decoder-only models trained on public speech recognition corpora
- - Ankit Gupta
  - George Saon
  - et al.
- 2024
- INTERSPEECH 2024
Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
- - Slava Shechtman
  - Avihu Dekel
- 2024
- INTERSPEECH 2024