A timbre space for speech
Hiroko Terasawa, Malcolm Slaney, et al.
INTERSPEECH - Eurospeech 2005
This paper describes techniques to change the playback speed of MPEG-compressed audio, without first decompressing the audio file. There are two primary contributions in this paper. 1) We describe three techniques to perform time-scale modification in the maximally decimated domain. 2) We show to infer the output of the auditory masking model on the new audio stream, using the information in the original file. This new FastMPEG algorithm is more than an order of magnitude more efficient than decompressing the audio, performing time-scale modification in the conventional time-domain, and recompressing. Samples of our results can be found at http://www.slaney.org/covell/Fast-MPEG/.
Hiroko Terasawa, Malcolm Slaney, et al.
INTERSPEECH - Eurospeech 2005
Nima Mesgarani, Malcolm Slaney, et al.
IEEE Transactions on Audio, Speech and Language Processing
Nima Mesgarani, Shihab Shamma, et al.
ICASSP 2004
Malcolm Slaney, Gerald McRoberts
Speech Communication