Gakuto Kurata

Title

Distinguished Engineer and Chief Scientist for Spoken Conversational Systems

Bio

Dr. Gakuto KURATA is a Distinguished Engineer and Chief Scientist for Spoken Conversational Systems at IBM Research. He is the senior manager of AI technologies at IBM Research - Tokyo and currently leads research and development of Large Language Models and Large Speech Models in Japan in collaboration with global IBM Research and Development teams. By closely collaborating with Sales and Consulting teams, he accelerates AI adoption in business by translating customer requirements into products.

He joined IBM in April 2004, after obtaining an M.S. in Information Science and Technology from the University of Tokyo. He received a Ph.D. in Information Science and Technology from the University of Tokyo in 2013. He was the Technical Assistant to the Director of IBM Research - Tokyo in 2014. He is an IBM Master Inventor and a member of the IBM Academy of Technology. He served as an elected member of the IEEE SLTC (Speech and Language Processing Technical Committee) from 2018 to 2024. He has more than 20 years of research and development experience in speech technology, natural language processing, and their combination. He earned the Technology Development Award from the Acoustical Society of Japan in 2018 and the Industrial Achievement Award from the Information Processing Society of Japan in 2021.

Publications

Unsupervised lexicon acquisition from speech and text
- - Gakuto Kurata
  - Shinsuke Mori
  - et al.
- 2007
- ICASSP 2007
Unsupervised adaptation of a stochastic language model using a Japanese raw corpus
- - Gakuto Kurata
  - Shinsuke Mori
  - et al.
- 2006
- ICASSP 2006
Phoneme-to-text transcription system with an infinite vocabulary
- - Shinsuke Mori
  - Daisuke Takuma
  - et al.
- 2006
- COLING/ACL 2006
Class-based variable memory length markov model
- - Shinsuke Mori
  - Gakuto Kurata
- 2005
- INTERSPEECH - Eurospeech 2005

Patents

- 30 Sep 2024
- US
- 12106215
Knowledge Transfer Between Recurrent Neural Networks
- 11 Sep 2024
- AU
- 2021391031
Learning Unpaired Multimodal Feature Matching For Semi-supervised Learning
- 23 May 2024
- CN
- ZL202010749013.4
Data Augmentation By Frame Insertion For Speech Data
- 02 Apr 2024
- GB
- 2616157
Fast - Learning Unpaired Multimodal Feature Matching For Semi-supervised Learning
- 18 Mar 2024
- US
- 11934938
Neural Network For Chemical Compounds
- 19 Feb 2024
- US
- 11908458
Customization Of Recurrent Neural Network Transducers For Speech Recognition
- 19 Feb 2024
- US
- 11908454
Integrating Text Inputs For Training And Adapting Neural Network Transducer ASR Models
- 05 Feb 2024
- US
- 11893983
Adding Words To A Prefix Tree For Improving Speech Recognition
- 10 Jan 2024
- TW
- I829312
Integrating Text Inputs For Training And Adapting Neural Network Transducer ASR Models
- 15 Oct 2023
- JP
- 7368479
Training Data Modification For Training Model

Top collaborators

Takashi Fukuda

Senior Technical Staff Member, Master Inventor - Audio, Speech, and Language Processing

George Saon

Speech strategy lead, distinguished research scientist

Samuel Thomas

Senior Research Scientist - Speech Recognition and Spoken Language Understanding

Masayasu Muraoka

Researcher