Gakuto Kurata

Title

Distinguished Engineer and Chief Scientist for Spoken Conversational Systems

Bio

Dr. Gakuto KURATA is a Distinguished Engineer and Chief Scientist for Spoken Conversational Systems at IBM Research. He is the senior manager of AI technologies at IBM Research - Tokyo and currently leads research and development of Large Language Models and Large Speech Models in Japan in collaboration with global IBM Research and Development teams. Working closely with Sales and Consulting teams, he accelerates business adoption of AI by translating customer requirements into products.

He joined IBM in April 2004, after obtaining an M.S. in Information Science and Technology from the University of Tokyo. He received a Ph.D. in Information Science and Technology from the University of Tokyo in 2013. He was the Technical Assistant to the Director of IBM Research - Tokyo in 2014. He is an IBM Master Inventor and a member of the IBM Academy of Technology. He served as an elected member of the IEEE SLTC (Speech and Language Processing Technical Committee) from 2018 to 2024. He has more than 20 years of research and development experience in speech technology, natural language processing, and their combinations. He received the Technology Development Award from the Acoustical Society of Japan in 2018 and the Industrial Achievement Award from the Information Processing Society of Japan in 2021.

Publications

Deep neural network training emphasizing central frames
- Gakuto Kurata, Daniel Willett
- 2015
- INTERSPEECH 2015

A metric for evaluating speech recognizer output based on human-perception model
- Nobuyasu Itoh, Gakuto Kurata, et al.
- 2015
- INTERSPEECH 2015

Discriminative re-ranking for automatic speech recognition by leveraging invariant structures
- Masayuki Suzuki, Gakuto Kurata, et al.
- 2015
- Speech Communication

Leveraging phonetic context dependent invariant structure for continuous speech recognition
- Congying Zhang, Masayuki Suzuki, et al.
- 2014
- ChinaSIP 2014

Discriminative reranking for LVCSR leveraging invariant structure
- Masayuki Suzuki, Gakuto Kurata, et al.
- 2012
- INTERSPEECH 2012

Leveraging Word Confusion Networks for Named Entity modeling and detection from Conversational Telephone Speech
- Gakuto Kurata, Nobuyasu Itoh, et al.
- 2012
- Speech Communication

Acoustically discriminative language model training with pseudo-hypothesis
- Gakuto Kurata, Abhinav Sethy, et al.
- 2012
- Speech Communication

Acoustic model training with detecting transcription errors in the training data
- Gakuto Kurata, Nobuyasu Itoh, et al.
- 2011
- INTERSPEECH 2011

Continuous digits recognition leveraging invariant structure
- Masayuki Suzuki, Gakuto Kurata, et al.
- 2011
- INTERSPEECH 2011

Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
- Gakuto Kurata, Nobuyasu Itoh, et al.
- 2011
- ICASSP 2011

Patents

Method For Improving Acoustic Model, Computer For Improving Acoustic Model And Computer Program Thereof

Detecting Customers With Low Speech Recognition Accuracy By Investigating Consistency Of Conversation In Call-center

Training Deep Neural Network For Acoustic Modeling In Speech Recognition

System, Method And Program For Improving Pronunciation Accuracy In Speech Recognition

Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, And Recording Medium

Immersive Interactive Telepresence

Unsupervised Training Method, Training Apparatus, And Training Program For N-gram Language Model

Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, And Recording Medium

Method Of Selecting Training Text For Language Model, And Method Of Training Language Model Using The Training Text, And Computer And Computer Program For Executing The Methods

Method For Improving Acoustic Model, Computer For Improving Acoustic Model And Computer Program Thereof

Projects

AI in Tokyo

Speech Technologies

Top collaborators

Takashi Fukuda

Masayasu Muraoka

Hiroshi Kanayama

Samuel Thomas