Topics in decision tree based speech synthesis

R. Donovan

doi:10.1016/S0885-2308(02)00031-1

Computer Speech and Language

Paper

01 Jan 2003

Topics in decision tree based speech synthesis

View publication

Abstract

Most modern speech synthesis systems using context dependent decision trees in their acoustic synthesis modules are unit selection style concatenative speech synthesis systems using the trees essentially as a form of pruning during their segment search. The IBM Trainable Speech Synthesis System is one such system. This paper begins by discussing the advantages and disadvantages of the decision tree and non-decision tree approaches to unit selection synthesis. It goes on to present the results of formal listening tests conducted on the IBM system to investigate a number of different topics pertinent to decision tree based systems. These include the use of extended context features during clustering, the effect of using trees with different numbers of leaves and different numbers of segments per leaf, and the performance of several different offline segment preselection algorithms.

Conference paper

Supporting greater access to pre- and post-natal information and services for women in Rural Kenya

Jakita O. Thomas, Eric Mibuari, et al.

CHI 2011

Conference paper

Hierarchical variational loopy belief propagation for multi-talker speech recognition

Steven J. Rennie, John R. Hershey, et al.

ASRU 2009

Conference paper

Transforming the Ul for anyone, anywhere. Enabling an increased variety of users, devices, and tasks through interface transformations

Charles Wieeha, Pedro Szekely

CHI EA 2001

Paper

Autonomic features of the IBM DB2 universal database for Linux, UNIX, and Windows

Christian M. Garcia-Arellano, Sam S. Lightstone, et al.

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

View all publications

Abstract

Related

Supporting greater access to pre- and post-natal information and services for women in Rural Kenya

Hierarchical variational loopy belief propagation for multi-talker speech recognition

Transforming the Ul for anyone, anywhere. Enabling an increased variety of users, devices, and tasks through interface transformations

Autonomic features of the IBM DB2 universal database for Linux, UNIX, and Windows