E-book

Title Articulatory speech synthesis from the fluid dynamics of the vocal apparatus / Stephen Levinson [and others]
Published Cham, Switzerland : Springer, ©2012
Online access available from:
Synthesis Digital Library

Description 1 online resource (xii, 104 pages) : illustrations
Series Synthesis lectures on speech and audio processing, 1932-1678 ; #9
Contents 1. Introduction -- 1.1 History of speech synthesis -- 1.2 Speech production -- 1.3 Contributions -- 1.4 Organization of the book
2. Literature review -- 2.1 Overview of speech synthesis techniques -- 2.1.1 Concatenative synthesis -- 2.1.2 Formant synthesis -- 2.1.3 Articulatory synthesis -- 2.2 Overview of speech production model -- 2.2.1 Source-filter speech production model -- 2.2.2 Fricative model -- 2.2.3 Unvoiced speech sound production model -- 2.3 Overview of articulatory speech model -- 2.3.1 Coker's model -- 2.3.2 Synthesis of speech phonemes -- 2.3.3 Mermelstein's model -- 2.3.4 Task-dynamic model -- 2.4 Overview of the motor control of the articulator -- 2.4.1 A dynamic model of articulation -- 2.4.2 Motor control based on minimum cost principles -- 2.5 Summary
3. Estimation of dynamic articulatory parameters -- 3.1 Cubic spline method -- 3.2 Review of the signal representation techniques -- 3.2.1 Introduction -- 3.2.2 L2 space -- 3.2.3 Convolution-based signal representations -- 3.2.4 Interpolation and quasi-interpolation -- 3.2.5 Convolution-based least squares -- 3.2.6 Strang-Fix conditions -- 3.3 Pointwise error analysis -- 3.3.1 Interpolation error -- 3.3.2 Least squares error -- 3.4 L2 error analysis -- 3.4.1 L2 error of quasi-interpolation -- 3.4.2 L2 error of the LS approximation -- 3.4.3 Comparison -- 3.5 Experimental results -- 3.6 Discussion -- 3.7 Future work -- 3.8 Summary
4. Construction of articulatory model based on MRI data -- 4.1 Problem formulation -- 4.2 Vocal cords models -- 4.3 Multi-mass model -- 4.4 Simulation result and future work
5. Vocal fold excitation models -- 5.1 Parametric models -- 5.1.1 Rosenberg's model -- 5.1.2 Titze's model -- 5.2 Mechanical model -- 5.2.1 Two-mass model -- 5.2.2 M-mass model -- 5.3 Simulation results -- 5.4 Discussion -- 5.5 Summary
6. Experimental results of articulatory synthesis -- 6.1 Governing equations, fluid dynamics analysis -- 6.2 Synthesized waveform -- 6.3 Speech analysis results -- 6.3.1 LPC spectrum and the short-time power spectrum -- 6.3.2 Spectrogram -- 6.4 Analysis of the velocity, vorticity, and pressure fields -- 6.5 Summary
7. Conclusion -- Bibliography -- Authors' biographies
Summary This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source-filter model, which assumes the independence of the excitation and filter, we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves through a viscous, compressible fluid described by the Navier-Stokes equations. We propose a combined minimum energy and minimum jerk criterion to compute the dynamics of the vocal tract during articulation. Theoretical error bounds and experimental results show that this method obtains a close match to the phonetic target positions while avoiding abrupt changes in the articulatory trajectory. The vocal folds are set into aerodynamic oscillation by the flow of air from the lungs. The modulated air stream then excites the moving vocal tract. This method shows strong evidence for source-filter interaction. Based on our results, we propose that the articulatory speech production model has the potential to synthesize speech and provide a compact parameterization of the speech signal that can be useful in a wide variety of speech signal processing problems.
Bibliography Includes bibliographical references (pages 95-101)
Notes Online resource; title from PDF title page (Morgan & Claypool, viewed July 23, 2012)
Subject Speech synthesis -- Mathematical models
Speech -- Physiological aspects -- Mathematical models
COMPUTERS -- Optical Data Processing.
Form Electronic book
Author Levinson, Stephen E
ISBN 9781598291797
1598291793
1598291785
9781598291780
9783031025631
3031025636