Chapter 6






J. Allen, M. S. Hunnicutt, and D. H. Klatt, From Text to Speech: The MITalk System. New York: Cambridge University Press, 1987. A description of a complete text-to-speech system.

R. Carlson and B. Granströum. "A Text-to-Speech System Based Entirely on Rules." Proc. Int. Conf. Acoust. Speech Signal Process. ICASSP-76 (1976): 686-88. A rule-based approach to text-to-speech synthesis.

C. H. Coker, N. Umeda, and C. P. Browman. "Automatic Synthesis from Ordinary English Text." IEEE Trans. Audio Electroacoust. AU-21 (1973): 293-97. A rule-based synthesizer using articulatory parameters.

H. Dudley, R. R. Riesz, and S. A. Watkins. "A Synthetic Speaker."J. Franklin Inst. 227 (1939): 739-64. A description of the 1938 speaking machine.

H. Dudley and T. H. Tarnoczy. "The Speaking Machine of Wolfgang Kempelen."J. Acoust. Soc. Am. 22 (1950): 151-66.

J. L. Flanagan. Speech Analysis, Synthesis and Perception. New York: Springer, 1972. This technical and comprehensive book includes historical sections and a large bibliography.

J. L. Flanagan, K. Ishizaka, and K. L. Shipley. "Synthesis of Speech from a Dynamic Model of the Vocal Cords and Vocal Tract." Bell Syst. Tech. J. 54 (1975): 485-506. A synthesis scheme from articulatory parameters.

H. L. F. v. Helmholtz. On the Sensation of Tone, A. J. Ellis, transl. New York: Dover, 1954. (Trans. of 4th German ed., 1877.) A comprehensive book on theories of acoustics.

J. Hirschberg. "Pitch Accent in Context: Predicting Intonational Prominence from Text." Artificial Intelligence 63, 1-2 (1993). Rules for pitch contours from prediction of sentence-level stress.

D. H. Klatt. "Linguistic Uses of Segmental Duration in English: Acoustic and Perceptual Evidence."J. Acoust. Soc. Am. 59 (1976): 1208-1221. The duration of phonemes in speech.

D. H. Klatt. "Review of Text-to-Speech Conversion for English." J. Acoust. Soc. Am. 82, no. 3 (1987): 737-93. A complete review of the work on speech synthesis to 1987.

I. Lehiste. Suprasegmentals. Cambridge, Mass.: MIT Press, 1970. Discussion of speech phenomena beyond the phonemes, including the prosody of speech.

J. P. Olive. "Rule Synthesis of Speech from Dyadic Units." Proc. Int. Conf. Acoust. Speech Signal Process. ICASSP-77 (1977): 568-70. Synthesis from stored segments.

J. P. Olive, A. Greenwood, and J. Coleman. Acoustics of American English Speech. New York: Springer, 1993. A description of speech sounds with an introduction to phonetics and the theory of speech sound.

J. P. Olive and L. H. Nakatani. "Rule Synthesis of Speech by Word Concatenation: A First Step." J. Acoust. Soc. Am. 55 (1974): 660-66.

G. E. Peterson, W. Wang, and E. Sivertsen. "Segmentation Techniques in Speech Synthesis." J. Acoust. Soc. Am. 30 (1958): 793-42. Synthesis from stored segments.

J. Pierrehumbert. "Synthesizing Intonation." J. Acoust. Soc. Am. 70 (1981): 985-95. Rules for synthesizing pitch contours.

L. R. Rabiner, R. W. Schafer, and L. L. Flanagan. "Computer Synthesis of Speech by Concatenation of Formant-Coded Words." Bell Syst. Tech. J. 50 (1971): 1541-58. Synthesis from stored words.


top of pageauthor infoorderfurther reading