|
J. Allen, M. S. Hunnicutt, and D. H. Klatt, From Text to Speech: The
MITalk System. New York: Cambridge University Press, 1987. A description
of a complete text-to-speech system.
R. Carlson and B. Granströum. "A Text-to-Speech System Based Entirely
on Rules." Proc. Int. Conf. Acoust. Speech Signal Process.
ICASSP-76 (1976): 686-88. A rule-based approach to text-to-speech synthesis.
C. H. Coker, N. Umeda, and C. P. Browman. "Automatic Synthesis from
Ordinary English Text." IEEE Trans. Audio Electroacoust. AU-21
(1973): 293-97. A rule-based synthesizer using articulatory parameters.
H. Dudley, R. R. Riesz, and S. A. Watkins. "A Synthetic Speaker."J.
Franklin Inst. 227 (1939): 739-64. A description of the 1938 speaking
machine.
H. Dudley and T. H. Tarnoczy. "The Speaking Machine of Wolfgang Kempelen."J.
Acoust. Soc. Am. 22 (1950): 151-66.
J. L. Flanagan. Speech Analysis, Synthesis and Perception. New
York: Springer, 1972. This technical and comprehensive book includes historical
sections and a large bibliography.
J. L. Flanagan, K. Ishizaka, and K. L. Shipley. "Synthesis of Speech
from a Dynamic Model of the Vocal Cords and Vocal Tract." Bell
Syst. Tech. J. 54 (1975): 485-506. A synthesis scheme from articulatory
parameters.
H. L. F. v. Helmholtz. On the Sensation of Tone, A. J. Ellis,
transl. New York: Dover, 1954. (Trans. of 4th German ed., 1877.) A comprehensive
book on theories of acoustics.
J. Hirschberg. "Pitch Accent in Context: Predicting Intonational Prominence
from Text." Artificial Intelligence 63, 1-2 (1993). Rules
for pitch contours from prediction of sentence-level stress.
D. H. Klatt. "Linguistic Uses of Segmental Duration in English: Acoustic
and Perceptual Evidence."J. Acoust. Soc. Am. 59 (1976): 1208-1221.
The duration of phonemes in speech.
D. H. Klatt. "Review of Text-to-Speech Conversion for English."
J. Acoust. Soc. Am. 82, no. 3 (1987): 737-93. A complete review
of the work on speech synthesis to 1987.
I. Lehiste. Suprasegmentals. Cambridge, Mass.: MIT Press, 1970.
Discussion of speech phenomena beyond the phonemes, including the prosody
of speech.
J. P. Olive. "Rule Synthesis of Speech from Dyadic Units." Proc.
Int. Conf. Acoust. Speech Signal Process. ICASSP-77 (1977): 568-70.
Synthesis from stored segments.
J. P. Olive, A. Greenwood, and J. Coleman. Acoustics of American English
Speech. New York: Springer, 1993. A description of speech sounds with
an introduction to phonetics and the theory of speech sound.
J. P. Olive and L. H. Nakatani. "Rule Synthesis of Speech by Word Concatenation:
A First Step." J. Acoust. Soc. Am. 55 (1974): 660-66.
G. E. Peterson, W. Wang, and E. Sivertsen. "Segmentation Techniques
in Speech Synthesis." J. Acoust. Soc. Am. 30 (1958): 793-42.
Synthesis from stored segments.
J. Pierrehumbert. "Synthesizing Intonation." J. Acoust. Soc.
Am. 70 (1981): 985-95. Rules for synthesizing pitch contours.
L. R. Rabiner, R. W. Schafer, and L. L. Flanagan. "Computer Synthesis
of Speech by Concatenation of Formant-Coded Words." Bell Syst.
Tech. J. 50 (1971): 1541-58. Synthesis from stored words.
   
|