Loading...
Please wait, while we are loading the content...
Similar Documents
Speech and Language Processing: an Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. 8 Speech Synthesis
| Content Provider | Semantic Scholar |
|---|---|
| Author | Jurafsky, Daniel Martin, James H. Barry, Dave |
| Abstract | And computers are getting smarter all the time: Scientists tell us that soon they will be able to talk to us. (By 'they' I mean 'comput-ers': I doubt scientists will ever be able to talk to us.) In Vienna in 1769, Wolfgang von Kempelen built for the Empress Maria Theresa the famous Mechanical Turk, a chess-playing automaton consisting of a wooden box filled with gears, and a mannequin sitting behind the box who played chess by moving pieces with his mechanical arm. The Turk toured Europe and the Americas for decades, defeating Napolean Bonaparte and even playing Charles Babbage. The Mechanical Turk might have been one of the early successes of artificial intelligence if it were not for the fact that it was, alas, a hoax, powered by a chessplayer hidden inside the box. What is perhaps less well-known is that von Kempelen, an extraordinarily prolific inventor, also built between 1769 and 1790 what is definitely not a hoax: the first full-sentence speech synthesizer. His device consisted of a bellows to simulate the lungs, a rubber mouthpiece and a nose aperature, a reed to simulate the vocal folds, various whistles for each of the fricatives. and a small auxiliary bellows to provide the puff of air for plosives. By moving levers with both hands, opening and closing various openings, and adjusting the flexible leather 'vocal tract', different consonants and vowels could be produced. More than two centuries later, we no longer build our speech synthesizers out of wood, leather, and rubber, nor do we need trained human operators. The modern task of speech synthesis, also called text-to-speech or TTS, is to produce speech (acoustic SPEECH SYNTHESIS TEXT-TO-SPEECH TTS waveforms) from text input. Modern speech synthesis has a wide variety of applications. Synthesizers are used, together with speech recognizers, in telephone-based conversational agents that conduct dialogues with people (see Ch. 23). Synthesizer are also important in non-conversational applications that speak to people, such as in devices that read out loud for the blind, or in video games or children's toys. Finally, speech synthesis can be used to speak for sufferers of neurological disorders, such as astrophysicist Steven Hawking who, having lost the use of his voice due to ALS, speaks by typing to a speech synthesizer and having the synthesizer speak out the words. State of the art |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www1.cs.columbia.edu/~julia/papers/J&M-tts.pdf |
| Alternate Webpage(s) | http://www.cs.columbia.edu/~julia/papers/J&M-tts.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |