Loading...
Please wait, while we are loading the content...
Similar Documents
Inversion from Audiovisual Speech to Articulatory Information by Exploiting Multimodal Data
| Content Provider | Semantic Scholar |
|---|---|
| Author | Katsamanis, Athanasios Roussos, Anastasios Maragos, Petros Aron, Michael Berger, Marie-Odile |
| Copyright Year | 2008 |
| Abstract | We present an inversion framework to identify speech production properties from audiovisual information. Our system is built on a multimodal articulatory dataset comprising ultrasound, X-ray, magnetic resonance images as well as audio and stereovisual recordings of the speaker. Visual information is captured via stereovision while the vocal tract state is represented by a properly trained articulatory model. Inversion is based on an adaptive piecewise linear approximation of the audiovisualto- articulation mapping. The presented system can recover the hidden vocal tract shapes and may serve as a basis for a more widely applicable inversion setup. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://cvsp.cs.ntua.gr/publications/confr/KatsamanisRoussosMaragosAronBerger_AVInversionMultimodalArtData_ISSP2008.pdf |
| Alternate Webpage(s) | http://issp2008.loria.fr/Proceedings/PDF/issp2008-69.pdf |
| Alternate Webpage(s) | http://hal.archives-ouvertes.fr/docs/00/32/70/31/PDF/KatsamanisRoussosMaragosAronBerger_AVInversionMultimodalArtData_issp2008.pdf |
| Alternate Webpage(s) | http://cvsp.cs.ntua.gr/~nassos/presentations/ASPI_MultimodalData_issp08_forWeb.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |