Loading...
Please wait, while we are loading the content...
Similar Documents
Visual coding and tracking of speech related facial motion (2001)
| Content Provider | CiteSeerX |
|---|---|
| Author | Reveret, Lionel Essa, Irfan |
| Abstract | This article present a visual characterization of facial motions inherent with speaking. We propose a set of four Facial Speech Parameters (FSP): jaw opening, lips rounding, lips closure, and lips raising, to represent the primary visual gestures of speech articulation into a multidimensional linear manifold. This manifold is initially generated as a statistical model, obtained by analyzing accurate 3D data of a reference human subject. The FSP are then associated to the linear modes of this statistical model, resulting in a 3D parametric facial mesh. We have tested the speaker-independent hypothesis of this manifold with a model-based video tracking task applied on different subjects. Firstly, the parametric model is adapted and aligned to a subject’s face for a single shape. Then the face motion is tracked by optimally aligning the incoming video frames with the face model, textured with the first image, and deformed by varying the FSP, head rotations, and translations. We show results of the tracking for different subjects using our method. Finally, we demonstrate the facial activity encoding into the four FSP values to represent speaker-independent phonetic information. 1 |
| File Format | |
| Journal | IEEE CVPR International Workshop on Cues in Communication, Hawai, USA, Decembre 9 |
| Language | English |
| Publisher Date | 2001-01-01 |
| Access Restriction | Open |
| Subject Keyword | Facial Motion Visual Coding Statistical Model Different Subject Linear Mode Model-based Video Speech Articulation Parametric Facial Mesh Subject Face Primary Visual Gesture Fsp Value Multidimensional Linear Manifold Parametric Model Head Rotation Single Shape Face Model Facial Activity Speaker-independent Hypothesis Lip Closure Face Motion Visual Characterization Jaw Opening First Image Speaker-independent Phonetic Information Reference Human Subject Facial Speech Parameter Incoming Video Frame |
| Content Type | Text |
| Resource Type | Technical Report |