أعرض تسجيلة المادة بشكل مبسط

dc.creator Ezzat, Tony
dc.creator Poggio, Tomaso
dc.date 2004-10-20T21:04:35Z
dc.date 2004-10-20T21:04:35Z
dc.date 1999-05-01
dc.date.accessioned 2013-10-09T02:48:50Z
dc.date.available 2013-10-09T02:48:50Z
dc.date.issued 2013-10-09
dc.identifier AIM-1658
dc.identifier CBCL-173
dc.identifier http://hdl.handle.net/1721.1/7263
dc.identifier.uri http://koha.mediu.edu.my:8181/xmlui/handle/1721
dc.description We present MikeTalk, a text-to-audiovisual speech synthesizer which converts input text into an audiovisual speech stream. MikeTalk is built using visemes, which are a small set of images spanning a large range of mouth shapes. The visemes are acquired from a recorded visual corpus of a human subject which is specifically designed to elicit one instantiation of each viseme. Using optical flow methods, correspondence from every viseme to every other viseme is computed automatically. By morphing along this correspondence, a smooth transition between viseme images may be generated. A complete visual utterance is constructed by concatenating viseme transitions. Finally, phoneme and timing information extracted from a text-to-speech synthesizer is exploited to determine which viseme transitions to use, and the rate at which the morphing process should occur. In this manner, we are able to synchronize the visual speech stream with the audio speech stream, and hence give the impression of a photorealistic talking face.
dc.format 5662753 bytes
dc.format 1408669 bytes
dc.format application/postscript
dc.format application/pdf
dc.language en_US
dc.relation AIM-1658
dc.relation CBCL-173
dc.title Visual Speech Synthesis by Morphing Visemes


الملفات في هذه المادة

الملفات الحجم الصيغة عرض

لا توجد أي ملفات مرتبطة بهذه المادة.

هذه المادة تبدو في المجموعات التالية:

أعرض تسجيلة المادة بشكل مبسط