Patent Number: 6,250,928

Title: Talking facial display method and apparatus

Abstract: A method and apparatus of converting input text into an audio-visual speech stream resulting in a talking face image enunciating the text. This method of converting input text into an audio-visual speech stream comprises the steps of: recording a visual corpus of a human-subject, building a viseme interpolation database, and synchronizing the talking face image with the text stream. In a preferred embodiment, viseme transitions are automatically calculated using optical flow methods, and morphing techniques are employed to result in smooth viseme transitions. The viseme transitions are concatenated together and synchronized with the phonemes according to the timing information. The audio-visual speech stream is then displayed in real time, thereby displaying a photo-realistic talking face.

Inventors: Poggio; Tomaso A. (Wellesley, MA), Ezzat; Antoine F. (Boston, MA)

Assignee: Massachusetts Institute of Technology

International Classification: G09B 19/04 (20060101); G09B 019/04 ()

Expiration Date: 06/26/2018