MikeTalk: An Adaptive Man-Machine Interface


Tony Ezzat, Volker Blanz

Tomaso Poggio

TTVS Overview

• Input: Text

• Output: Photo-realistic talking face uttering text

Desktop Agents

You have received 1 email from Tommy Poggio.

Customer Support

You have bought 20 shares of SONY at $40 each.

Advertisements

Hi Tony, would you be interested in a ticket from Boston to New York for $50.00?

Modules

Phoneme Corpus

Step 1:

– collect a visual corpus from a subject

– corpus contains 44 words

– one word for each American English phoneme

6 Consonantal Visemes

Step 2:

– extract one image per phoneme: viseme

– group visemes together by visual similarity (an illustrative grouping is sketched below)

9 Vocalic Visemes (+ 1 Silence Viseme)
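The slides do not list the exact grouping MikeTalk uses, so the table below is only illustrative: phonemes that share the same lip shape (e.g. the bilabials /p/, /b/, /m/) collapse into a single viseme class, and the mapping is inverted for lookup at synthesis time.

```python
# Illustrative only, not the exact MikeTalk grouping: phonemes that look the
# same on the lips are collapsed into a single viseme class.
VISEME_CLASSES = {
    "bilabial":    ["p", "b", "m"],   # lips pressed together
    "labiodental": ["f", "v"],        # lower lip against upper teeth
    "silence":     ["sil"],           # closed, neutral mouth
}

# Invert the table so a phoneme can be mapped to its viseme at synthesis time.
PHONEME_TO_VISEME = {ph: vis for vis, phonemes in VISEME_CLASSES.items()
                     for ph in phonemes}

print(PHONEME_TO_VISEME["b"])  # -> "bilabial"
```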

Problem 1: Need to Interpolate!

Solution: Morphing!

Simultaneous interpolation of shape & texture (Beier & Neely 1992)

Problem 2: Too tedious to specify correspondence by hand across many images!

Solution: Optical Flow

• To interpolate between two visemes, optical flow is first computed

• A 2D motion vector field (dx(x, y), dy(x, y)) is produced (see the flow sketch below)

(Horn & Schunck 1981; Lucas & Kanade 1981)
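A minimal sketch of computing the flow between two viseme images. The slides cite Horn & Schunck and Lucas & Kanade; OpenCV's Farnebäck routine is used here only as a readily available dense-flow stand-in, and the two synthetic "mouth" images exist just so the snippet runs on its own.

```python
import cv2
import numpy as np

# Two synthetic grayscale viseme images: a filled circle standing in for the mouth.
viseme_a = np.zeros((120, 160), dtype=np.uint8)
viseme_b = np.zeros((120, 160), dtype=np.uint8)
cv2.circle(viseme_a, (80, 60), 20, 255, -1)   # mouth in image A
cv2.circle(viseme_b, (85, 60), 25, 255, -1)   # slightly shifted and opened in image B

# Dense optical flow from A to B (Farneback as a stand-in for the cited methods).
# flow[y, x] = (dx(x, y), dy(x, y)): where each pixel of A moves to match B.
flow = cv2.calcOpticalFlowFarneback(viseme_a, viseme_b, None,
                                    0.5, 3, 15, 3, 5, 1.2, 0)
dx, dy = flow[..., 0], flow[..., 1]
print(dx.shape, dy.shape)  # two 2D fields, one per image axis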

Morphing

• Forward warping A to B

• Forward warping B to A

• Blending

• Hole filling (a warp-and-blend sketch follows)
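A minimal sketch of the warp-and-blend step, assuming flow fields like those from the previous snippet. The splatting warp and the single-pixel hole filling here are deliberately crude stand-ins for the actual forward-warping and hole-filling used in the system.

```python
import numpy as np

def forward_warp(image, dx, dy, alpha):
    """Push each pixel of `image` a fraction `alpha` of the way along the flow.
    A simple splatting warp; gaps it leaves are filled with the source pixel."""
    h, w = image.shape
    out = np.zeros((h, w), dtype=np.float64)
    hits = np.zeros((h, w), dtype=np.float64)
    ys, xs = np.mgrid[0:h, 0:w]
    xt = np.clip(np.round(xs + alpha * dx).astype(int), 0, w - 1)
    yt = np.clip(np.round(ys + alpha * dy).astype(int), 0, h - 1)
    np.add.at(out, (yt, xt), image)    # splat source pixels onto their targets
    np.add.at(hits, (yt, xt), 1.0)
    filled = hits > 0
    out[filled] /= hits[filled]        # average where several pixels land
    out[~filled] = image[~filled]      # crude hole filling
    return out

def morph(A, B, flow_ab, flow_ba, alpha):
    """Blend a forward warp of A toward B with a forward warp of B toward A."""
    warped_a = forward_warp(A, flow_ab[..., 0], flow_ab[..., 1], alpha)
    warped_b = forward_warp(B, flow_ba[..., 0], flow_ba[..., 1], 1.0 - alpha)
    return (1.0 - alpha) * warped_a + alpha * warped_b

# With zero flow the morph reduces to a plain cross-dissolve.
A, B = np.zeros((4, 4)), np.ones((4, 4))
zero_flow = np.zeros((4, 4, 2))
print(morph(A, B, zero_flow, zero_flow, 0.5))
```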

Synthesis Database

• 16 Visemes total

• 256 optical flow fields in total: one from each of the 16 visemes to each of the 16 (see the table-building sketch below)
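A minimal sketch of how such an all-pairs table could be precomputed: with 16 visemes the nested loop yields 16 × 16 = 256 entries, including the trivial self-to-self flows. The circle images and the Farnebäck call are stand-ins so the snippet runs on its own.

```python
import itertools
import cv2
import numpy as np

# Stand-in viseme images: mouths that "open" progressively.
visemes = []
for r in range(16):
    img = np.zeros((120, 160), dtype=np.uint8)
    cv2.circle(img, (80, 60), 10 + r, 255, -1)
    visemes.append(img)

# One dense flow field per ordered viseme pair, keyed by (from, to).
flow_table = {}
for i, j in itertools.product(range(len(visemes)), repeat=2):
    flow_table[(i, j)] = cv2.calcOpticalFlowFarneback(
        visemes[i], visemes[j], None, 0.5, 3, 15, 3, 5, 1.2, 0)

print(len(flow_table))  # 256
```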

Concatenation and Lip Sync

• Load the correct viseme transitions

• Concatenate viseme transitions

• Sample the viseme transitions using audio durations (see the sampling sketch below)
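A minimal sketch of the sampling step, under the assumption that each phoneme arrives with a duration taken from the audio. For every consecutive viseme pair, the transition is sampled at evenly spaced morph parameters so the number of video frames matches the phoneme's duration; `sample_transitions` and its input format are hypothetical.

```python
FPS = 30  # assumed video frame rate

def sample_transitions(phoneme_track):
    """phoneme_track: list of (viseme_index, duration_seconds) pairs.
    Returns one (from_viseme, to_viseme, alpha) triple per output video frame."""
    frames = []
    for (v_from, dur), (v_to, _) in zip(phoneme_track, phoneme_track[1:]):
        n = max(1, round(dur * FPS))              # frames allotted to this phoneme
        for k in range(n):
            frames.append((v_from, v_to, k / n))  # alpha runs from 0 toward 1
    return frames

# Example: three phonemes (as viseme indices) with their audio durations.
print(sample_transitions([(3, 0.12), (7, 0.20), (0, 0.15)])[:5])
```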

Examples

“1, 2, 3, 4, 5”

“cat, dog, pig, cow, moose, horse, sheep”

“You have received 10 email messages.”

Current Work

• Coarticulation

• Eye + head movements

• Emotion

• 3D instead of 2D

• Psychophysics

3D (with Volker Blanz)

The End

Coarticulation

• Problem: Current method does not handle coarticulation, so speech looks overly articulated

• Can record all possible triphones/quadriphones, but this approach requires a lot of data!

• Best method is to learn a model for coarticulation, but what is the representation for the lips?

Principal Components Analysis

• Each image is a vector in a high-dimensional space

• Using PCA, find the optimal set of vectors that span the space

• Project the entire corpus onto those basis vectors (see the sketch below)
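A minimal sketch of this step: each lip image is flattened into a vector, the corpus becomes a data matrix, and an SVD of the centered data yields the principal components onto which the corpus is projected. The random matrix stands in for the real corpus so the snippet runs.

```python
import numpy as np

rng = np.random.default_rng(0)
corpus = rng.random((200, 32 * 32))      # stand-in: 200 flattened 32x32 lip images

mean = corpus.mean(axis=0)
X = corpus - mean                        # center the data
U, S, Vt = np.linalg.svd(X, full_matrices=False)

basis = Vt[:2]                           # top-2 PCA bases (cf. the /buut/ and /get/ slides)
coeffs = X @ basis.T                     # projection of the whole corpus onto them
print(basis.shape, coeffs.shape)         # (2, 1024) (200, 2)
```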

Top 2 PCA Bases for /buut/

Top 2 PCA Bases for /get/

Problem: Too nonlinear!

Flow Component Analysis

• Compute optical flow from a reference lip image to all other images in the corpus

• Compute PCA on all the flows (see the sketch below)
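A minimal sketch of flow component analysis, mirroring the PCA snippet above: dense optical flow is computed from one reference lip image to every other image, each (dx, dy) field is flattened into a vector, and PCA is run on those flow vectors instead of on raw pixels. The circle images and the Farnebäck call are stand-ins so the snippet runs on its own.

```python
import cv2
import numpy as np

# Stand-in lip images that "open" progressively; the first one is the reference.
corpus = []
for r in range(8, 28):
    img = np.zeros((64, 64), dtype=np.uint8)
    cv2.circle(img, (32, 32), r, 255, -1)
    corpus.append(img)
reference = corpus[0]

# One flattened flow vector per corpus image (dx and dy concatenated).
flows = []
for img in corpus[1:]:
    flow = cv2.calcOpticalFlowFarneback(reference, img, None,
                                        0.5, 3, 15, 3, 5, 1.2, 0)
    flows.append(flow.reshape(-1))

F = np.stack(flows)
F = F - F.mean(axis=0)                   # center the flow vectors
U, S, Vt = np.linalg.svd(F, full_matrices=False)
flow_basis = Vt[:2]                      # top-2 flow components (cf. the FPCA slides)
print(flow_basis.shape)
```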

Top 2 FPCA Bases for /buut/

Top 2 FPCA Bases for /get/

Much more linear behavior!

Current Work

• Now that we have parameterized the mouth, what is the model for mouth synthesis?

• How is that model fit to the PCA data?
