Visual-speech to text conversion applicable to telephone communication for deaf individuals
30th April 2013
In the lip-reading technique, speech can be understood by interpreting the movements of the lips, face, and tongue. However, the correspondence between visible movements and phonemes is not one-to-one: it is impossible to distinguish all phonemes using visual information alone.
INTRODUCTION
The Cued Speech system, developed by Cornett, contains two components: the hand shape and the hand position relative to the face. Hand shapes code the consonant phonemes, and hand positions code the vowel phonemes. Cued Speech improves speech perception to a large extent.
AIM OF NEW SYSTEM
To investigate the design of a system able to automatically recognize Cued Speech and convert it to text.
Such a system would make it possible for deaf or speech-impaired individuals to communicate with each other, and also with normal-hearing persons, using gestures captured by devices equipped with a camera.
METHODS
Corpus, feature extraction, and
statistical modeling
The data were derived from a video recording of the cuers pronouncing and coding in Cued Speech. The speakers' lips were painted blue, and landmarks of different colors were placed on the speakers' fingers, allowing a faster and more accurate image processing stage.
The audio part of the video recording was
synchronized with the image.
An automatic image processing method was applied to the video to extract the lip shape parameters: lip width (A), lip aperture (B), lip area (S), and the pinching of the upper (Bsup) and lower (Binf) lip.
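As a rough illustration of how such geometric lip parameters can be obtained from this kind of footage, the sketch below (Python with OpenCV, an assumption; the slides do not name the actual tool chain) thresholds the blue-painted lip region and derives width (A), aperture (B), and area (S) from the resulting contour. The HSV range and the helper name are illustrative; estimating Bsup and Binf would additionally require separating the upper and lower lip contours.

import cv2
import numpy as np

def lip_parameters(frame_bgr):
    # Isolate the blue-painted lips with an HSV colour threshold
    # (the range below is an assumed value, to be tuned on the actual footage).
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, np.array([100, 80, 40]), np.array([130, 255, 255]))

    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return None                        # lips not found in this frame
    lips = max(contours, key=cv2.contourArea)

    x, y, w, h = cv2.boundingRect(lips)
    A = float(w)                           # lip width
    B = float(h)                           # lip aperture (outer height)
    S = float(cv2.contourArea(lips))       # lip area
    return A, B, S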
Concatenative feature fusion
The hand tracking stage extracts the xy coordinates of the finger landmarks at each time frame and uses those values as features in the HMM modeling. Concatenative feature fusion uses the concatenation of the synchronous lip shape and hand features as the joint feature vector given by
$O^{LH} = [\,O^{L\top}\; O^{H\top}\,]^{\top}$

where $O^{L}$ is the lip shape feature vector, $O^{H}$ is the hand feature vector, and $O^{LH}$ is the joint lip-hand feature vector. The dimensionality of the joint feature vector is $D^{LH} = D^{L} + D^{H}$.
(Figure: parameters used for lip shape modeling.)
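A minimal sketch of this concatenation step, assuming the two feature streams are already frame-synchronous and using illustrative dimensionalities (not taken from the study), is given below.

import numpy as np

def fuse_features(lip_feats, hand_feats):
    # lip_feats: (T, D_L) lip shape features; hand_feats: (T, D_H) hand features,
    # assumed synchronized frame by frame with the lip stream.
    assert lip_feats.shape[0] == hand_feats.shape[0], "streams must be frame-synchronous"
    return np.hstack([lip_feats, hand_feats])   # joint vectors of size D_L + D_H

# Example with assumed dimensionalities:
T, D_L, D_H = 120, 8, 10
joint = fuse_features(np.random.randn(T, D_L), np.random.randn(T, D_H))
print(joint.shape)   # (120, 18)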
RESULTS
Isolated word recognition
1. Recognition in the normal-hearing subject
2. Recognition in the deaf subject
3. Multi-speaker isolated word recognition: the aim was to investigate whether it is possible to train speaker-independent HMMs for Cued Speech recognition. The training data consisted of 750 words from the normal-hearing subject and 750 words from the deaf subject. For testing, 700 words from the normal-hearing subject and 700 words from the deaf subject were used.
Each state was modeled with a mixture of 4
Gaussian distributions.
For lip shape and hand shape integration,
concatenative feature fusion was used.
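To make the modeling step concrete, the sketch below trains one Gaussian-mixture HMM per word on the fused feature sequences and recognizes a test sequence by maximum likelihood. The hmmlearn library, the number of states, and the function names are assumptions; only the mixture of 4 Gaussians per state and the use of concatenative fusion come from the slides.

import numpy as np
from hmmlearn.hmm import GMMHMM   # assumed stand-in for an HTK-style toolkit

def train_word_models(training_data, n_states=5, n_mix=4):
    # training_data: dict mapping a word label to a list of joint feature
    # sequences, each of shape (T_i, D).  n_states is an assumed value;
    # the slides only state that each state used a mixture of 4 Gaussians.
    models = {}
    for word, sequences in training_data.items():
        X = np.vstack(sequences)
        lengths = [len(seq) for seq in sequences]
        hmm = GMMHMM(n_components=n_states, n_mix=n_mix,
                     covariance_type="diag", n_iter=20)
        hmm.fit(X, lengths)
        models[word] = hmm
    return models

def recognize(models, sequence):
    # Pick the word whose HMM assigns the highest log-likelihood to the sequence.
    return max(models, key=lambda w: models[w].score(sequence))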
4. Continuous phoneme recognition
Phoneme correct rate for continuous phoneme recognition in the case of the normal-hearing subject.
Phoneme correct rate for continuous phoneme recognition in the case of the deaf subject.
Hand shapes and lip shapes were integrated using concatenative feature fusion, and HMM-based automatic recognition was conducted. For continuous phoneme recognition, an 86% phoneme correct rate was achieved for the normal-hearing cuer and an 82.7% phoneme correct rate for the deaf cuer. Isolated word recognition experiments with both the normal-hearing and the deaf subject were also conducted, obtaining 94.9% and 89% accuracy, respectively.
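For readers unfamiliar with these metrics, "phoneme correct" and "accuracy" are usually computed from a minimum-edit-distance alignment between the reference and the recognized label strings: correct = (N - D - S)/N and accuracy = (N - D - S - I)/N, where N is the number of reference labels and D, S, I are deletions, substitutions, and insertions. The sketch below implements this standard HTK-style scoring; it is illustrative, not the scoring tool used in the study.

def phoneme_scores(ref, hyp):
    # ref, hyp: lists of phoneme labels (reference and recognizer output).
    # Returns (percent correct, percent accuracy) for this utterance.
    n, m = len(ref), len(hyp)
    # cost[i][j] = (edit distance, deletions, substitutions, insertions)
    cost = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cost[i][0] = (i, i, 0, 0)          # all deletions
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)          # all insertions
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dele = cost[i - 1][j]
            subs = cost[i - 1][j - 1]
            inse = cost[i][j - 1]
            s = 0 if ref[i - 1] == hyp[j - 1] else 1
            cost[i][j] = min([
                (dele[0] + 1, dele[1] + 1, dele[2], dele[3]),
                (subs[0] + s, subs[1], subs[2] + s, subs[3]),
                (inse[0] + 1, inse[1], inse[2], inse[3] + 1),
            ])
    _, D, S, I = cost[n][m]
    N = float(n)
    return 100.0 * (N - D - S) / N, 100.0 * (N - D - S - I) / N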
CONCLUSION
A multi-speaker experiment using data from both the normal-hearing and the deaf subject showed an 89.6% word accuracy on average. This result indicates that training speaker-independent HMMs for Cued Speech using a large number of subjects should not face particular difficulties.
REFERENCES
G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audiovisual speech," Proceedings of the IEEE, vol. 91, no. 9, pp. 1306–1326, 2003.
S. Nakamura, K. Kumatani, and S. Tamura, "Multi-modal temporal asynchronicity modeling by product HMMs for robust audio-visual speech recognition," in Proceedings of the Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), p. 305, 2002.
R. O. Cornett, "Cued speech," American Annals of the Deaf, vol. 112, pp. 3–13, 1967.
J. Leybaert, "Phonology acquired through the eyes and spelling in deaf children," Journal of Experimental Child Psychology, vol. 75, pp. 291–318, 2000.