9
Exploring Universal Attri bute Characterization of Spoken Languages for Spok en Language Recognition

Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

  • View
    213

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recogniti

on

Page 2: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Outline

• Introduction

• UAR-FrondEnd

• VSM-BackEnd

• Experiment

Page 3: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Introduction

• Here we focus on the token-based.

• Propse an alternative universal acoustic characterization of spoken languages based on acoustic phonetic feature.

• The advantage of using attribute-based unit is they can be define universally across all language.

Page 4: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

System Overview

Page 5: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

UAR-FrondEnd

• The frond-end processing module tokenize all spoken utterances into sequences of speech unit using a universal attribute recognizer.

• Two phoneme-to-attribute table are created that are phoneme-to-manner and phoneme-to-place.

Page 6: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

VSM-BackEnd

• Each transcription is converted into a vector-based representation by applying LSA.

Page 7: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Experiment

• The OGI-TS corpus is used to train the articulatory recognizer. This corpus has phonetic transcriptions for six language.

Page 8: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Experiment

• CallFriend corpus is used for training the back-end language models.

• Test are carried out on the NIST 2003 spoken language evaluation material.

Page 9: Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Experiment