Speech Recognition 1


  • 8/8/2019 Speech Recognition 1


    Speech Recognition


    Introduction

    What is Speech Recognition?

    - Voice Recognition?

    Where can it be used?

    - Dictation

    - System control/navigation

    - Commercial/Industrial applications

    - Hand held digital recorders


    Contents:

    Continuous/Discrete

    How does it work?

    Recent improvements

    Current software options

    Future of SR


    Continuous or Discrete?

    Continuous speech

    - dictation

    Discrete speech

    - system controls


    How does SR work?

    Recognition

    Training

    Correction

    Command/Control


    Recognition (1)

    [Diagram: Voice Input, Analog to Digital, Acoustic Model, Language Model, Display, Speech Engine, Feedback]
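
The "Analog to Digital" stage in the diagram can be illustrated with a minimal sketch. It samples a continuous signal at fixed intervals and quantizes each sample to an integer. The 16 kHz sample rate, the 16-bit depth, and the names `digitize` and `tone` are assumptions made for illustration; the slide does not specify any of them.

```python
import math

SAMPLE_RATE_HZ = 16_000   # assumed; not stated on the slide
BIT_DEPTH = 16            # assumed; not stated on the slide


def digitize(signal, duration_s, sample_rate_hz=SAMPLE_RATE_HZ, bit_depth=BIT_DEPTH):
    """Sample a continuous signal and quantize it to signed integers.

    `signal` is a function of time (seconds) returning values in [-1.0, 1.0].
    """
    max_level = 2 ** (bit_depth - 1) - 1            # 32767 for 16 bits
    n_samples = round(duration_s * sample_rate_hz)
    samples = []
    for n in range(n_samples):
        t = n / sample_rate_hz                      # time of the n-th sample
        value = max(-1.0, min(1.0, signal(t)))      # clip to the valid range
        samples.append(round(value * max_level))    # quantize to an integer
    return samples


def tone(t):
    """A 440 Hz sine wave standing in for microphone input (hypothetical)."""
    return math.sin(2 * math.pi * 440 * t)


pcm = digitize(tone, duration_s=0.01)
print(len(pcm), min(pcm), max(pcm))
```

The resulting list of integers is the kind of digital signal that the later stages (acoustic model, language model) would work from.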


    Recognition (2)

    Acoustic Modeling

    Spoken words: "I think there are..."

    Phonemes: ay th-in-nk-kd dh-eh-r aa-r

    H.M.M.s: 5 state representation

    Speech Engine
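
To make the "5 state representation" concrete, here is a minimal sketch of Viterbi decoding over a five-state hidden Markov model. The slide gives only the number of states, so the rest is assumed for illustration: a left-to-right topology (each state either repeats or advances), discrete observation symbols in place of real acoustic features, and the names `viterbi`, `stay_prob`, and `emit_probs`.

```python
import math

N_STATES = 5                 # the slide's "5 state representation"
NEG_INF = float("-inf")


def log(p):
    """Natural log that maps probability 0 to negative infinity."""
    return math.log(p) if p > 0 else NEG_INF


def viterbi(observations, stay_prob, emit_probs):
    """Most likely state path through a left-to-right HMM (assumed topology).

    Each state either repeats (stay_prob) or advances by one (1 - stay_prob).
    The path must start in state 0 and end in the last state.
    emit_probs[s][o] is the probability that state s emits symbol o.
    """
    T = len(observations)
    score = [[NEG_INF] * N_STATES for _ in range(T)]
    back = [[0] * N_STATES for _ in range(T)]
    score[0][0] = log(emit_probs[0].get(observations[0], 0.0))

    for t in range(1, T):
        for s in range(N_STATES):
            stay = score[t - 1][s] + log(stay_prob)
            move = score[t - 1][s - 1] + log(1 - stay_prob) if s > 0 else NEG_INF
            if move > stay:
                best, back[t][s] = move, s - 1
            else:
                best, back[t][s] = stay, s
            score[t][s] = best + log(emit_probs[s].get(observations[t], 0.0))

    # Trace the best path back from the final state.
    path = [N_STATES - 1]
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path, score[T - 1][N_STATES - 1]


# Toy model: state i strongly prefers the i-th symbol of "abcde".
symbols = "abcde"
emit_probs = [
    {sym: (0.8 if i == s else 0.05) for i, sym in enumerate(symbols)}
    for s in range(N_STATES)
]
path, log_prob = viterbi(list("aabcccde"), stay_prob=0.5, emit_probs=emit_probs)
print(path, log_prob)
```

The returned path shows how many input frames each state absorbed, which is how a model of this kind copes with the same sound being spoken faster or slower.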


    Recognition (3)

    Language Modeling

    Word context

    Word frequency

    Transition possibilities
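
The three ideas on this slide (word context, word frequency, transition possibilities) can be sketched with a simple bigram model, which counts how often each word follows another and uses those counts to score candidate transcriptions. This is one common way to build a language model, not necessarily the one used by the products discussed here; the tiny corpus and the function names are hypothetical.

```python
from collections import Counter, defaultdict


def train_bigram_model(sentences):
    """Count word frequencies and word-to-word transitions."""
    unigrams = Counter()                 # word frequency
    bigrams = defaultdict(Counter)       # transitions: previous word -> next word
    for sentence in sentences:
        words = ["<s>"] + sentence.lower().split()
        unigrams.update(words)
        for prev, word in zip(words, words[1:]):
            bigrams[prev][word] += 1
    return unigrams, bigrams


def transition_prob(bigrams, prev, word):
    """Estimate P(word | prev) from raw counts (no smoothing)."""
    total = sum(bigrams[prev].values())
    return bigrams[prev][word] / total if total else 0.0


def sequence_prob(bigrams, sentence):
    """Probability of a whole word sequence under the bigram model."""
    words = ["<s>"] + sentence.lower().split()
    prob = 1.0
    for prev, word in zip(words, words[1:]):
        prob *= transition_prob(bigrams, prev, word)
    return prob


corpus = [
    "I think there are two",
    "I think their car is red",
    "there are many words",
]
unigrams, bigrams = train_bigram_model(corpus)

# Two candidates that sound alike; context decides between them.
candidates = ["i think there are", "i think their are"]
best = max(candidates, key=lambda c: sequence_prob(bigrams, c))
print(best)
```

Because "their" is never followed by "are" in the toy corpus, the model prefers "there", which is the kind of decision that acoustics alone cannot make for homophones.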


    Voice Training (1)

    Can be done by:

    Predetermined text segments

    Individual words

    Compares new acoustic data with the old and combines them

    More training = better recognition
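
One simple way to picture "compare new with old and combine" is a count-weighted average: the stored acoustic estimate and the new training samples are blended in proportion to how much data each represents. The slide does not describe the actual adaptation method, so this is only a sketch under that assumption; `adapt_mean` and the two-dimensional feature vectors are hypothetical.

```python
def adapt_mean(old_mean, old_count, new_samples):
    """Blend a stored feature mean with new training samples.

    old_mean    -- list of floats, the current estimate for one sound
    old_count   -- number of samples the current estimate is based on
    new_samples -- list of feature vectors from the latest training session
    Returns the combined mean and the updated sample count.
    """
    n_new = len(new_samples)
    if n_new == 0:
        return list(old_mean), old_count
    dims = len(old_mean)
    new_mean = [sum(s[d] for s in new_samples) / n_new for d in range(dims)]
    total = old_count + n_new
    combined = [
        (old_mean[d] * old_count + new_mean[d] * n_new) / total
        for d in range(dims)
    ]
    return combined, total


mean, count = adapt_mean([1.0, 2.0], 8, [[2.0, 4.0], [2.0, 4.0]])
print(mean, count)
```

With this scheme every additional session moves the estimate closer to the user's own voice, which matches the slide's point that more training gives better recognition.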


    Voice Training (2)

    User-specific voice file

    Voice qualities

    Pronunciation

    Patterns of word use

    Preferred vocabulary


    Making Corrections

    Move cursor by voice command

    Memorize edit commands

    List of possible alternatives

    Make correction manually


    Command/Control

    Desktop grid

    Program or Link name/number

    URL name

    Memorized commands
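
A desktop grid can be sketched as a mapping from a spoken cell number to a screen position. The example below assumes a 3 x 3 grid numbered left to right, top to bottom, and moves the pointer to the center of the chosen cell. The grid size, the numbering, and the function name `grid_cell_center` are assumptions; the slide names the feature without describing it.

```python
def grid_cell_center(cell, screen_w, screen_h, rows=3, cols=3):
    """Return the (x, y) pixel at the center of a numbered grid cell.

    Cells are numbered from 1, left to right and top to bottom (assumed).
    """
    if not 1 <= cell <= rows * cols:
        raise ValueError(f"cell must be between 1 and {rows * cols}")
    row, col = divmod(cell - 1, cols)
    cell_w = screen_w / cols
    cell_h = screen_h / rows
    return round((col + 0.5) * cell_w), round((row + 0.5) * cell_h)


# Saying "five" on a 1200 x 900 screen selects the middle cell.
print(grid_cell_center(5, 1200, 900))
```

Applying the same function again inside the selected cell would narrow the target further, which is how a grid can reach any point on the screen with a few spoken numbers.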


    Recent Improvements in SR

    Faster training ~10 min.

    Better recognition ~95%

    More compatible software

    Better system control/command
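
The slide does not say how the ~95% recognition figure was measured. One common way to score a recognizer is word accuracy: count the word insertions, deletions, and substitutions needed to turn the recognized text into the reference text, and divide by the reference length. The sketch below follows that convention as an assumption; the function names and the sample sentence are hypothetical.

```python
def word_edit_distance(reference, hypothesis):
    """Minimum number of word insertions, deletions, and substitutions."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(reference, hypothesis):
    """Word accuracy = 1 - (edit distance / number of reference words)."""
    n_words = len(reference.split())
    if n_words == 0:
        raise ValueError("reference must contain at least one word")
    return 1 - word_edit_distance(reference, hypothesis) / n_words


reference = ("the quick brown fox jumps over the lazy dog and then runs "
             "back home before the sun sets again today")
hypothesis = reference.replace("sun", "son")   # one misrecognized word
print(word_accuracy(reference, hypothesis))
```

One wrong word in twenty gives an accuracy of about 0.95, so a 95% recognizer under this measure still leaves roughly one correction per sentence or two of dictation.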


    Current Software Options for PC

    Dragon Systems NaturallySpeaking

    Philips FreeSpeech

    IBM ViaVoice

    Lernout & Hauspie Voice Xpress


    How well do they work?

    Training, Dictation, Correct., App. Integrat., Command-Control

    Dragon Excellent Excellent Good Good

    Philips Fair Fair Good Good

    IBM Excellent Good Good Excellent

    L & H Good Good Good Good


    Future of SR

    SUI: Speech-based User Interface

    Improvements needed:

    - Greater accuracy

    - Greater system control/command

    - More compatible software


    Conclusion

    SR Uses

    How does it work?

    Current Software

    Problems of SR

    More SR coming soon.

