Upload
f008
View
220
Download
0
Embed Size (px)
Citation preview
8/8/2019 Speech Recognition 1
1/18
1
Speech Recognition
8/8/2019 Speech Recognition 1
2/18
2
Introduction
What is Speech Recognition?
- Voice Recognition? Where can it be used?
- Dictation
- System control/navigation- Commercial/Industrial applications
- Hand held digital recorders
8/8/2019 Speech Recognition 1
3/18
3
Contents:
Continuous/Discrete
How does it work? Recent improvements
Current software options
Future of SR
8/8/2019 Speech Recognition 1
4/18
4
Continuous or Discrete? Continuous speech
- dictation
Discrete speech
- system controls
8/8/2019 Speech Recognition 1
5/18
5
How does SR work? Recognition
Training
Correction
Command/Control
8/8/2019 Speech Recognition 1
6/18
6
Recognition (1)Voice Input Analog to Digital Acoustic Model
Language Model
Display Speech EngineFeedback
8/8/2019 Speech Recognition 1
7/18
7
Recognition (2)
Acoustic Modeling
Spoken words: I think there are..
Phonemes: ay th-in-nk-kd dh-eh-r aa-r
H.M.M.s: 5 state representation
Speech Engine
8/8/2019 Speech Recognition 1
8/18
8
Recognition (3)
Language Modeling
Word context
Word frequency
Transition possibilities
8/8/2019 Speech Recognition 1
9/18
9
Voice Training (1)Can be done by:
Predetermined text segments
Individual words
Compare new acoustic with old and combines
More training = better recognition
8/8/2019 Speech Recognition 1
10/18
10
Voice Training (2)
User specific Voice file
Voice qualities
Pronunciation
Patterns of word use
Preferred vocabulary
8/8/2019 Speech Recognition 1
11/18
11
Making Corrections Move cursor by voice command
Memorize edit commands
List of possible alternatives
Make correction manually
8/8/2019 Speech Recognition 1
12/18
12
Command/Control Desktop grid
Program or Link name/number
URL name
Memorized commands
8/8/2019 Speech Recognition 1
13/18
13
RecentImprovements in SR
Faster training ~10 min.
Better recognition ~95%
More compatible software
Better system control/command
8/8/2019 Speech Recognition 1
14/18
14
Current Software Options for PC Dragon Systems Naturally Speaking
Philips FreeSpeech
IBM ViaVoice
Lernout & Hauspie Voice Xpress
8/8/2019 Speech Recognition 1
15/18
15
How well do the work?Training Dictation
Correct.
App.
Integrat.
Command
- Control
Dragon Excellent Excellent Good Good
Philips Fair Fair Good Good
IBM Excellent Good Good Excellent
L & H Good Good Good Good
8/8/2019 Speech Recognition 1
16/18
16
Future of SR SUI Speech-based UserInterface
Improvements needed:
- Greater accuracy
- Greater system control/command
- More compatible software
8/8/2019 Speech Recognition 1
17/18
17
Conclusion SR Uses
How does it work?
Current Software
Problems of SR
More SR coming soon.
8/8/2019 Speech Recognition 1
18/18
18
References 1. Alwang, Greg. Speech Recognition, PC Magazine, December 1
1999
2. Hauptmann, Alexander G. Jang, Photina Jaeyun. Carnegie Mellon
University. Learning to Recognize Speech by Watching Television,IEEE Intelligent Systems, September/October 1999.
3. Miastkowski, Stan. Latest Speech Software Gets You Up and
Running Faster, PC World, November 1999.