Upload
others
View
9
Download
0
Embed Size (px)
Citation preview
1
Speech & Audio Processing / Part-II
Digital Audio Signal Processing DASP
Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven
[email protected] homes.esat.kuleuven.be/~moonen/
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 2 / 32
Outline
• Aims/scope
• Contents
• Case study: Hearing instruments
• Lectures/Exercise Sessions/Exam
Website/Questions
2
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 3 / 32
Aims/Scope
Aim is 2-fold.. • Speech & audio per se S & A industry in Belgium/Europe
Topics: Noise Reduction / Acoustic Echo & Feedback Cancellation / Active Noise Control / Sound Reproduction / ..etc
• Develop basic signal processing tools/principles which are also used in many other fields
Spatial filter design (for wireless comms) Adaptive filter algorithms, filtered-X LMS (for active vibration control) Kalman filters (for automatic control, navigation, ..) Time-frequency analysis/processing ..etc
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 4 / 32
Outline
• Aims/scope
• Contents
• Case study: Hearing instruments
• Lectures/Exercise Sessions/Exam
Website/Questions
3
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 5 / 32
Contents: Chapter-2
Noise Reduction-I Single-Channel Noise Reduction
• Spectral subtraction methods (spectral filtering) • Iterative methods based on speech modeling (Wiener & Kalman Filters) Applications: smartphones, conferencing, hearing aids, …
desired signal estimate
desired signal s[k]
noise signal(s)
? ][ks⌢y[k] ][][][ knksky +=
desired signal contribution
noise contribution
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 6 / 32
Contents: Chapter-3
Noise Reduction-II
Microphone Array Processing - Fixed Beamforming
Referred to as ‘spatial filtering’ (similar to ‘spectral filtering’)
Applications: smartphones, conferencing, hearing aids, …
),(1 θωY
),(2 θωY
),(1 θωY
)(1 ωF
)(2 ωF
)(ωmF),( θωmY
)(ωMF),( θωMY
md),(1 θωY
)(ωS
θ
Σ),( θωZ
θcosmd
Filter-and-sum beamformer
4
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 7 / 32
Contents: Chapter-4 Noise Reduction-III
Adaptive Beamforming & Multi-Channel Noise Reduction • Adaptive Beamforming
– Known (fixed) speaker position – Unknown (time-varying) noise field
• Multi-channel noise reduction – Wiener filtering approach = spectral+spatial filtering
Applications: smartphones, conferencing, hearing aids, …
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 8 / 32
Acoustic Echo & Feedback Cancellation Adaptive filtering problem: • Non-stationary/wideband/… speech signals • Non-stationary/long/… acoustic channels
Adaptive filtering algorithms AEC Control AEC Post-processing Stereo AEC
Contents: Chapter-5
5
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 9 / 32
Contents: Chapter-5 Acoustic Echo & Feedback Cancellation • Hearing aids, public address (PA) systems • Correlation between filter input (`x ’) and near-end signal (‘ n ’) • Fixes : noise injection, pitch shifting, notch filtering, …
Model
F -
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 10 / 32
Contents: Chapter-6
Active Noise Control • Solution based on `filtered-X LMS’ • Application : active headsets/ear defenders
6
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 11 / 32
Contents: Chapter-7
Distributed Signal Processing in Wireless Acoustic Sensor Networks
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 12 / 32
Guest Lecture: Dr. Enzo De Sena, University of Surrey, UK
‘Sound Field Recording and Reproduction’
Contents: Chapter-8
7
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 13 / 32
Outline
• Aims/scope
• Contents
• Case study: Hearing instruments
• Lectures/Exercise Sessions/Exam
Website/Questions
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 14 / 32
Hearing • Tonotopy of inner ear
Hearing loss • Aging • Exposure to loud sounds
= Cochlea
Low-freq tone
High-freq tone
Neural activity for low-freq tone
Neural activitity for high-freq tone
© w
ww
.cm
.be
Case Study: Hearing Instruments 1/10
8
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 15 / 32
DASP Challenges in hearing instruments (next slides) • Dynamic range compression • Noise reduction • Dereverberation • Acoustic feedback cancellation • Active noise control • Etc.
Technology Challenges in hearing instruments • Small form factor (cfr. user acceptance) • Low power: 1…5mW (cfr. battery lifetime ≈ 1 week) • Low processing delay: 10msec (cfr. synchronization with lip
reading)
Case Study: Hearing Instruments 2/10
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 16 / 32
DASP Challenges: Dynamic range compression Dynamic range & audibility… Normal hearing Hearing impaired `signal dependent amplification’ subjects subjects
Case Study: Hearing Instruments 3/10
Level
100dB
0dB Input Level (dB)
Out
put L
evel
(dB
)
0dB 100dB
0dB
100dB
9
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 17 / 32
• However: Audibility does not imply intelligibility
• Hearing impaired subjects need 5..10dB larger signal-to-noise ratio (SNR) for speech understanding in noisy environments
• Need for noise reduction (=speech enhancement) algorithms: • State-of-the-art: monaural 2-microphone adaptive noise reduction • Near future: binaural noise reduction (2 times 2 microphones) • Not-so-near future: multi-node noise reduction (see below)
Case Study: Hearing Instruments 4/10
SNR
20dB
0dB 30 50 70 90
Hearing loss (dB, 3-freq-average)
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 18 / 32
DASP Challenges: Noise reduction & beamforming Multimicrophone ‘beamforming’, typically with 2 microphones
e.g. ‘directional’ front microphone and ‘omnidirectional’ back microphone
Case Study: Hearing Instruments 5/10
“filter-and-sum” the
microphone signals
Chapter 3-4
10
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 19 / 32
Future: Multi-node noise reduction in sensor networks/Internet-of-Things
Case Study: Hearing Instruments 6/10
Chapter 7
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 20 / 32
DASP Challenges: Dereverberation • Reverb = filtering effect (‘echo-ing’) of acoustic channel in
between speaker and microphone(s)
• Reverb has an impact on speech understanding
• Dereverberation = undo filtering by acoustic channel (e.g. ‘inverse filtering’)
Case Study: Hearing Instruments 7/10
11
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 21 / 32
DASP Challenges: Acoustic feedback cancellation • Problem statement: Loudspeaker signal is fed back into microphone,
then amplified and played back again • Closed loop system may become unstable (howling) • Similar to feedback problem in public address systems (for the
musicians amongst you)
Case Study: Hearing Instruments 8/10
Model
F
-
Similar to echo cancellation in GSM handsets, Skype,… but more difficult due to signal correlation
Chapter 5
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 22 / 32
Chapter 6
DASP Challenges: Active noise control (ANC) ANC to counteract noise leakage & occlusion effect, exploiting additional ‘internal’ reference microphone
Case Study: Hearing Instruments 9/10 ©
S.D
oclo
/Uni
v.O
lden
burg
12
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 23 / 32
Case Study: Hearing Instruments 10/10
DASP Challenges: …piecing things together
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 24 / 32
Outline
• Aims/scope
• Contents
• Case study: Hearing instruments
• Lectures/Exercise Sessions/Exam
Website/Questions
13
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 25 / 32
Lectures
Lectures: 1 Intro (count as half) + 7 Lectures PS: Time budget = 7.5*(2hrs)*4 = 60 hrs
Course Material: Slides – Use version 2019-2020 ! – Download from DASP webpage
homes.esat.kuleuven.be/~dspuser/dasp/ – Master copy available @ ESAT B00.10 (Ida Tassens)
Course Prerequisite: DSP-CIS H05F3A/H05F1A (filter design, filter banks, optimal & adaptive filters)
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 26 / 32
Literature
Literature (general) (available in DSP-CIS library) • Simon Haykin `Adaptive Filter Theory’ (Prentice Hall 1996) • P.P. Vaidyanathan `Multirate Systems and Filter Banks’ (Prentice Hall 1993) Literature (specialized) (available in DSP-CIS library) • S.L. Gay & J. Benesty `Acoustic Signal Processing for Telecommunication’ (Kluwer 2000) • M. Kahrs & K. Brandenburg (Eds) `Applications of Digital Signal Processing to Audio and Acoustics’ (Kluwer1998) • B. Gold & N. Morgan `Speech and Audio Signal Processing’ (Wiley 2000)
14
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 27 / 32
Exercise Sessions/Project Acoustic source localization
– Direction-of-arrival estimation (‘MUSIC’ Algorithm) – Noise reduction (‘DoA informed beamforming’) – Binaural synthesis and 3D audio – …all in a simulated set-up
Direction-of-arrival θ
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 28 / 32
..be there !
PS: groups of 2
Exercise Sessions/Project
• Runs over 4 weeks (non-consecutive) • Each week
– 1 PC/Matlab session (supervised, 2.5hrs) – 2 ‘Homework’ sesions (unsupervised, 2*2.5hrs)
PS: Time budget = 4*(2.5hrs+5hrs) = 30 hrs • ‘Deliverables’ after week 2 & 4 • Grading: based on deliverables, evaluated during sessions
• TAs: guiliano.bernardi@esat (English+Italian)
randall.ali@esat (English+Spanish)
15
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 29 / 32
• Oral exam, with preparation time • Open book • Grading
7 for question-1 7 for question-2 +6 for project ___ = 20
Exam
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 30 / 32
• Oral exam, with preparation time • Open book • Grading
7 for question-1 7 for question-2 +6 for question-3 (related to project work) ___ = 20
Retake Exam
16
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 31 / 32
Website
1) TOLEDO 2) http://homes.esat.kuleuven.be/~dspuser/dasp/ • Contact: guiliano.bernardi@esat • Slides (use version 2018 !!) • Schedule • DSP-library • FAQs (send questions to marc.moonen@esat)
Digital Audio Signal Processing Version 2019-2020 Chapter-1: Introduction 32 / 32
Questions?
1) Ask teaching assistant (during exercises sessions)
2) E-mail questions to teaching assistant or marc.moonen@esat 3) Make appointment marc.moonen@esat ESAT Room B.00.14