Inaudible Voice Commands: The Long-Range Attack and Defense University of Illinois at Urbana-Champaign Nirupam Roy Haitham Hassanieh Romit Roy Choudhury Sheng Shen

Inaudible Voice Commands: The Long-Range Attack and Defense · 2019. 5. 13. · Inaudible Voice Commands: The Long-Range Attack and Defense University of Illinois at Urbana-Champaign


Page 1:

Inaudible Voice Commands: The Long-Range Attack and Defense

University of Illinois at Urbana-Champaign

Nirupam Roy, Haitham Hassanieh, Romit Roy Choudhury, Sheng Shen

Page 2:

50 million voice assistants are sold in the US

Page 3:

Inaudible Acoustics

Normal Sound (< 24 kHz)

Ultrasound (> 25 kHz): “Inaudible Acoustics”

“Alexa, open the garage door!” Ok

Page 4:

Inaudible Acoustics

Page 5:

Talk Outline

0. [BackDoor] (MobiSys’17, Best Paper), [DolphinAttack] (CCS’17), [Princeton Video] (arXiv)

Pages 6-7:

Talk Outline

Today’s Talk:

0. [BackDoor], [DolphinAttack], [Princeton Video]

1. How to launch long-range (realistic) attacks?

2. How to defend against these attacks?

Pages 8-10:

Microphone pipeline: Diaphragm → Amplifier → Filter → ADC

The diaphragm converts air vibration into electric voltage.

[Figure: microphone frequency spectrum with the microphone filter marked. x-axis: frequency, 10k to 100k; y-axis: amplitude.]

Pages 11-13:

Diaphragm and amplifier: input versus output

Linear: $V_{out} = a_1 V_{in}$

Nonlinear: $V_{out} = a_1 V_{in} + a_2 V_{in}^2 + \ldots$

The slides single out the second-order term $a_2 V_{in}^2$.

[Figure: input and output spectra for the linear and nonlinear cases. x-axis: frequency, 10k to 100k.]
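As a small worked example of what the second-order term does, assume a single-tone input $V_{in} = \sin(2\pi f t)$. This input is an illustrative assumption and does not come from the slides.

```latex
\begin{aligned}
V_{out} &= a_1 \sin(2\pi f t) + a_2 \sin^2(2\pi f t) \\
        &= a_1 \sin(2\pi f t) + \frac{a_2}{2} - \frac{a_2}{2}\cos\bigl(2\pi (2f)\, t\bigr)
\end{aligned}
```

Under this assumption the output contains a constant offset and a component at 2f, neither of which was present in the input.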

Pages 15-16:

[Figure: microphone spectrum divided into audible and inaudible regions, with the microphone filter marked. Page 16 adds two tones, F1 and F2. x-axis: frequency, 10k to 100k; y-axis: amplitude.]

Pages 17-20:

$V_{out} = a_1 V_{in} + a_2 V_{in}^2$

For two tones F1 and F2, the second-order term expands as:

$(\sin F_1 + \sin F_2)^2 = 1 - \tfrac{1}{2}\cos 2F_1 - \tfrac{1}{2}\cos 2F_2 - \cos(F_1 + F_2) + \cos(F_1 - F_2)$

New components appear at 2F1, 2F2, (F1+F2), and (F1-F2).

[Figure: spectrum with F1, F2, 2F1, 2F2, (F1+F2), and (F1-F2) marked against the audible and inaudible regions and the microphone filter. x-axis: frequency, 10k to 100k; y-axis: amplitude.]
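The expansion can be checked numerically. The sketch below is only an illustration: the tone frequencies (50 kHz and 40 kHz), the coefficients `A1` and `A2`, and the simulation rate are all assumed values, not figures from the talk. It passes two tones through the second-order model and measures the amplitude at each predicted frequency with a single-bin DFT.

```python
import math

RATE_HZ = 400_000               # assumed simulation rate, above twice the 100 kHz axis limit
N = 4_000                       # 10 ms, so every tone below lands on an exact DFT bin
F1_HZ, F2_HZ = 50_000, 40_000   # hypothetical ultrasonic tones
A1, A2 = 1.0, 0.1               # assumed coefficients of the second-order model


def tone_amplitude(samples, freq_hz):
    """Amplitude of one sinusoidal component, measured with a single-bin DFT."""
    step = 2 * math.pi * freq_hz / RATE_HZ
    re = sum(s * math.cos(step * k) for k, s in enumerate(samples))
    im = sum(s * math.sin(step * k) for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / len(samples)


def microphone(v_in):
    """Second-order model from the slides: Vout = a1*Vin + a2*Vin^2."""
    return A1 * v_in + A2 * v_in ** 2


v_in = [
    math.sin(2 * math.pi * F1_HZ * k / RATE_HZ)
    + math.sin(2 * math.pi * F2_HZ * k / RATE_HZ)
    for k in range(N)
]
v_out = [microphone(v) for v in v_in]

# Probe the input tones, the four predicted products, and one frequency
# (20 kHz) where the expansion predicts nothing.
probes_hz = (10_000, 20_000, 40_000, 50_000, 80_000, 90_000, 100_000)
spectrum = {f: tone_amplitude(v_out, f) for f in probes_hz}

for f, amp in spectrum.items():
    print(f"{f // 1000:>3} kHz: {amp:.3f}")
```

With these assumed values, the only product that falls in the audible range is the difference tone at F1 - F2 = 10 kHz. The sum and double-frequency products sit at 80 kHz and above.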

Page 21:

[Figure: spectrum with F1, F2, and (F1-F2) marked against the audible and inaudible regions and the microphone filter. x-axis: frequency, 10k to 100k; y-axis: amplitude.]

Pages 22-23:

V(t) = “Alexa, open the garage door!”

Ok

[Figure: spectrum with F2 marked against the audible and inaudible regions and the microphone filter. x-axis: frequency, 10k to 100k; y-axis: amplitude.]
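One way to picture how a command V(t) carried on an ultrasonic tone ends up in the audible band is the sketch below. It is a simplified illustration under stated assumptions: a single 1 kHz tone stands in for the voice, the carrier is placed at 40 kHz, the transmitted signal is taken to be (V(t) + c) multiplied by the carrier, and `A1`, `A2`, and `C` are made-up constants. None of these values come from the talk.

```python
import math

RATE_HZ = 400_000      # assumed simulation rate
N = 4_000              # 10 ms, so all tones below land on exact DFT bins
CARRIER_HZ = 40_000    # hypothetical ultrasonic carrier standing in for F2
VOICE_HZ = 1_000       # a single tone standing in for the voice signal V(t)
C = 1.0                # assumed constant added before modulation
A1, A2 = 1.0, 0.1      # assumed microphone coefficients


def tone_amplitude(samples, freq_hz):
    """Amplitude of one sinusoidal component, measured with a single-bin DFT."""
    step = 2 * math.pi * freq_hz / RATE_HZ
    re = sum(s * math.cos(step * k) for k, s in enumerate(samples))
    im = sum(s * math.sin(step * k) for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / len(samples)


def voice(k):
    """Stand-in for V(t): one audible tone."""
    return math.sin(2 * math.pi * VOICE_HZ * k / RATE_HZ)


# What the attacker's speaker plays: the voice shifted up to the carrier.
transmitted = [
    (voice(k) + C) * math.sin(2 * math.pi * CARRIER_HZ * k / RATE_HZ)
    for k in range(N)
]

# What the microphone's second-order nonlinearity produces from it.
recorded = [A1 * s + A2 * s ** 2 for s in transmitted]

audible_before = tone_amplitude(transmitted, VOICE_HZ)   # nothing audible in the air
audible_after = tone_amplitude(recorded, VOICE_HZ)       # voice tone reappears
distortion = tone_amplitude(recorded, 2 * VOICE_HZ)      # residue of the V(t)^2 term

print(f"1 kHz in transmitted signal: {audible_before:.6f}")
print(f"1 kHz in recorded signal:    {audible_after:.6f}")
print(f"2 kHz in recorded signal:    {distortion:.6f}")
```

In this toy setup the transmitted signal has no energy at the voice frequency, while the recorded signal does. The extra component at twice the voice frequency comes from the squared voice term.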

Page 24:

Attack range: 3-5 ft

Pages 25-27:

Can someone attack from a longer range?

High power makes ultrasonic speakers audible

Page 28:

Speakers have nonlinearity too!

Pages 29-33:

Speaker Nonlinearity

[Figure: amplitude-versus-frequency plots relating the voice command to the speaker output, with the output divided into audible and inaudible regions. x-axis: frequency, 10k to 90k; y-axis: amplitude.]

General to all speakers!

Our Solution: “Leakage Optimization”

Pages 34-38:

Speaker Nonlinearity: Audible Leakage

Leakage term: $V(-f) * V(f)$

[Figure: speaker input and speaker output spectra. The label “Bandwidth: B” appears on both plots. x-axis: frequency, 10k to 100k; y-axis: amplitude.]

Chopping compresses the leakage band
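The effect of chopping can be illustrated with a toy simulation. This is a sketch under assumptions: the ultrasonic band is represented by five discrete tones 1 kHz apart, the speaker is modeled with the same second-order form used for the microphone, and the coefficients and slice boundaries are invented for illustration.

```python
import math

RATE_HZ = 400_000
N = 4_000                      # 10 ms, so every multiple of 100 Hz is an exact DFT bin
A1, A2 = 1.0, 0.1              # assumed speaker coefficients
AUDIBLE_PROBES_HZ = range(1_000, 11_000, 1_000)


def tone_amplitude(samples, freq_hz):
    """Amplitude of one sinusoidal component, measured with a single-bin DFT."""
    step = 2 * math.pi * freq_hz / RATE_HZ
    re = sum(s * math.cos(step * k) for k, s in enumerate(samples))
    im = sum(s * math.sin(step * k) for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / len(samples)


def audible_leakage(tone_freqs_hz, floor=1e-3):
    """Audible probe frequencies emitted by a nonlinear speaker driven with the given tones."""
    drive = [
        sum(math.sin(2 * math.pi * f * k / RATE_HZ) for f in tone_freqs_hz)
        for k in range(N)
    ]
    emitted = [A1 * x + A2 * x ** 2 for x in drive]
    return [f for f in AUDIBLE_PROBES_HZ if tone_amplitude(emitted, f) > floor]


# One speaker playing the whole band (hypothetical bandwidth B = 4 kHz).
full_band = [40_000, 41_000, 42_000, 43_000, 44_000]
leak_full = audible_leakage(full_band)

# The same band chopped into two slices, each played by its own speaker.
leak_slice_a = audible_leakage(full_band[:2])
leak_slice_b = audible_leakage(full_band[2:])

print("full band leaks at:", leak_full)
print("slice A leaks at:  ", leak_slice_a)
print("slice B leaks at:  ", leak_slice_b)
```

In this model the leakage of each speaker reaches only as high as the width of the slice it plays, so narrower slices keep the leakage closer to 0 Hz.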

Page 39:

[Figure: speaker input and speaker output spectra. x-axis: frequency, 10k to 100k; y-axis: amplitude.]

[Figure: Threshold of Hearing Curve. x-axis: frequency in Hz, 15.6 to 16000; y-axis: sound pressure level in dB, -20 to 80. Curves: human hearing threshold $T(f)$ and leakage from speaker $L(f)$, with the minimum gap between them marked.]

$\text{Maximize} \quad \min_f \, [\, T(f) - L(f) \,]$

$\text{subject to} \quad f_0 \le f_1 \le f_2 \le \ldots \le f_N$
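A brute-force version of this max-min problem can be sketched as follows. Everything numeric here is a placeholder: the hearing-threshold points are coarse illustrative values rather than data from the slide, the per-bin power list is invented, and the leakage model (flat from 0 Hz up to the slice width, with a level set by the slice power) is an assumption made to keep the example small. It is not the optimization procedure used in the talk.

```python
import itertools
import math

# Coarse, illustrative hearing-threshold points (Hz, dB); not read off the slide.
THRESHOLD_DB = [(20, 78), (50, 44), (100, 27), (200, 14), (500, 4), (1000, 2)]

BIN_HZ = 50                                             # hypothetical bin width
BIN_POWER = [8.0, 6.0, 4.0, 2.0, 1.0, 1.0, 0.5, 0.5]    # hypothetical relative power per bin


def hearing_threshold_db(freq_hz):
    """T(f): piecewise-linear in log-frequency, clamped outside the table."""
    pts = THRESHOLD_DB
    if freq_hz <= pts[0][0]:
        return pts[0][1]
    for (f_lo, t_lo), (f_hi, t_hi) in zip(pts, pts[1:]):
        if freq_hz <= f_hi:
            w = math.log(freq_hz / f_lo) / math.log(f_hi / f_lo)
            return t_lo + w * (t_hi - t_lo)
    return pts[-1][1]


def min_gap_db(bin_power, bin_hz, boundaries):
    """min over f of T(f) - L(f), for slice boundaries f_0..f_N given as bin indices."""
    gaps = []
    for lo, hi in zip(boundaries, boundaries[1:]):
        width_hz = (hi - lo) * bin_hz
        leak_db = 10 * math.log10(sum(bin_power[lo:hi]))   # assumed leakage level L
        # Leakage is assumed flat over [0, width]; T(f) falls with f across
        # this table, so the gap is tightest at f = width.
        gaps.append(hearing_threshold_db(width_hz) - leak_db)
    return min(gaps)


def best_boundaries(bin_power, bin_hz, n_slices):
    """Try every ordered set of boundaries and keep the one with the largest minimum gap."""
    n_bins = len(bin_power)
    candidates = (
        (0, *inner, n_bins)
        for inner in itertools.combinations(range(1, n_bins), n_slices - 1)
    )
    return max(candidates, key=lambda b: min_gap_db(bin_power, bin_hz, b))


best = best_boundaries(BIN_POWER, BIN_HZ, n_slices=3)
print("boundaries (bin indices):", best)
print("minimum gap in dB:", round(min_gap_db(BIN_POWER, BIN_HZ, best), 2))
```

Exhaustive search is only workable for a handful of bins and slices. It is used here because it makes the objective and the ordering constraint easy to read.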

Pages 40-42:

[Figure: speaker input and speaker output spectra, with the speaker output labeled inaudible. Page 42 marks F2 three times and adds the microphone filter and a third plot, the microphone output. x-axis on each plot: frequency, 10k to 100k; y-axis: amplitude.]

Page 43:

Evaluation

Page 44:

Inaudible voice commands: Long range

25 feet

Speaker array running leakage optimization

Pages 45-46:

Evaluation

[Figures: wake-word hit rate; command detection accuracy.]

Page 47:

Evaluation

[Figure: maximum activation distance for different input power.]

Pages 48-49:

Talk Outline

Today’s Talk:

0. [BackDoor], [DolphinAttack], [Princeton Video]

1. How to launch long-range (realistic) attacks?

2. How to defend against these attacks?

Pages 50-51:

Core Question: Is this a “non-linear signal” or a normally recorded signal?

Inaudible Voice Attack:

$\left( v(t) \sin \omega_1 t + c \, \sin \omega_1 t \right)^2 = v(t) + c' v^2(t) + \cdots$

Here $v(t)$ is the voice signal and $c' v^2(t) + \cdots$ is the nonlinear distortion.

[Figure: amplitude versus frequency, 10k to 70k.]

Page 52:

Difficult to decouple “voice signal” and “non-linear signal”

Human voice signals present opportunities …

Signal Forensics

[Figure: amplitude versus frequency, 10k to 70k.]

Pages 53-54:

Opportunity #1: Voice > 50 Hz

[Figure: spectrum of human voice with components at f, 2f, 3f, and 4f. Page 54 marks energy at the sub-50 Hz band. x-axis: frequency; y-axis: amplitude.]
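A toy check of this first cue might look like the following. The voice model (a 120 Hz tone whose loudness varies at 5 Hz), the distortion strength `C`, and the sampling rate are assumptions chosen for illustration. The attack recording is modeled as v(t) plus a scaled v²(t) term, following the form given earlier in the deck.

```python
import math

RATE_HZ = 2_000      # assumed sampling rate for this toy example
N = 2_000            # one second, so DFT bins are 1 Hz apart
PITCH_HZ = 120       # hypothetical voice fundamental, above 50 Hz
SYLLABLE_HZ = 5      # hypothetical slow loudness variation
C = 0.3              # assumed strength of the v^2 term


def tone_amplitude(samples, freq_hz):
    """Amplitude of one sinusoidal component, measured with a single-bin DFT."""
    step = 2 * math.pi * freq_hz / RATE_HZ
    re = sum(s * math.cos(step * k) for k, s in enumerate(samples))
    im = sum(s * math.sin(step * k) for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / len(samples)


def sub_50hz_power(samples):
    """Power summed over the 1..49 Hz bins (DC is skipped as a simplification)."""
    return sum(tone_amplitude(samples, f) ** 2 / 2 for f in range(1, 50))


def voice(k):
    """Toy voice: a pitch tone with a slowly varying envelope."""
    t = k / RATE_HZ
    envelope = 1 + 0.5 * math.sin(2 * math.pi * SYLLABLE_HZ * t)
    return envelope * math.sin(2 * math.pi * PITCH_HZ * t)


legitimate = [voice(k) for k in range(N)]
attack = [v + C * v ** 2 for v in legitimate]   # recorded attack: v + c' v^2

legit_power = sub_50hz_power(legitimate)
attack_power = sub_50hz_power(attack)

print(f"sub-50 Hz power, legitimate: {legit_power:.6f}")
print(f"sub-50 Hz power, attack:     {attack_power:.6f}")
```

In this model the squared term turns the slow loudness variation of the voice into real signal content below 50 Hz, which the undistorted voice does not have.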

Page 55:

Opportunity #2: Correlation

[Figure: two amplitude-versus-time plots, “Energy variation in $v(t)$” and “Energy variation in $v^2(t)$”, with the correlation between them marked.]
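One possible way to turn this cue into a number is sketched below. The framing scheme, the toy voice model, and the low-frequency hum added to the legitimate recording are all assumptions for illustration, and the talk does not specify how the correlation is computed. Per 25 ms frame, the code takes the frame mean as the slow component and the variance around that mean as the voice energy, then correlates the two sequences.

```python
import math

RATE_HZ = 2_000
N = 2_000                        # one second of samples
FRAME = 50                       # 25 ms frames: exactly three periods of the 120 Hz pitch
PITCH_HZ, SYLLABLE_HZ = 120, 5   # hypothetical toy voice parameters
C = 0.3                          # assumed strength of the v^2 term
HUM_HZ, HUM_AMP = 3, 0.2         # hypothetical low-frequency noise unrelated to the voice


def voice(k):
    """Toy voice: a pitch tone with a slowly varying envelope."""
    t = k / RATE_HZ
    envelope = 1 + 0.5 * math.sin(2 * math.pi * SYLLABLE_HZ * t)
    return envelope * math.sin(2 * math.pi * PITCH_HZ * t)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def forensic_correlation(recording):
    """Correlate the slow (frame-mean) component with the per-frame voice energy."""
    slow, energy = [], []
    for start in range(0, len(recording), FRAME):
        frame = recording[start:start + FRAME]
        mean = sum(frame) / len(frame)
        slow.append(mean)
        energy.append(sum((x - mean) ** 2 for x in frame) / len(frame))
    return pearson(slow, energy)


v = [voice(k) for k in range(N)]
attack = [x + C * x ** 2 for x in v]
legitimate = [
    x + HUM_AMP * math.sin(2 * math.pi * HUM_HZ * k / RATE_HZ)
    for k, x in enumerate(v)
]

attack_corr = forensic_correlation(attack)
legit_corr = forensic_correlation(legitimate)

print(f"attack correlation:     {attack_corr:.3f}")
print(f"legitimate correlation: {legit_corr:.3f}")
```

The idea is that in the attack recording the slow component is produced by the voice itself through the squared term, so it tracks the voice energy. Unrelated low-frequency noise in a legitimate recording does not.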

Page 56:

Opportunity #3: Amplitude Skewness

[Figure: amplitude versus frequency, 10k to 70k, illustrating how the added nonlinear term produces amplitude skew.]
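The skew cue can be illustrated with the standard third-moment skewness statistic. As before, the toy voice model and the distortion strength `C` are assumptions, and the talk does not state which skewness measure it uses.

```python
import math

RATE_HZ = 2_000
N = 2_000                        # one second of samples
PITCH_HZ, SYLLABLE_HZ = 120, 5   # hypothetical toy voice parameters
C = 0.3                          # assumed strength of the v^2 term


def voice(k):
    """Toy voice: a pitch tone with a slowly varying envelope."""
    t = k / RATE_HZ
    envelope = 1 + 0.5 * math.sin(2 * math.pi * SYLLABLE_HZ * t)
    return envelope * math.sin(2 * math.pi * PITCH_HZ * t)


def skewness(samples):
    """Third standardized moment of the sample amplitudes."""
    n = len(samples)
    mean = sum(samples) / n
    m2 = sum((x - mean) ** 2 for x in samples) / n
    m3 = sum((x - mean) ** 3 for x in samples) / n
    return m3 / m2 ** 1.5


legitimate = [voice(k) for k in range(N)]
attack = [v + C * v ** 2 for v in legitimate]   # v^2 is never negative

legit_skew = skewness(legitimate)
attack_skew = skewness(attack)

print(f"skew, legitimate: {legit_skew:.3f}")
print(f"skew, attack:     {attack_skew:.3f}")
```

Because v²(t) is never negative, adding it pushes the amplitude distribution toward positive values. The symmetric toy voice has a skew near zero, while the distorted version is clearly positive.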

Page 57:

[Figure: scatter plot separating attack voice from legitimate (real) voice. Axes: sub-50 Hz power, correlation, amplitude skew.]

5000 Test Cases

Page 58:

Overall Detection Accuracy

Page 59:

To summarize…

Inaudible Acoustics (> 25 kHz): “Alexa, open the garage door!” Ok

Ok

Ok