80
1 Pitch Analysis for Active Music Discovery 1 Music and Audio Research Laboratory 2 Center for Urban Science and Progress New York University, USA Thursday June 23 rd 2016 Justin Salamon 1,2 @justin_salamon [email protected] www.justinsalamon.com Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar, Slim Essid, Emilia Gómez, Paco Gómez, Sankalp Gulati, Helga Jiang, Edith Law, Matthias Mauch, Joaquin Mora, Sergio Oramas, Geoffroy Peeters, Aggelos Pikrakis, Axel Röbel, Bruno Rocha, Joan Serrà, Xavier Serra, Tim Tse, Alex Williams

Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

1

Pitch Analysis for Active Music Discovery

1Music and Audio Research Laboratory2Center for Urban Science and Progress

New York University, USA

Thursday June 23rd 2016

Justin Salamon1,2

@justin_salamon [email protected] www.justinsalamon.com

Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar, Slim Essid, Emilia Gómez, Paco Gómez, Sankalp Gulati, Helga Jiang, Edith Law, Matthias Mauch, Joaquin Mora, Sergio Oramas, Geoffroy Peeters, Aggelos Pikrakis, Axel Röbel, Bruno Rocha, Joan Serrà, Xavier Serra, Tim Tse, Alex Williams

Page 2: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

In A Nutshell 2/35

Page 3: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

In A Nutshell

‣  GOAL: active music discovery

3/35

Page 4: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

In A Nutshell

‣  GOAL: active music discovery

‣  MEANS: pitch content analysis

4/35

Page 5: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

In A Nutshell

‣  GOAL: active music discovery

‣  MEANS: pitch content analysis

‣  CHALLENGE: data scarcity (and solution!)

5/35

Page 6: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Music Discovery

Page 7: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Music Recommendation 7/35

Page 8: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Music Recommendation 8/35

Page 9: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

9/35

Page 10: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

10/35Song A

Song B

Versions?

Yes! Similarity = 0.92

COVER SONG ID

(Salamon, Serrà & Gómez, 2013)

Page 11: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

11/35Song A

Song B

Versions?

Yes! Similarity = 0.92

COVER SONG ID

(Salamon, Serrà & Gómez, 2013)

Query-by-humming

(Salamon, Serrà & Gómez, 2013)

Page 12: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

12/35Song A

Song B

Versions?

Yes! Similarity = 0.92

COVER SONG ID

(Salamon, Serrà & Gómez, 2013)

Query-by-humming

(Salamon, Serrà & Gómez, 2013)

melodic pattern discovery

(Pikrakis et al., 2012)

Page 13: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

13/35Song A

Song B

Versions?

Yes! Similarity = 0.92

COVER SONG ID

(Salamon, Serrà & Gómez, 2013)

Query-by-humming

(Salamon, Serrà & Gómez, 2013)

100 150 200 250 300 350 400

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Frequency bins (1 bin = 10 cents), Ref: 55Hz

Norm

aliz

ed s

alie

nce

Multipitch Histogram

ρ2

ρ4 ρ3

ρ5

Tonic ID

(Salamon, Gulati & Serra, 2012)

melodic pattern discovery

(Pikrakis et al., 2012)

Page 14: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Active Search & Discovery

14/35Song A

Song B

Versions?

Yes! Similarity = 0.92

COVER SONG ID

(Salamon, Serrà & Gómez, 2013)

Query-by-humming

(Salamon, Serrà & Gómez, 2013)

5.6 5.8 6 6.2 6.4 6.60

0.1

0.2

0.3

0.4

0.5

Mean vibrato rate (vr:mean*)

Me

an

vib

rato

co

ve

rag

e (

v c:me

an

*)

Flamenco

Inst. jazz

Opera

Pop

Vocal jazz

SINGING STYLE classification

(Salamon, Rocha & Gómez, 2012)

100 150 200 250 300 350 400

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Frequency bins (1 bin = 10 cents), Ref: 55Hz

Norm

aliz

ed s

alie

nce

Multipitch Histogram

ρ2

ρ4 ρ3

ρ5

Tonic ID

(Salamon, Gulati & Serra, 2012)

melodic pattern discovery

(Pikrakis et al., 2012)

Page 15: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Machine Learning for Pitch Analysis

Page 16: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

16/35Melody Extraction

Page 17: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

17/35Melody Extraction

Page 18: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

18/35Melody Extraction

Page 19: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

19/35Melodia (Salamon & Gómez, 2012)

Page 20: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

20/35Melodia (Salamon & Gómez, 2012)

Contour Extraction Melody f0Contour Selection

Page 21: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

21/35Melodia (Salamon & Gómez, 2012)

Melody f0Contour Extraction Contour Selection

Page 22: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

22/35Melodia (Salamon & Gómez, 2012)

Melody f0Contour Extraction Contour Selection

Page 23: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

23/35Melodia (Salamon & Gómez, 2012)

Melody f0Contour Extraction Contour Selection

Page 24: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

24/35Melodia (Salamon & Gómez, 2012)

Melody f0Contour Extraction Contour Selection

Page 25: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

25/35Melodia (Salamon & Gómez, 2012)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Contour Extraction Contour Selection

Page 26: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

26/35Melodia (Salamon & Gómez, 2012)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Contour Extraction Contour Selection

Page 27: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

27/35Melodia (Salamon & Gómez, 2012)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Contour Extraction Contour Selection

?

Page 28: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Melodia (Salamon & Gómez, 2012)

Page 29: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Melodia (Salamon & Gómez, 2012)

vibrato

pitch deviation

length

pitch mean

salience

rate extent

Page 30: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

1000 2000 3000 4000 5000 60000

0.1

0.2

(a)Pitch mean distribution (cents)

0 100 200 300 400 5000

0.4

0.8

(b)Pitch standard deviation distribution (cents)

0 1 2 30

0.075

0.15

(c)Contour mean salience distribution

0 1 2 3 4 50

0.15

0.3

(d)Contour salience standard deviation distribution

0 2 4 6 8 100

0.4

0.8

(e)Contour total salience distribution

0 2 4 60

0.4

0.8

(f)Contour length distribution (seconds)

Melodia (Salamon & Gómez, 2012)

Page 31: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

31/35

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Melodia (Salamon & Gómez, 2012)

Contour Extraction Contour Selection

Page 32: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

32/35

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Melodia (Salamon & Gómez, 2012)

Contour Extraction Contour Selection

Page 33: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

33/35

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Melodia (Salamon & Gómez, 2012)

Contour Extraction Contour Selection

MIREX: 75%MedleyDB: 57%

Page 34: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

34/35Contour Classification (Salamon, Peeters & Röbel, 2012)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Melody contour features

Accompaniment contour features

Contour Extraction Contour Selection

Page 35: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

35/35Contour Classification (Bittner et al., 2015)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Contour Extraction Contour Selection

Page 36: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

36/35Contour Classification (Bittner et al., 2015)

Melody f0

Audio signal

Sinusoid extraction

Salience function

Pitch contour creation

Melody selection

Iterative

Melody f0 sequence

Spectral peaks

Time-frequency salience

Pitch contours

Bin salience mapping with harmonic weighting

Contour characterisation

Peak streaming Peak filtering

Frequency/amplitude correction

Spectral transform

Equal loudness filter

Melody peak selection

Pitch outlier removal

Melody pitch mean

Octave error removal

Voicing detection

Melody pitch mean

Contour Extraction Contour Selection

Page 37: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

37/35Contour Classification (Bosch et al., 2016)

Melody f0Contour Extraction Contour Selection

Page 38: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

38/35Contour Classification (???, 2017)

Melody f0Contour Extraction Contour Selection

?

Page 39: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

39/35Contour Classification (???, 2017)

Melody f0Deep Melody Net

?

Page 40: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Scarcity

Page 41: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Continuous Melody f0 Annotation

Page 42: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Continuous Melody f0 Annotation

Page 43: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Continuous Melody f0 Annotation

MonophonicPitch Tracker

Page 44: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Continuous Melody f0 Annotation

MonophonicPitch Tracker

Tony (Mauch et al., 2015)

Page 45: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

Page 46: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

Page 47: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop

Page 48: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop

Page 49: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop‣  MedleyDB (Bittner et al., 2014):

‣  7.5 hr (108 songs), varied

Page 50: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop‣  MedleyDB (Bittner et al., 2014):

‣  7.5 hr (108 songs), varied‣  MedleyDBv2 (coming soon):

‣  ~17hr (~240 songs), varied

Page 51: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop‣  MedleyDB (Bittner et al., 2014):

‣  7.5 hr (108 songs), varied‣  MedleyDBv2 (coming soon):

‣  ~17hr (~240 songs), varied

~50 annotator-hours108 songs

Page 52: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop‣  MedleyDB (Bittner et al., 2014):

‣  7.5 hr (108 songs), varied‣  MedleyDBv2 (coming soon):

‣  ~17hr (~240 songs), varied

~50 annotator-hours108 songs 20 million songs

Page 53: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Datasets for Melody Extraction Eval

‣  MIREX‣  ADC2004: 6 minutes (20 excerpts)‣  MIREX05: 12 minutes (25 excerpts)

‣  MIR1K: 2.2 hr (1000 excerpts), C-Pop‣  RWC-Pop: 6.8 hr (100 songs), J-Pop‣  MedleyDB (Bittner et al., 2014):

‣  7.5 hr (108 songs), varied‣  MedleyDBv2 (coming soon):

‣  ~17hr (~240 songs), varied

~50 annotator-hours108 songs

~1057 annotator-years20 million songs

Page 54: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Scarcity

Page 55: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data ScarcityOvercoming

Page 56: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Crowdsourcing Melody Note Annotations

Page 57: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Crowdsourcing Melody Note Annotations

Ensemble (Tse et al., 2016)

Page 58: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Crowdsourcing Melody Note Annotations

Ensemble (Tse et al., 2016)

Page 59: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Page 60: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Page 61: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

MonophonicPitch Tracker

Page 62: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

Page 63: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

Page 64: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

Page 65: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

Synthesis

Page 66: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

SynthesisMixing

Page 67: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

SynthesisMixing

Page 68: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Data Augmentation: f0 Annotation-by-Synthesis

Cleaning +Smoothing

MonophonicPitch Tracker

SinusoidalModelling

SynthesisMixing

Page 69: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Algorithm 1

Algorithm 2

Algorithm 3

Metrics: A B C D E

Original data + manual annotations

Synthesized data + automatic annotations

Data Augmentation: f0 Annotation-by-Synthesis

Page 70: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary 70/35

Page 71: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval

71/35

Page 72: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…

72/35

Page 73: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

73/35

Page 74: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level

74/35

Page 75: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models

75/35

Page 76: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models‣  Solutions:

76/35

Page 77: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models‣  Solutions:

‣  Crowdsourcing

77/35

Page 78: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models‣  Solutions:

‣  Crowdsourcing ‣  Data augmentation: annotation-by-synthesis

78/35

Page 79: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models‣  Solutions:

‣  Crowdsourcing ‣  Data augmentation: annotation-by-synthesis

‣  Thanks!

79/35

Page 80: Pitch Analysis for Active Music Discovery€¦ · Collaborators: Juan Pablo Bello, Rachel Bittner, Jordi Bonada, Juan J. Bosch, Jose Miguel Diaz-Bañez, Chris Cannam, Francisco Escobar,

Summary

‣  Active music discovery: user plays active role in retrieval‣  Examples: QBH, search-by-singing-style, computational

(ethno)musicology…‣  Require (auto) extraction of pitch content: melody extraction

‣  Melody extraction: classification at contour level‣  Data scarcity: can’t explore high-capacity (and data

hungry) models‣  Solutions:

‣  Crowdsourcing ‣  Data augmentation: annotation-by-synthesis

‣  Thanks!

80/35

@justin_salamon [email protected] www.justinsalamon.com