22
Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Embed Size (px)

Citation preview

Page 1: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Dimension Reductionusing

Rademacher Serieson

Dual BCH Codes

Nir AilonEdo Liberty

Page 2: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Dimension Reduction• Algorithmic technique:

– often as “black box”

• Removes redundancy from data• A sampling method• Measure concentration phenomenon• (Existential) metric embedding• Recently: computational aspects

– time– randomness

Page 3: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Challenge

Find random projection from Rd to Rk

(d big, k small)

such that for every x Rd, ||x||2=1, 0<with probability exp{-k

||x||2 = 1 ± O(

Rd Rk

x x

Page 4: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Usage

If you have n vectors x1..xn Rd:

set k=O(log n)

by union bound:

for all i,j ||xi- xj|| ≈||xi- xj||

low-distortion metric embedding

“tight”

Rd Rk

x1 x1

x2 x3x2

x3

Page 5: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Solution: Johnson-Lindenstrauss (JL)

“dense random matrix”

k

d

=

Page 6: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

So what’s the problem?

• running time (kd)

• number of random bits (kd)

• can we do better?

Page 7: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Fast JLA, Chazelle 2006 = Sparse . Hadamard . Diagonal

time = O(k3 + dlog d), randomness=O(k3 + d)beats JL (kd) bound for: log d < k < d1/3

k

d

Fourier

Page 8: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

This Talk

• O(d logk) for k < d1/2

• beats JL up to k < d1/2

• O(d) random bits

Page 9: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Algorithm (k=d1/2)A, Liberty 2007 = BCH . Diagonal . ....

k

d

Error CorrectingCode

Page 10: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Error Correcting Codes

k = d1/2

d

B =

columns closedunder addition

row set subset ofd x d Hadamard

0 k-1/2

1 -k-1/2Bx computablein time d logdgiven x

binary “dual-BCH code of designed distance 4”

4-wise independence

Page 11: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Error Correcting Codes

Fact (easily from properties of dual BCH):

||Bt||24 = O(1)

for yt Rk with ||y||2 = 1:

||yB||4 = O(1)

Rd Rk=dB

x x

D...

Page 12: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Rademacher Serieson Error Correcting Codes

look at r.v. BDx (Rk, l2)

BDx = DiixiBi Dii R{1} i=1...d

= DiiMi

Rd Rk=dB

x x

D...

Page 13: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Talagrand’s Concentration Boundfor Rademacher Series

Z = ||DiiMi||p (in our case p=2)

Pr[ |Z-EZ| > ] = O(exp{-2/4||M||2p2})

Page 14: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Rademacher Serieson Error Correcting Codes

look at r.v. BDx (Rk, l2)

Z = ||BDx||2 = ||DiiMi||2

||M||22||x||4||Bt||24 (Cauchy-Schwartz)

by ECC properties:

||M||22||x||4O(1)

trivial: EZ = ||x||2 = 1

Rd Rk=dB

x x

D...

Page 15: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Rademacher Serieson Error Correcting Codes

look at r.v. BDx (Rk, l2)

Z = ||BDx||2 = ||DiiMi||2||M||22=O(||x||4)

EZ = 1

Pr[ |Z-EZ| > ] = O(exp{-2/4||M||222})

Pr[|Z-1| > ] = O(exp{-2/||x||42})

Rd Rk=dB

x x

D...

how to get ||x||42 = O(k-1=d-1/2) ?

Challenge: w. prob. exp{-kdeviation of more than

Page 16: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Controlling ||x||42

Rd Rk=dB

x x

D...

how to get ||x||42 = O(k-1=d-1/2) ?

• if you think about it for a second...

• “random” x has ||x||42=O(d-1/2)

• but “random” x easy to reduce:just output first k dimensions

• are we asking for too much?• no: truly random x has strong bound on

||x||p for all p>2

Page 17: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Controlling ||x||42

Rd Rk=dB

x x

D...

how to get ||x||42 = O(k-1=d-1/2) ?

• can multiply x by orthogonal matrix• try matrix HD• Z = ||HDx||4

= ||DiixiHi||4

= ||DiiMi||4

• by Talagrand:Pr[ |Z-EZ| > t ] = O(exp{-t2/4||M||24

2})

EZ = O(d-1/4) (trivial)

||M||24||H||4/34||x||4(Cauchy Schwartz)

(HD used in [AC06] to control ||HDx||)

Page 18: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Controlling ||x||42

how to get ||x||42 = O(k-1=d-1/2) ?

Rd Rk=dB

x x

D...

Z = ||DiiMi||4 Mi = xiHi

Pr[ |Z-EZ| > t ] = O(exp{-t2/4||M||242}) (Talagrand)

EZ = O(d-1/4) (trivial)

||M||24||H||4/34||x||4 (Cauchy Schwartz)

||H||4/34 d-1/4 (Hausdorff-Young)

||M||24d-1/4||x||4

Pr[ ||HDx||4 > d-1/4+ t ] = exp{-t2/d-1/2||x||42}

Page 19: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Controlling ||x||42

how to get ||x||42 = O(d-1/2) ?

Rd Rk=dB

x x

D...

Pr[ ||HDx||4 > d-1/4+ t ] = exp{-t2/d-1/2||x||42}

need some slack k=d-1/2-

max error probability for challenge: exp{-k}

k = t2/d-1/2||x||42

t = k1/2d1/4||x||4 = ||x||4d-/2

Page 20: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Controlling ||x||42

how to get ||x||42 = O(d-1/2) ?

Rd Rk=dB

x x

D...

first round:

||HDx||4 < d-1/4 + ||x||4 d-/2

second round:

||HD’’HD’x||4 < d-1/4 + d-1/4-/2 + ||x||4 d-

r=O(1/)’th round: ||HD(r)...HD’x||4 < O(r)d-1/4

... with probability 1-O(exp{-k})

Page 21: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Algorithm for k=d-1/2-

Rd RkB

x x

D...

HD(1) HD(r)

running time O(d logd)randomness O(d)

Page 22: Dimension Reduction using Rademacher Series on Dual BCH Codes Nir Ailon Edo Liberty

Open Problems

• Go beyond k=d1/2 Conjecture: can do O(d log d) for k=d1-

• Approximate linear l2-regression

minimize ||Ax-b||2 given A,b (overdetermined)

– State of the art for general inputs: Õ(linear time) for #{variables} < #{equations}1/3

– Conjecture: can do Õ(linear time) always