Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
IML TutorialReview HW5
Jakob Jakob 1
1ETH Zurich
1
K-means
Given: n points xi ∈ Rd, i ∈ 1, ..., nGoal: Find the k clusters µ = (µ1, ..., µk)Minimize
L(µ) =n∑
i=1
minj∈1,...,k
‖xi − µj‖22
Algorithm : Assignment and refitting step
zi ∈ arg minj∈{1,...,k}
‖xi − µj‖22 µj =1
|{i : zi = j}|∑i:zi=j
xi
2
K-means — visualizationGiven data
Example k = 2, d = 2
3
K-means — visualizationInitialization of µ1 and µ2
Example k = 2, d = 2
4
K-means — visualizationAssignment step
Example k = 2, d = 2
5
K-means — visualizationAssignment step
Example k = 2, d = 2
6
K-means — visualizationAssignment step
Example k = 2, d = 2
7
K-means — visualizationAssignment step
Example k = 2, d = 2
8
K-means — visualizationAssignment step
Example k = 2, d = 2
9
K-means — visualizationAssignment step
Example k = 2, d = 2
10
K-means — visualizationAssignment step
Example k = 2, d = 2
11
K-means — visualizationAssignment step
Example k = 2, d = 2
12
K-means — visualizationAssignment step
Example k = 2, d = 2
13
K-means — visualizationRefitting step
Example k = 2, d = 2
14
K-means — visualizationRefitting step
Example k = 2, d = 2
15
K-means — visualizationRefitting step
Example k = 2, d = 2
16
K-means — visualizationRefitting step
Example k = 2, d = 2
17
ReviewQ6
Explanation of Q6 (switch to solutions)
18
L2 Loss
L(µ) =n∑
i=1
minj∈1,...,k
‖xi − µj‖22
L(µ) =n∑
i=1
‖xi − µzi‖22 zi = arg minj∈1,...,k
‖xi − µj‖22
µ = arg minµ
n∑i=1
‖xi − µzi‖22
µj = arg minµj
∑i:zi=j
‖xi − µj‖22
19
L2 Loss
µj = arg minµj
∑i:zi=j
‖xi − µj‖22
∂L
∂µj=
∑i:zi=j
−2(xi − µj) = −2∑i:zi=j
(xi − µj) = 0
=∑i:zi=j
(xi − µj) =∑i:zi=j
xi − |{i : zi = j}|µj = 0
=⇒ µj =1
|{i : zi = j}|∑i:zi=j
xi
20
L1 Loss
L(µ) =n∑
i=1
minj∈1,...,k
‖xi − µj‖1
L(µ) =n∑
i=1
‖xi − µzi‖1 zi = arg minj∈1,...,k
‖xi − µj‖1
µ = arg minµ
n∑i=1
‖xi − µzi‖1
µj = arg minµj
∑i:zi=j
‖xi − µj‖1 = arg minµj
∑i:zi=j
d∑q=1
|xi,q − µj,q|
21
L1 Loss
µj = arg minµj
∑i:zi=j
d∑q=1
|xi,q − µj,q|
µj,q = arg minµj,q
∑i:zi=j
|xi,q − µj,q|
L(µj,q) =∑i:zi=j
|xi,q − µj,q|
=∑
i:zi=j, xi,q≤µj,q
|xi,q − µj,q|+∑
i:zi=j, xi,q>µj,q
|xi,q − µj,q|
22
L1 Loss
L(µj,q) =∑
i:zi=j, xi,q≤µj,q
|xi,q − µj,q|+∑
i:zi=j, xi,q>µj,q
|xi,q − µj,q|
=∑
i:zi=j, xi,q≤µj,q
(µj,q − xi,q) +∑
i:zi=j, xi,q>µj,q
(xi,q − µj,q)
∂L
∂µj= |{i : zi = j, xi,q ≤ µj,q}| − |{i : zi = j, xi,q > µj,q}| = 0
=⇒ µj,q = median (xi,q, i : zi = j)
23
Mean vs Median — visualization
Example with an outlier k = 1, d = 1
24
Mean vs Median — visualization
Example with an outlier k = 1, d = 1, median in blue and mean in red
25
END OF REVIEW OF HW5
26