m
m ( - )
Hiroaki Shiokawa h
h
Research Interests
h dDB w d
2011/4-2015/10 NTT, 2015/11-
(
lh chh
R[ N N N N
ÕêÒ
ÀÎÉ
)
l h– m k j– k j
h
l h h eFNK– m k jGOL m– k j
h
+
h
l h hh
–•• KQOBKXU•
C C
• GOL C C–
• m– j
• G––
3 4
u r
h
h
,
hl r w
-
h
B m k2×10%
GOL m m jGOL
D S]]O[k2×10&6KMOL Uk5×10&
7 QVOk10(j
m 10(
h ( / hs v w vg
l C 6 h h – ( , g7[K R 4K]Kg– fA O[b [ MO SXQg
l C5j:A5 k– ( g7[K R 1VQ [S]RW g
.
:A5 uzl ,
– D + w x–– ( , , m
/
l h–
h xl v j k
– x m• C C I '-1�$0�!*������J• m I�1�$0�!*������J
l
l y
l y
✓ Incremental Aggregation
✓ 2-hop away nodes
ü
H. Shiokawa et al., "Fast Algorithm for Modularity-based Graph Clustering," In Proc. AAAI 2013
H. Shiokawa et al., "SCAN++: Efficient Algorithm for Finding Clusters, Hubs and Outliers on Large-scale Graphs," PVLDB 2015
l w e ws )
– y hy
– )(+,) x–
• x• x
RW L] ?X URaNM 5] ICRS KXN KVSU D 1 9e J
���
���
(
Modularity
– M. E. J. Newman and M. Girvan, "Finding and evaluating community structure in networks," Phys. Rev. E 69, 026113, 2004
)
$$Q = \sum_{i \in C} \left\{ \frac{e_{ii}}{2m} - \left( \frac{a_i}{2m} \right)^2 \right\}$$
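A minimal sketch of how the modularity Q could be computed for a given clustering. It assumes an undirected, unweighted graph given as an edge list, that e_ii counts each intra-cluster edge twice (once per endpoint), that a_i is the total degree of cluster i, and that m is the number of edges. The function name `modularity` and the example graph are hypothetical and do not come from the slides.

```python
from collections import defaultdict

def modularity(edges, cluster_of):
    """Q = sum over clusters i of { e_ii / 2m - (a_i / 2m)^2 }.

    Assumptions (not from the slides): undirected, unweighted graph;
    e_ii counts each intra-cluster edge twice; a_i is the total degree
    of the nodes in cluster i; m is the number of edges.
    """
    m = len(edges)
    e_in = defaultdict(int)  # e_ii per cluster
    a = defaultdict(int)     # a_i per cluster
    for u, v in edges:
        cu, cv = cluster_of[u], cluster_of[v]
        a[cu] += 1
        a[cv] += 1
        if cu == cv:
            e_in[cu] += 2
    return sum(e_in[c] / (2 * m) - (a[c] / (2 * m)) ** 2 for c in a)

# Hypothetical example: two triangles joined by a single edge.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_clusters = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
one_cluster = {v: 0 for v in range(6)}
q_two = modularity(edges, two_clusters)
q_one = modularity(edges, one_cluster)
print(q_two, q_one)
```

Placing every node in a single cluster gives Q = 0 under this definition, which is why splitting the example into its two triangles scores higher.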
¬»¦ |��
XM]U R
b 1 * ) ( ( ( ( 1(
1
b 1 - ) * ( 1,
2
4 w r r w v
1 2
rl
– m hx
+
i
(∑ %
<%<=> = 127
• h w tw w r
)∑ ∑ %
<%@<A=>
%@<A
%<=> = 1932
m
m
Girvan-Newman [Girvan et al.]
Modularity
,
l v XM]U Rl XM]U R w y
– SX Ka [WKVScON M ] x– D(EF) x
R W ?N W H R W N U ) +I
������������1 ,
-
m
m
m
Girvan-Newman [Girvan et al.]
Newman [Newman et al.]
CNM [Clauset et al.]
y
N VK[S]b
m m O WKX
XM]U R
.
l o h XM]U RXM]U R P RW ∆H
l ∆H hl ) +I �)(J log+)
?N W 5?
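Modularity gain:
$$\Delta Q_{ij} = 2 \left\{ \frac{e_{ij}}{2m} - \frac{a_i a_j}{(2m)^2} \right\}$$
Modularity:
$$Q = \sum_{i \in C} \left\{ \frac{e_{ii}}{2m} - \left( \frac{a_i}{2m} \right)^2 \right\}$$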
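The modularity gain can be sketched as a one-line function. This is an illustration under stated assumptions, not the authors' implementation: e_ij is taken to be the number of edges between clusters i and j, a_i and a_j are the total degrees of the two clusters, and m is the number of edges in the graph. The name `modularity_gain` and the numbers in the example are hypothetical.

```python
def modularity_gain(e_ij, a_i, a_j, m):
    """Delta Q_ij = 2 * { e_ij / 2m - a_i * a_j / (2m)^2 }.

    Assumptions (not from the slides): e_ij is the number of edges
    between clusters i and j, a_i and a_j are the cluster degrees,
    and m is the total number of edges.
    """
    return 2.0 * (e_ij / (2 * m) - (a_i * a_j) / (2 * m) ** 2)

# Hypothetical example: two triangles joined by one edge (m = 7).
# Each triangle is a cluster with total degree 7, linked by 1 edge.
gain_merge_triangles = modularity_gain(e_ij=1, a_i=7, a_j=7, m=7)

# Merging two adjacent singleton nodes of degree 2 in the same graph.
gain_merge_pair = modularity_gain(e_ij=1, a_i=2, a_j=2, m=7)
print(gain_merge_triangles, gain_merge_pair)
```

A greedy method would merge the pair with the largest positive gain; here merging the two triangles has a negative gain, while merging two adjacent low-degree nodes has a positive one.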
/
m
m
m
m
Girvan-Newman [Girvan et al.]
Newman [Newman et al.]
CNM [Clauset et al.]
y
N VK[S]b
m m O WKX
Louvain [Blondel et al.]
v y r
XM]U R
(
l XM]U R wm k m m
• h h m N VK[S]bʼ m
( m k m• h
l D(Q) “ x
=X] RW
)
((
(
,
m ( m
(
(
m
m
m
m
Girvan-Newman [Girvan et al.]
Newman [Newman et al.]
CNM [Clauset et al.]
y
N VK[S]b
m m O WKX
Louvain [Blondel et al.]
v y r
XM]U R
om
WL N NW U 3PP NP RXWo( m
=X] RW (((
l h w– AX N =–
Incremental Aggregation [Shiokawa et al.]
AX N =m x ʼ m m h
ʼ mx
rx
()
l WL N NW U 3PP NP RXW –– AX N =
Incremental Aggregation [Shiokawa et al.]
(
tl w h
– m hʻ h m
(
(
mk (k
(+
AX N U tl r h
– m m x mʻ
A
5 6
B
p h 3 4w
h 3v
h 4v)
(
r h 4sw w r
A
5 6
(
( m 1j2 hw
(,
hl h
– Stanford Network Analysis Project, Laboratory for Web Algorithmics
l– SX a ( , . O[ O[– 9X]OV HO X 3 E +, ( (-78c KXN 72 B1– C]K]O P ]RO K[] 27 h3
Dataset   |V|          |E|            Skewness of degree distribution
dblp      326,186      1,615,400      2.82
live      5,363,260    79,023,142     2.29
uk-2005   39,459,925   936,364,282    1.71
webbase   118,142,155  1,019,903,190  2.14
uk-2007   105,896,555  3,738,733,648  1.51
(-
l 4 == -– [ ON SWS]ON (– x x
(f
(.
) ( m
m+,
l4 ==
)f
(/
MKU UR N ]T ) , NKK [N ]T ) .
WL N NW UPP NP RXW / - /. /. /-
4 == .. - /- /, /-5? .(
lBN[XU] RXW =R R H8XU ]W X ) /I
– xʼ
– xh7[ XN ][ ]R I�1�������J
y
)
$$Q = \sum_{i \in C} \left\{ \frac{e_{ii}}{2m} - \left( \frac{\sum_{j \in C} e_{ij}}{2m} \right)^2 \right\}$$
hʼ x
)
l
)(
l y
l y
✓ Incremental Aggregation
✓ 2-hop away nodes
ü
H. Shiokawa et al., "Fast Algorithm for Modularity-based Graph Clustering," In Proc. AAAI 2013
H. Shiokawa et al., "SCAN++: Efficient Algorithm for Finding Clusters, Hubs and Outliers on Large-scale Graphs," PVLDB 2015
y 0C53? IH ������J
l r h c
– mC][ M] [KV SWSVK[S]b
m
9
7
8 6
3
4
0
5
2
1
10
1112
13 mm
))
Structural similarity
l h
6
3
0
2
1
4 5
$$\Gamma(u) = \{ v : (u, v) \in E \} \cup \{ u \}, \qquad \sigma(u, v) = \frac{|\Gamma(u) \cap \Gamma(v)|}{\sqrt{|\Gamma(u)|\,|\Gamma(v)|}}$$
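The structural similarity can be sketched in a few lines. This assumes the usual SCAN definition: the structural neighborhood of a node is its adjacent nodes plus the node itself, and the similarity is the size of the shared neighborhood divided by the geometric mean of the two neighborhood sizes. The function names and the four-node example graph are hypothetical.

```python
import math

def structural_neighborhood(adj, u):
    """Gamma(u): the neighbors of u together with u itself."""
    return set(adj[u]) | {u}

def structural_similarity(adj, u, v):
    """sigma(u, v) = |Gamma(u) & Gamma(v)| / sqrt(|Gamma(u)| * |Gamma(v)|)."""
    gu = structural_neighborhood(adj, u)
    gv = structural_neighborhood(adj, v)
    return len(gu & gv) / math.sqrt(len(gu) * len(gv))

# Hypothetical example graph with edges 0-1, 0-2, 1-2, 1-3.
adj = {0: [1, 2], 1: [0, 2, 3], 2: [0, 1], 3: [1]}
sim_02 = structural_similarity(adj, 0, 2)  # identical neighborhoods
sim_01 = structural_similarity(adj, 0, 1)
sim_13 = structural_similarity(adj, 1, 3)
print(sim_02, sim_01, sim_13)
```

Nodes 0 and 2 share exactly the same neighborhood, so their similarity is 1.0, while the pendant node 3 is less similar to node 1.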
y c
m + 1
ʻʻ
)
1 y h hn h h
6
3
0
2
1
4 56
3
0
2
1
4 5
y h
6
3
0
2
1
4 5
h
l y h h m LX Nl h h m KX MN
)+
LX N KX MNl LX N 0 h \ ]
l y h h m LX Nl y h h m LX N
\]x
x ^ z m x
),
k\ = _. a, ] = I
6
3
0
5
2
1
4-.
6
0
5
2
1
4
3-.
6
0
2
1
4
3
5
)-
LX N KX MNl LX N 0 h \ ]
l KX MN– M [O ^ m
l y h h m LX Nl y h h m LX N
\]x
x ^ z m x
l C53? c h \, ] t vLX N KX MN z
).
C53? IH 44( -J r
l 0 w r
l j kx
•( m x
• ^ h m
C53?
C53? w v r C53? X]WM ]r
)/
C53? “ r
SCAN [Xu, KDD 2007]
SCOT+HintClus [Bortner, ICDE 2010]
gSkeletonClu [Huang, TKDE 2013]
LinkSCAN* [Lim, ICDE 2014]
m ʼʻ
C53?
SCAN [Xu, KDD 2007]
SCOT+HintClus [Bortner, ICDE 2010]
gSkeletonClu [Huang, TKDE 2013]
LinkSCAN* [Lim, ICDE 2014]
SCAN++ [Shiokawa, PVLDB 2015]
l C53? c v r– vC31
l
(m
m
) h hh M [O
l
u h
(
l h
∃c ∈ de ∩ df ∧ c:hijk1 ↔ de ∪ df ⊆ ne
S T
de df
de ∩ df
S T de ∪ df
neop ∩ oqLX N
w
( m S, T m MKV MV ]O[ de, dfhC31 m S ne h i
)
C53?l D X [N
– ( m m
[Figure: Phase 1: Local clustering, then Phase 2: Cluster refinement, shown on an example graph with nodes 0 to 12]
Local cluster
D1B S ]MKV MV ]O[
�$++!�� MKV MV ]O[m
A [N ( 0 =XL U LU][ N RWPl D3B r) h
– m V MKV MV ]O[
\ ≥ _. a, ] ≥ I
6
3
0
5
2
1
4-.6
3
0
5
2
1
4-.
D1B PX NO
6
3
0
5
2
1
4-.
MKV MV ]O[P X NO
l �$%(,(0(-,��k( R K Kb [OKMRKLVO � � m � �
X NO S 79(R ?:8<6;>=Y S, c ≥ ^ X NO c m
m S m T
m c MKV MV ]O[P X NOK RMPN
+
A [N ) 0 5U][ N NORWN NWl ������� r=XL U LU][ N h
• MKV MV ]O[SXQ L[SNQO M [O– 2[SNQO M [O V MKV MV ]O[ m
• L[SNQO M [O ][ M] [KV SWSVK[S]b
\ ≥ _. a, ] ≥ I
6
3
0
5
2
1
4
MKVMV ]O[ MKV
MV ]O[ (
X NO + M [O hMKV MV ]O[ (
6 4
3
0
2
5
1
K RMPN,
l CR RU R [ RWP– D1B
l3 L N R r– ] RK O MV ]O[SXQ K BON MO m m
6
3
0
5
2
14
7
ks(,, t) CR RU R [ RWPΓ 3 nΓ(0)m
$$\sigma(3, 4) = \frac{|\Gamma(3) \cap \Gamma(4)|}{\sqrt{|\Gamma(3)|\,|\Gamma(4)|}}$$
-
C53?l
– C31 D( 6@w6xyw |W|)• i hz
– ʼ
l– C31 C31
•
.
(l
– SX a ( , . O[ O[– 9X]OV HO X 3 E +, ( (-78c– 72 B1
l h
/
)l
• C53?– CSWSVK[S]b RK[SXQ h
• C53? I�1�$0�!*�����5��J–
• C53? I�(+�$0�!*������5�J– x C31
• PCTNUN XW5U] I�1,�$0�!*������5�� �1!,&�$0�!*������5��J– x m ^ C31
+
(f
l C53? C53? )– x m x– QCUOVO] X3V C31
+
)fl C53? LX N 8
– C31 M [O x
+(
*fl h
– ( m ho ,
h 1 ( c
h 1 - p
"+)
*fl h r
– x m– x l(
m cavemanmodel) LKVKXMON ][OO
+
l
++
l y
l y
✓ Incremental Aggregation
✓ 2-hop away nodes
ü
H. Shiokawa et al., "Fast Algorithm for Modularity-based Graph Clustering," In Proc. AAAI 2013
H. Shiokawa et al., "SCAN++: Efficient Algorithm for Finding Clusters, Hubs and Outliers on Large-scale Graphs," PVLDB 2015
z j( )kl AE r
– CSXQVO 7 E• I�1�$0�!*�������J
j xj 7 E
– V]S 7 E• I�-+!, $0�!*������J
KLOV [ KQK]S XKLOV [ KQK]S X 7 E
l uz– I�(").!+!!.!"'"'( $0�!*������J
I2KO KXN 2SVV C3d +Jm m 7[K R KL x
+,
A 7D C
4 == ( 4 == ( 4 == Wi
4 ==
z j) )kl ]U R 5AE r
– BKLLS] [NO[ I .!(�$0�!*��������5�J9XM[OWOX]KV 1QQ[OQK]S X I�'(-)!3!�$0�!*�� �5��J
31C
l 5 3 r– 8(.
s , t
+-
[Figure: Read memory bandwidth [GB/s]; vertical axis ticks 0 to 50 and 0 to 6, horizontal axis ticks 1 to 24]
Intel Xeon Phi
– 9X]OV x m• 3 XSQR] 3 [XO[ XSQR] VKXNSXQ (• -( • + ( C9 4• x
+.
Knights Corner    Knights Landing
w
+/
• SCAN®� º���ÛÑåÔÎÕ ���%�ÃãÊÍL��Y• ÄãدN@�3¯�¹®·º���X¯��
IntelXeonPhi®·º�y�®°�X^���«512×ÎѯSIMDUc¯Q[��m
• Xeon Phi and Xeon E5 specifications
CPU            Clock    Cores/Threads  SIMD register
Xeon E5 1620   3.5GHz   4/8            256 bit
Xeon Phi 7250  1.4GHz   68/272         512 bit
webbase2001N@�3
�����fC53? X N GNXW A R
,
?K=P
ßÓêŽÚçÌÎÇXeonPhi®·ºÊæÎÒ���«SIMDUc®·¹,SCAN¼�y�
Mz^�6oc���®Wbª�º¥µëXeonPhi¼[�¥����H�ª�º
,
• coreL�¯ÊæÎÒ�±ÐêÍ���• Mz^�6oc¯ÊæÎÒ���• SIMD®·º�xÕêÒL��Y¯ÐêÍ���
• ÃãÊÍL�¯ÊæÎÒ���• Union-FindI½åÆäËÞª¯ÃãÊÍL�• CAS$�¼�§¥ÊæÎÒ���
• ÖÙé*»�L�¯ÊæÎÒ���• iD®���� h• pe°_\
SCAN¯���ÛÑåÔÎünR¢ë!�Y¼�X^®���
,(
to~�¯>g¡»©�ºÕêÒ®101ª0;
[Figure: example graph with nodes 0, 1, 2, 3]
CRS (Compressed Row Storage)
node: 0 1 2 3
ptr:  0 2 5 7 8
to:   1 2 0 2 3 0 1 1
from: 0 0 1 1 1 2 2 3
CRS87°a��X¶ÂáÎÈâ�X®�»º
• CRS°ÀÎÉ�ª¯����%�• ptr²¯vܾèÍfrom¼[�©ÀÎÉ�ª���
• XeonPhi¯ßàä½ÃÌÊæ¾Ïèȼ�£¥µÂáÎÈâÝʼ�S¢¥�
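A minimal sketch of building the CRS arrays shown above (`ptr`, `to`, `from`) from an undirected edge list. The edge list is an assumption inferred from the arrays themselves (edges 0-1, 0-2, 1-2, 1-3), and the function name `build_crs` is hypothetical.

```python
def build_crs(num_nodes, edges):
    """Build CRS arrays for an undirected graph.

    ptr[u]..ptr[u+1] delimits the slice of `to` holding u's sorted
    neighbors; `frm` repeats the source node for every stored edge.
    """
    adj = [[] for _ in range(num_nodes)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    ptr, to, frm = [0], [], []
    for u in range(num_nodes):
        nbrs = sorted(adj[u])
        to.extend(nbrs)
        frm.extend([u] * len(nbrs))
        ptr.append(len(to))
    return ptr, to, frm

def neighbors(ptr, to, u):
    """Neighbors of u are one contiguous slice of `to`."""
    return to[ptr[u]:ptr[u + 1]]

# Edge list assumed from the arrays on the slide.
ptr, to, frm = build_crs(4, [(0, 1), (0, 2), (1, 2), (1, 3)])
print(ptr, to, frm)
print(neighbors(ptr, to, 1))
```

Because each neighbor list is contiguous and sorted, it can be scanned sequentially, which suits the pointer-based comparison used for similarity computation.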
h h ( )
l h h [X N PN SXRW ch
l C 6 [RWPUN RW[ ]L RXW ]U R UN M
,)
Sorted adjacent node list of the first node:  3 4 10 13 20 … end
Sorted adjacent node list of the second node: 2 3 11 13 43 … end
First register:  3 3 4 4
Second register: 2 3 2 3
Count the number of Equals
compare (4 > 3)
advance pointer
Compare the elements, then advance the pointer of the list with the smaller value.
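The compare-and-advance idea can be sketched in scalar form. This is a simplification: the slides compare blocks of elements held in SIMD registers, whereas the sketch below compares one element at a time, which is the same pointer logic without vectorization. The function name `count_common_sorted` is hypothetical; the two lists are the visible prefixes from the slide.

```python
def count_common_sorted(a, b):
    """Count elements common to two sorted lists by advancing two pointers."""
    i = j = count = 0
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            count += 1      # an "Equals" hit
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1          # advance the pointer on the smaller side
        else:
            j += 1
    return count

# Visible prefixes of the two adjacency lists on the slide.
list_a = [3, 4, 10, 13, 20]
list_b = [2, 3, 11, 13, 43]
common = count_common_sorted(list_a, list_b)
print(common)
```

The count of common elements is the numerator of the structural similarity (after accounting for the two nodes themselves), so this loop dominates the cost of SCAN-style clustering.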
,
h h ) )
l h– ʼ h ʼ
• çêÒ@¯)F• Ou£º~�¯Ç¾ËO®·¹ëçêÒ£º@¼�¸¤º
ǾËO#� ]º
çêÒ@ ]º#�
ÕêÒw¯~��v®O³©�5®,��'"ÕêÒ�¯�>ÕêÒ~� ÕêÒ�¯�>ÕêÒ~�
3 4 10 13 20 … end 2 3 11 13 43 … end
3 3 3 3æÉÊÍ � 2 3 11 13æÉÊÍ �
l C53? LX Nv ww
,+
6
3
4 5
0
2
1
Union-FindI¼[�¥���/E½åÆäËÞ¼?K
2
10
3 5
4 6
Union-FindI
• f"�Y°CompareandSwap$�¼[�©ÊæÎÒ�¯A"<��¥»ºì
core«�>ÕêÒ¯´�ÃãÊÍ«º��-core
non-core
l C53? LX Nv ww
,,
6
3
4 5
0
2
1
Union-FindI¼[�¥���/E½åÆäËÞ¼?K
6
3
4 5
0
2
1
2
10
3 5
4 6
Union-FindI
• f"�Y°CompareandSwap$�¼[�©ÊæÎÒ�¯A"<��¥»ºì
core«�>ÕêÒ¯´�ÃãÊÍ«º��-
core0«Mz^�ÕêÒ°1,4,5
core
non-core
2
1
0
3 54
6
Union-FindI
l C53? LX Nv ww
,-
6
3
4 5
0
2
1
Union-FindI¼[�¥���/E½åÆäËÞ¼?K
6
3
4 5
0
2
1
6
3
4 5
0
2
1
2
10
3 5
4 6
Union-FindI
• f"�Y°CompareandSwap$�¼[�©ÊæÎÒ�¯A"<��¥»ºì
core«�>ÕêÒ¯´�ÃãÊÍ«º��-
core0«Mz^�ÕêÒ°1,4,5
5��l¢¥¯ªë2¨¯ÃãÊÍ(I)¼
f"
core
non-core
2
1
0
3 54
6
Union-FindI
2
1
0
3 54
6
Union-FindI
CAS
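A rough sketch of merging clusters in a Union-Find tree with a compare-and-swap (CAS) retry loop. Python has no hardware CAS instruction, so the `compare_and_swap` method below emulates one with a lock; that emulation, the class name `AtomicParents`, and the rule of attaching the larger root under the smaller one are all assumptions for illustration, not details taken from the slides. The example merges core node 0 with nodes 1, 4, and 5, as in the slide's example.

```python
import threading

class AtomicParents:
    """Parent array with an emulated compare-and-swap (assumption:
    a lock stands in for the hardware CAS instruction)."""

    def __init__(self, n):
        self._parent = list(range(n))
        self._lock = threading.Lock()

    def load(self, i):
        return self._parent[i]

    def compare_and_swap(self, i, expected, new):
        with self._lock:
            if self._parent[i] == expected:
                self._parent[i] = new
                return True
            return False

def find(p, x):
    """Follow parent links up to the root."""
    while True:
        px = p.load(x)
        if px == x:
            return x
        x = px

def union(p, x, y):
    """Merge the trees of x and y; retry if another thread changed a root."""
    while True:
        rx, ry = find(p, x), find(p, y)
        if rx == ry:
            return
        if rx < ry:
            rx, ry = ry, rx  # attach the larger root under the smaller one
        if p.compare_and_swap(rx, rx, ry):
            return

parents = AtomicParents(7)
threads = [threading.Thread(target=union, args=(parents, 0, v)) for v in (1, 4, 5)]
for t in threads:
    t.start()
for t in threads:
    t.join()
roots = [find(parents, v) for v in range(7)]
print(roots)
```

The CAS only succeeds while the node being re-linked is still a root, so two threads can never overwrite each other's merge; a failed CAS simply triggers another `find` and retry.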
,.
• Algorithms compared
  • SCAN: existing method, run on the CPU only
  • SCAN-XP: proposed method, run on the CPU, KNC, and KNL
• Datasets used in the experiments
  Dataset            Nodes        Edges
  com-Youtube        1,134,890    2,987,624
  web-BerkStan       685,230      6,649,470
  soc-Pokec          1,632,803    22,301,964
  com-LiveJournal    3,997,962    34,681,189
  soc-LiveJournal1   4,846,609    42,851,237
  com-Orkut          3,072,441    117,185,083
  -                  115,554,441  854,809,761
• Experimental environment
  • CPU: Xeon E5 1620 (3.5GHz, 4 cores, 8 threads), Memory 16GB
  • KNC: Xeon Phi 3120A (1.10GHz, 57 cores, 228 threads), Memory 6GB
  • KNL: Xeon Phi 7250 (1.4GHz, 68 cores, 272 threads), Memory 16GB (MCDRAM) + 96GB (DDR4)
,/
�r¢©ë100���¯�y�36`ª�Y!
better
ÀÎÉ@2 +
N/A�
�ßàä�t®·¹.k�
[Figure: execution time (ms, logarithmic scale) on com-youtube, com-LiveJournal, com-Orkut, soc-LiveJournal1, web-BerkStan, and soc-Pokec, comparing SCAN and SCAN-XP on CPU, KNC, and KNL]
Legend: SCAN(Xeon,1), SCAN-XP(Xeon,1), SCAN-XP(Xeon,4), SCAN-XP(Xeon,8), SCAN-XP(KNC,57), SCAN-XP(KNC,114), SCAN-XP(KNC,171), SCAN-XP(KNC,228), SCAN-XP(KNL,68), SCAN-XP(KNL,136), SCAN-XP(KNL,204), SCAN-XP(KNL,272)
• m m–– BOKV [VN [ O[]b z ʻ
-
• m– HO X RS 7 E O]M
• m– h h h h hp
l– h
l W N N[ RWP NW A XKUN [