Introduction to Deep Compression

ディープラーニングの最新動向圧縮技術編①　Deep Compression

2017/1/5 株式会社ウェブファーマー

大政　孝充

今回取り上げるのはこれ

[1] S. Han, et al.”Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding.”ICLR 2016 - Best Paper Award. Deep Neural Networkのパラメータ容量を３段階の圧縮技術を用いて大幅に縮減した！

解説のポイント

①取り組んだ課題とは ②手法の説明　１）Pruning 　２）Quantization 　３）Haffman Encoding ③結果どうなった？

Deep Learningはパラメータ容量が巨大なのでこんなに不便

●問題１：巨大過ぎてAppStoreなどで扱えない

コンピュータで学習

数百MB

Deep Learningはパラメータ容量が巨大なのでこんなに不便

●問題２：モバイルのバッテリーを大量消費する

すぐに電池がなくなる

数百MBそもそもメモリ上に展開できない

よってモバイル向けに容量を減らしたい！

コンピュータで学習

数百MB

数十MB？数MB？

手法全体の流れ

第１段階 Pruning

第２段階Quantization

第３段階Huffman Encoding

圧縮第１段階：Pruning（切り詰める）

普通に学習した後、閾値を超えないような小さな値の重みを削減する。

・・

0.0008

・・

これで１０分の１とかにする

圧縮第２段階：Quantization（量子化）

まず、似た重み同士でまとめる

この例の場合、１６個の重みを４種類にまとめる

まとめ方は、k-means法を用いる argminC

w−Ci2

w∈Ci

∑i=1

４種類のクラスタにまとまった。

求まった重みを初期値としてFine-Tuningする back propagationはクラスタごとの勾配を求める

∂L∂Ck

=∂L∂Wij

∂Wij

∂Cki, j∑ =

∂L∂Wij

Ι Iij = k( )i, j∑ ：indicator functionΙ ⋅( )

第２段階全体図

圧縮第３段階：Haffman Encoding（ハフマン符号化）

Haffman Encodingを用いてさらに容量を減らす

重みの値１ 1.0 00

重みの値２ 2.0 01

重みの値３ 3.0 10

例）

1.0 00

3.0 10

2.0 01

14 bits

よく出現する数字には短い bit をあて、余り出現しない数字には長い bit をあてる

1.0 10

3.0 110

11 bits

いろいろなモデルの圧縮率

LeNetで４０分の１、AlexNetで３５分の１、VGGNetではなんと、49分の１！

いろいろなモデルのスピード増加率

どのモデルにおいても、のきなみ早くなっている

いろいろなモデルの電力消費削減率

どのモデルにおいても、のきなみ電力が削減されている

結論

l  LeNet、AlexNet、VGGNetいずれにおいてもパラメータの容量を大幅に削減できた

l  結果、速度の大幅な上昇、および電力消費の大幅な削減が達成できた

l  このしくみにより、Deep Learningがモバイルでも使用できそうだ！

おわり

Introduction to Deep Compression

Technology

Analysis on Local Compression Bearing Capacity of Deep

High-quality Single-model Deep Video Compression with ... · Inspired by recent advance in DL-based image and video compression works, we propose a fully end-to-end deep video compression

D COMPRESSION: COMPRESSING DEEP NEURAL · PDF filePublished as a conference paper at ICLR 2016 DEEP COMPRESSION: COMPRESSING DEEP NEURAL NETWORKS WITH PRUNING, TRAINED QUANTIZATION

D COMPRESSION: COMPRESSING DEEP NEURAL ETWORKS … · 2016-02-16 · Published as a conference paper at ICLR 2016 DEEP COMPRESSION: COMPRESSING DEEP NEURAL NETWORKS WITH PRUNING,

Attention-Guided Image Compression by Deep Reconstruction

Basic Introduction to Compression Hosiery · Basic Introduction to Compression Hosiery Melissa Oleck, ... * Compression stockings beneficial ... shiny • There may not be

Data compression introduction

Enhanced Bayesian Compression via Deep Reinforcement Learningopenaccess.thecvf.com/content_CVPR_2019/papers/... · compression method. We map the problem of Bayesian compression problem

Deep Compression: Compressing Deep Neural Networks with …forum.stanford.edu/events/posterslides/DeepCompression... · 2016-03-01 · Deep Compression: Compressing Deep Neural Networks

Natural Compression for Distributed Deep Learningsimple and theoretically and practically effective compression technique, which we call natural compression (C nat). Our technique

Introduction to Compression

Introduction to Data Compression

deep compression Stanford · Deep Compression and EIE: ——Deep Neural Network Model Compression and Efficient Inference Engine Song Han CVA group, Stanford University

Elastic compression stockings for prevention of deep · PDF file24.01.2000 · Elastic compression stockings for prevention of deep vein thrombosis (Review) Amaragiri SV, Lees TA This

Compression of Deep Convolutional Neural Networks under ... · arXiv:1805.08303v2 [cs.CV] 29 Oct 2018 Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints

Introduction to video compression

Deep Image Compression with Iterative Non- uniform ... · October 9, 2018 International Conference on Image Processing ICIP-2018 4 Introduction : image compression Image compression

IMAGE COMPRESSION USING DEEP AUTOENCODER - · PDF fileIMAGE COMPRESSION USING DEEP AUTOENCODER A PROJECT REPORT ... 2.1.1 Image compression using Huffman coding ... 2.1.3 Representing

DVC: An End-To-End Deep Video Compression Frameworkopenaccess.thecvf.com/content_CVPR_2019/papers/Lu_DVC_An_End-To-End... · DVC: An End-to-end Deep Video Compression Framework Guo

Deep frame interpolation for video compression