23
Vishakh Hegde Reza Zadeh FusionNet: 3D Object Classification Using Multiple Data Representations Paper: http://matroid.com/papers/fusionnet.pdf Twitter: @Reza_Zadeh

FusionNet: 3D Object Classification Using Multiple Data

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: FusionNet: 3D Object Classification Using Multiple Data

Vishakh HegdeReza Zadeh

FusionNet: 3D Object Classification Using Multiple Data Representations

Paper: http://matroid.com/papers/fusionnet.pdf Twitter: @Reza_Zadeh

Page 2: FusionNet: 3D Object Classification Using Multiple Data

Object recognitionGiven 3D model, figure out what it is

» bathtub

@Reza_Zadeh

Page 3: FusionNet: 3D Object Classification Using Multiple Data

Princeton ModelNet662 object classes, 127,915 CAD models

ModelNet40: 40 class subset

http://modelnet.cs.princeton.edu

@Reza_Zadeh

Page 4: FusionNet: 3D Object Classification Using Multiple Data

Princeton ModelNetProblem of input representation

Try using image recognition on projections, but that only goes so far.

@Reza_Zadeh

Page 5: FusionNet: 3D Object Classification Using Multiple Data

From Image Recognition to Object Recognition

@Reza_Zadeh

Page 6: FusionNet: 3D Object Classification Using Multiple Data

Convolutional Network

Slide a two-dimensional patch over pixels.

How to adapt to three dimensions?

Figure: Google image search for “convolutional neural network”

Page 7: FusionNet: 3D Object Classification Using Multiple Data

Multi-View CNN

Rotate camera around object

Figure: H. Su, S. Maji, E. Kalogerakis, E. Learned-Miller. Multi-view Convolutional Neural Networks for 3D Shape Recognition. ICCV2015

Page 8: FusionNet: 3D Object Classification Using Multiple Data

Representations

@Reza_Zadeh

Page 9: FusionNet: 3D Object Classification Using Multiple Data

Volumetric (V-CNN)

Simple idea: slide a three-dimensional volume over voxels.

@Reza_Zadeh

Page 10: FusionNet: 3D Object Classification Using Multiple Data

Volumetric CNNsUse two different Volumetric CNNs (VCNN-I and VCNN-II). Example of one:

@Reza_Zadeh

Page 11: FusionNet: 3D Object Classification Using Multiple Data

FusionNetFusion of two volumetric representation CNNs and one pixel representation CNN

Hyper-parameters tuned on a

cluster

http://matroid.com/papers/fusionnet.pdf

Page 12: FusionNet: 3D Object Classification Using Multiple Data

Machine Learning Pipeline

Data

Learning Algorithm

TrainedModel

Replicatemodel

Serve Model

Repeat entire pipeline

@Reza_Zadeh

Page 13: FusionNet: 3D Object Classification Using Multiple Data

Deeper Dive into Networks

@Reza_Zadeh

Page 14: FusionNet: 3D Object Classification Using Multiple Data

Multi-View CNNView positions: Corners of icosahedron (20 faces)

Base network: AlexNet (# parameters ~ 60M)

Pre-training on ImageNet, fine-tune last three layers.

Page 15: FusionNet: 3D Object Classification Using Multiple Data

VCNN-I

Long kernels learn features spanning the size of the 3D model

Data Augmentation: Gaussian noise added to vertex coordinates in CAD model

Better than VCNN II on: Table, Plant, Bench

Page 16: FusionNet: 3D Object Classification Using Multiple Data

VCNN-II

GoogLeNet inspired inception modulesKernel sizes: 1x1x30, 3x3x30, 5x5x30

Hope: Learn features at multiple scales

Better than VCNN I: Radio, Wardrobe, Xbox

Page 17: FusionNet: 3D Object Classification Using Multiple Data

Results

@Reza_Zadeh

Page 18: FusionNet: 3D Object Classification Using Multiple Data

Results

Page 19: FusionNet: 3D Object Classification Using Multiple Data

FusionNet

At the time of submission (July 17th 2016)

Page 20: FusionNet: 3D Object Classification Using Multiple Data

ModelNet now

Recent (December 5th 2016)

Page 21: FusionNet: 3D Object Classification Using Multiple Data

Conclusions3D convolutions on different kernel sizes help

Combination MVCNN + VCNN helps

Hyper-parameter tuning helps

Page 22: FusionNet: 3D Object Classification Using Multiple Data

DEEM workshopHeld in conjunction with SIGMOD/PODS

May 14th, 2017 – Submissions open!

Page 23: FusionNet: 3D Object Classification Using Multiple Data

Thank you!FusionNet paper

http://matroid.com/papers/fusionnet.pdf

@Reza_Zadeh