FusionNet: 3D Object Classification Using Multiple Data

Vishakh HegdeReza Zadeh

FusionNet: 3D Object Classification Using Multiple Data Representations

Paper: http://matroid.com/papers/fusionnet.pdf Twitter: @Reza_Zadeh

Object recognitionGiven 3D model, figure out what it is

» bathtub

@Reza_Zadeh

Princeton ModelNet662 object classes, 127,915 CAD models

ModelNet40: 40 class subset

http://modelnet.cs.princeton.edu

@Reza_Zadeh

Princeton ModelNetProblem of input representation

Try using image recognition on projections, but that only goes so far.

@Reza_Zadeh

From Image Recognition to Object Recognition

@Reza_Zadeh

Convolutional Network

Slide a two-dimensional patch over pixels.

How to adapt to three dimensions?

Figure: Google image search for “convolutional neural network”

Multi-View CNN

Rotate camera around object

Figure: H. Su, S. Maji, E. Kalogerakis, E. Learned-Miller. Multi-view Convolutional Neural Networks for 3D Shape Recognition. ICCV2015

Representations

@Reza_Zadeh

Volumetric (V-CNN)

Simple idea: slide a three-dimensional volume over voxels.

@Reza_Zadeh

Volumetric CNNsUse two different Volumetric CNNs (VCNN-I and VCNN-II). Example of one:

@Reza_Zadeh

FusionNetFusion of two volumetric representation CNNs and one pixel representation CNN

Hyper-parameters tuned on a

cluster

http://matroid.com/papers/fusionnet.pdf

Machine Learning Pipeline

Learning Algorithm

TrainedModel

Replicatemodel

Serve Model

Repeat entire pipeline

@Reza_Zadeh

Deeper Dive into Networks

@Reza_Zadeh

Multi-View CNNView positions: Corners of icosahedron (20 faces)

Base network: AlexNet (# parameters ~ 60M)

Pre-training on ImageNet, fine-tune last three layers.

VCNN-I

Long kernels learn features spanning the size of the 3D model

Data Augmentation: Gaussian noise added to vertex coordinates in CAD model

Better than VCNN II on: Table, Plant, Bench

VCNN-II

GoogLeNet inspired inception modulesKernel sizes: 1x1x30, 3x3x30, 5x5x30

Hope: Learn features at multiple scales

Better than VCNN I: Radio, Wardrobe, Xbox

Results

@Reza_Zadeh

Results

FusionNet

At the time of submission (July 17th 2016)

ModelNet now

Recent (December 5th 2016)

Conclusions3D convolutions on different kernel sizes help

Combination MVCNN + VCNN helps

Hyper-parameter tuning helps

DEEM workshopHeld in conjunction with SIGMOD/PODS

May 14th, 2017 – Submissions open!

Thank you!FusionNet paper

http://matroid.com/papers/fusionnet.pdf

@Reza_Zadeh

FusionNet: 3D Object Classification Using Multiple Data

Documents

LNICST 79 - Automatic Object Classification and Image

Object detection and object classification using machine

FMS Budget Object Classification Codes (BOC) · 0150 Investment Interest AMS Trust ... Introduction describes how budget object classification codes are used, ... Budget Object Classification

Human Classification, Activity Recognition, Object ...cseweb.ucsd.edu/~mpatanka/docs/Research2.pdf · Human Classification, Activity Recognition, Object Detection and Human Object

Acceleration of Multi-Object Detection and Classification ...on-demand.gputechconf.com/gtc/...multi-object-detection-classification... · Acceleration of Multi-Object Detection and

Fusion Framework for Moving-Object Classification

Object-Based Classification & eCognitionlees.geo.msu.edu/courses/geo827/lecture_12a_ecognition.pdf · What is Object-Based Classification The object based image analysis approach

FusionNet: working smarter, not harder with SQuAD · FusionNet: working smarter, not harder with SQuAD Stanford CS224N Default Project Sebastian Hurubaru and François Chesnay Department

Object Localization, Segmentation, Classification, and Pose …€¦ · •Extend CNN model to multiclass object localization, segmentation, classification, and pose estimation in

Object Classification From Randomized EEG Trials

MOVING OBJECT DETECTION, TRACKING AND CLASSIFICATION …signal.ee.bilkent.edu.tr/Theses/Yigithan.pdf · MOVING OBJECT DETECTION, TRACKING AND CLASSIFICATION FOR SMART VIDEO SURVEILLANCE

Object Detection and Pixel Classification Algorithm

GENERALIZED HOUGH TRANSFORM FOR OBJECT CLASSIFICATION … · 2016-07-05 · Generalized Hough Transform for object ... GENERALIZED HOUGH TRANSFORM FOR OBJECT CLASSIFICATION IN THE

• Identifying Use Cases • Object Analysis • Classification •

Object-centric spatial pooling for image classification

MODULE:VII OBJECT ANALYSIS: CLASSIFICATION 1/11/20161Module VII Object Analysis:Classification

Maritime Object Detection, Tracking, and Classification

1 Framework of Object Detection and Classification …analysis framework is high. KEY WORDS: Cloud Computing, Video Stream Analytics, Object Detection, Object Classification, High

From image classification to object detection

FusionNet: 3D Object Classiﬁcation Using Multiple Data ... · Vishakh Hegde Stanford and Matroid vishakh@matroid.com Reza Zadeh Stanford and Matroid reza@matroid.com Abstract High-quality