
Fully Convolutional Networks for Semantic Segmentation

Jonathan Long, Evan Shelhamer, Trevor Darrell

Presenter: Hannah Li

Convolutional Networks

Classifier nets take inputs of fixed dimension and throw away spatial coordinates.

Fully Convolutional Networks

x_ij - location (i, j) for a particular layer
y_ij - location (i, j) for the following layer
k - kernel (filter) size
s - stride
f_ks - determines the layer type:

- matrix multiplication for convolution or average pooling
- spatial max for max pooling
- elementwise nonlinearity for an activation function
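In the paper's notation, a layer with kernel size k and stride s computes

```latex
y_{ij} = f_{ks}\bigl(\{\, x_{si+\delta i,\; sj+\delta j} \,\}_{0 \le \delta i,\, \delta j \le k}\bigr)
```

and composing layers obeys the transformation rule

```latex
f_{ks} \circ g_{k's'} = (f \circ g)_{k' + (k-1)s',\; ss'}
```

so a net built only from such layers computes one big nonlinear filter.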

Because the loss over the whole image is a sum over spatial positions, its gradient is the sum of the gradients of the final layer's spatial components.

Output dimensions are reduced to keep filters small and computational requirements reasonable

However, we need dense pixels…

Solution: Deconvolution/Upsampling

(Animations: transposed convolution with stride = 1 and stride = 2)

https://github.com/vdumoulin/conv_arithmetic

http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-pixels.pdf

Deconvolution

Bilinear interpolation
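The paper initializes its upsampling (deconvolution) layers to perform bilinear interpolation. A minimal NumPy sketch of the standard bilinear kernel (the helper name `bilinear_kernel` is mine, not from the paper):

```python
import numpy as np

def bilinear_kernel(size):
    """2-D bilinear interpolation kernel of shape (size, size).

    A transposed-convolution layer whose weights are set to this
    kernel performs bilinear upsampling; FCN uses such kernels to
    initialize its in-network upsampling layers.
    """
    factor = (size + 1) // 2
    center = factor - 1 if size % 2 == 1 else factor - 0.5
    og = np.ogrid[:size, :size]
    return ((1 - abs(og[0] - center) / factor) *
            (1 - abs(og[1] - center) / factor))

kernel = bilinear_kernel(4)   # a 4x4 kernel, as used for 2x upsampling
```

For 2× upsampling a 4×4 kernel with stride 2 is the usual choice; the kernel is symmetric and peaks near its center.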

From classifier to dense FCN

1. Discard the final classifier layer
2. Convert all fully connected layers to convolutions
3. Append a 1×1 convolution with channel dimension 21 to predict scores for the PASCAL classes
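Step 2 of this recipe works because a fully connected layer is just a convolution whose kernel covers its entire input. A hedged NumPy sketch of that equivalence (the naive `conv2d_valid` helper is mine, for illustration only):

```python
import numpy as np

def conv2d_valid(x, w):
    """Naive valid convolution, stride 1.
    x: (C, H, W), w: (Out, C, kh, kw) -> (Out, H-kh+1, W-kw+1)."""
    C, H, W = x.shape
    n_out, _, kh, kw = w.shape
    y = np.zeros((n_out, H - kh + 1, W - kw + 1))
    for o in range(n_out):
        for i in range(y.shape[1]):
            for j in range(y.shape[2]):
                y[o, i, j] = np.sum(x[:, i:i + kh, j:j + kw] * w[o])
    return y

rng = np.random.default_rng(0)
C, H, W, n_out = 3, 7, 7, 10
x = rng.standard_normal((C, H, W))
w_fc = rng.standard_normal((n_out, C * H * W))  # fully connected weights

# A fully connected layer on the flattened input...
y_fc = w_fc @ x.reshape(-1)

# ...equals a convolution whose kernel covers the whole input.
# Reshaped this way, the same weights can slide over larger inputs
# and emit a spatial map of scores instead of a single vector.
w_conv = w_fc.reshape(n_out, C, H, W)
y_conv = conv2d_valid(x, w_conv)
```

On a 7×7 input the converted layer produces a 1×1 output identical to the fully connected one; on a larger input it would produce a grid of scores.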

FCN for segmentation

Technique: combine the final prediction layer with lower layers that have finer strides

Predictions from the final layer alone are not detailed enough

Combining fine layers and coarse layers lets the model make local predictions that respect global structure.
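A toy sketch of this skip fusion (FCN-16s style). Nearest-neighbor upsampling stands in for the learned deconvolution, and the shapes and names are illustrative, not the paper's:

```python
import numpy as np

def upsample2x(scores):
    """Nearest-neighbor 2x upsampling of a (C, H, W) score map.
    (FCN learns this step as a deconvolution; nearest-neighbor is a
    stand-in to keep the sketch short.)"""
    return scores.repeat(2, axis=1).repeat(2, axis=2)

rng = np.random.default_rng(1)
n_classes = 21
coarse = rng.standard_normal((n_classes, 4, 4))  # e.g. final-layer scores
fine = rng.standard_normal((n_classes, 8, 8))    # e.g. scores from a pool4-like layer

# Fuse: upsample the coarse prediction and sum it with the prediction
# from the finer-stride layer; the result is then upsampled further
# to pixel resolution.
fused = upsample2x(coarse) + fine
```

The sum lets the fine layer correct local detail while the coarse layer supplies the global layout.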

Results

20% relative improvement in mean IoU

Inference 286 times faster (than SDS*)

*Simultaneous detection and segmentation. ECCV 2014


Conclusions

• “Fully convolutional” networks take inputs of any size

• Adapt AlexNet, VGG net, and GoogLeNet into fully convolutional networks

• Fine-tune them to the segmentation task

• New architecture: combines semantic info from coarse layer with info from shallow, fine layer

• 20% relative improvement to 62.2% mean IU (PASCAL VOC 2012)

Extras

Fully Convolutional Networks

x_ij - location (i, j) for a particular layer
y_ij - location (i, j) for the following layer
k - kernel (filter) size
s - stride
f_ks - determines the layer type:

- matrix multiplication for convolution or average pooling
- spatial max for max pooling
- elementwise nonlinearity for an activation function

- Kernel size and stride of this functional form obey the transformation rule above
- Computes a nonlinear filter (as opposed to a general nonlinear function)
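The transformation rule referenced here is, in the paper's notation, f_ks ∘ g_k's' = (f ∘ g)_{k'+(k−1)s', ss'}. A quick numeric sanity check of that rule (helper names are mine):

```python
def out_size(n, k, s):
    """Spatial output size of a valid k-by-k, stride-s layer on a size-n input."""
    return (n - k) // s + 1

def compose(k_inner, s_inner, k_outer, s_outer):
    """Effective (kernel, stride) of applying the inner layer first,
    then the outer layer: kernel = k' + (k - 1)s', stride = s * s'."""
    return k_inner + (k_outer - 1) * s_inner, s_inner * s_outer

n = 224
# e.g. a 3x3 stride-2 conv followed by a 2x2 stride-2 pool
k_eff, s_eff = compose(3, 2, 2, 2)
two_layers = out_size(out_size(n, 3, 2), 2, 2)
one_layer = out_size(n, k_eff, s_eff)
```

Applying the two layers in sequence and applying the single composed filter give the same output size, which is what lets the whole net be treated as one nonlinear filter (a "deep filter").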

FCN naturally operates on an input of any size!

Loss Function

• Loss function is a sum over the spatial dimensions of the final layer

• Gradient of the loss function is a sum over the gradients of each of its spatial components

• Treats all of the final layer's receptive fields as a minibatch
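In symbols, as in the paper: when the loss decomposes over the final layer's spatial positions,

```latex
\ell(x;\theta) = \sum_{ij} \ell'(x_{ij};\theta)
\quad\Longrightarrow\quad
\frac{\partial \ell}{\partial \theta} = \sum_{ij} \frac{\partial \ell'}{\partial \theta}(x_{ij};\theta)
```

so whole-image gradient descent equals patchwise descent over all of the final layer's receptive fields taken as a minibatch.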

The receptive fields overlap significantly, so computing the net layer by layer over the whole image is more efficient than evaluating it patch by patch.

More than 5 times faster than the naive patchwise AlexNet evaluation

Spatial output maps are a natural fit for dense problems like semantic segmentation
