Deep Learning: AI Breakthrough

Mohsen Fayyaz

Sensifai

Tehran University – 15 Dey 1395 (4 Jan 2017)

Video Processing and Deep Learning

What is Video?

• Batches of Frames

• Can we process video as batches of frames?

Motion cannot be inferred from a single frame
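The point above can be illustrated with a toy example. The sketch below assumes two tiny grayscale frames stored as nested lists (hypothetical data, not from the slides) in which one bright pixel moves one step to the right. Either frame alone only shows where the pixel is; the motion appears only when the frames are compared.

```python
# Toy illustration: motion is only visible when frames are compared.
# The frames and helper names are hypothetical, chosen for illustration.

def frame_difference(prev_frame, next_frame):
    """Per-pixel difference between two equally sized grayscale frames."""
    return [
        [n - p for p, n in zip(prev_row, next_row)]
        for prev_row, next_row in zip(prev_frame, next_frame)
    ]


def bright_pixel_column(frame):
    """Column index of the brightest pixel in a frame."""
    best = max(
        ((value, col) for row in frame for col, value in enumerate(row)),
        key=lambda pair: pair[0],
    )
    return best[1]


frame_t0 = [[0, 0, 0, 0],
            [0, 9, 0, 0],
            [0, 0, 0, 0]]
frame_t1 = [[0, 0, 0, 0],
            [0, 0, 9, 0],
            [0, 0, 0, 0]]

# Temporal difference: non-zero only where something changed between frames.
diff = frame_difference(frame_t0, frame_t1)

# Horizontal displacement of the bright pixel between the two frames.
displacement = bright_pixel_column(frame_t1) - bright_pixel_column(frame_t0)
```

Neither `frame_t0` nor `frame_t1` carries the displacement on its own, which is why processing frames independently loses motion information.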

Why do we need video processing?

• Self-Driving Cars: Video Semantic Segmentation

Feature Space Optimization for Semantic Video Segmentation, Kundu et al., 2016

• Robots: Action Recognition

Simonyan et al., 2014

• Google, YouTube, Aparat: Video Tagging

DenseCap, Johnson et al., 2016 (image captioning)

• Network Video Broadcasting: Frame Prediction

Patraucean et al., 2016

From Images to Video

Image -> Extracted Features

Video -> Frames -> Extracted Features

Frames -> Extracted Spatio-Temporal Features

Donahue et al., 2015

What if we want regional features?

From Images to Video – STFCN

Frames -> Extracted Regional Spatio-Temporal Features

Convolutional LSTM

Fayyaz et al., 2016

From Images to Video – C3D

Frames -> Extracted Regional Spatio-Temporal Features

Tran et al., 2015
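C3D is built from 3D convolutions, whose kernels span time as well as height and width. The sketch below is a minimal single-channel 3D convolution in plain Python, not the C3D architecture itself; the function name, the toy clip, and the kernel are assumptions made for illustration. As in most deep learning frameworks, it computes a cross-correlation with no padding.

```python
# Minimal single-channel 3D convolution (cross-correlation, no padding).
# Illustrative only: names, clip values, and kernel are hypothetical.

def conv3d_valid(clip, kernel):
    """Slide a (kt, kh, kw) kernel over a (T, H, W) clip of nested lists."""
    T, H, W = len(clip), len(clip[0]), len(clip[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                acc = 0.0
                # The kernel covers several frames at once, so the response
                # mixes spatial and temporal information.
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += clip[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


# Three 2x2 frames whose brightness rises over time: 0, then 1, then 3.
clip = [
    [[0, 0], [0, 0]],
    [[1, 1], [1, 1]],
    [[3, 3], [3, 3]],
]

# A 2x1x1 kernel that subtracts the earlier frame from the later one.
temporal_kernel = [[[-1.0]], [[1.0]]]

response = conv3d_valid(clip, temporal_kernel)
```

Here `response` has two time steps, one for each pair of consecutive frames, and each value is the brightness change at that pixel. A 2D kernel applied per frame could not produce this response.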

Now that we have the appropriate tools, let’s see some real-world applications

Video Semantic Segmentation – STFCN

Fayyaz et al., 2016

Video Semantic Segmentation – C3D

Tran et al., 2015

Action Recognition & Video Classification

Simonyan et al., 2014

Does video have visual data only?

Action Recognition & Video Classification

Wu et al., 2015

Vision

Let’s briefly look at some state-of-the-art image-based networks

Extremely Deep Networks

Residual Networks

• Problem: Gradients Vanish in Back-propagation

• Solution: Let’s make a shortcut for them!

• $Y = H(X, W_H) \;\rightarrow\; Y = H(X, W_H) + X$
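The shortcut in the formula above can be sketched in a few lines of plain Python. This is a minimal sketch, assuming a one-layer transform (a linear map followed by ReLU) for H; the helper names and weights are hypothetical, and real residual blocks use convolutions and normalization.

```python
# Minimal residual connection: Y = H(X, W_H) + X.
# H is assumed to be a single linear layer followed by ReLU (illustrative).

def linear(x, weights):
    """Multiply a weight matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def relu(v):
    return [max(0.0, a) for a in v]


def residual_block(x, weights):
    h = relu(linear(x, weights))              # H(X, W_H)
    return [hi + xi for hi, xi in zip(h, x)]  # shortcut adds X back


x = [1.0, -2.0]

# With all-zero weights H contributes nothing, so the block is the identity.
zero_weights = [[0.0, 0.0], [0.0, 0.0]]
identity_output = residual_block(x, zero_weights)

# With identity weights, H(X) = relu(X) is added on top of X.
identity_weights = [[1.0, 0.0], [0.0, 1.0]]
transformed_output = residual_block(x, identity_weights)
```

Because the input is always added back, the block can fall back to the identity, and the shortcut gives gradients a direct path to earlier layers.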

Highway Networks

• Similar to ResNets

• The shortcuts are controlled by a learnable gate that trades off transforming the input against carrying it through unchanged

• $Y = H(X, W_H) \cdot T(X, W_T) + X \cdot (1 - T(X, W_T))$
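The gated shortcut can be sketched as follows. This is a minimal sketch, assuming H is an affine map followed by tanh and T is an affine map followed by a sigmoid; the function names, weights, and biases are hypothetical values chosen to show the two extremes of the gate.

```python
import math

# Minimal highway layer: Y = H(X) * T(X) + X * (1 - T(X)).
# H = tanh(affine), T = sigmoid(affine); all parameters are illustrative.

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def affine(x, weights, bias):
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def highway_layer(x, w_h, b_h, w_t, b_t):
    h = [math.tanh(a) for a in affine(x, w_h, b_h)]  # transform H(X, W_H)
    t = [sigmoid(a) for a in affine(x, w_t, b_t)]    # gate T(X, W_T) in (0, 1)
    return [hi * ti + xi * (1.0 - ti) for hi, ti, xi in zip(h, t, x)]


x = [0.5, -1.0]
w_h = [[1.0, 0.0], [0.0, 1.0]]
b_h = [0.0, 0.0]
w_t = [[0.0, 0.0], [0.0, 0.0]]

# Strongly negative gate bias: T is close to 0, so the input is carried through.
carried = highway_layer(x, w_h, b_h, w_t, [-20.0, -20.0])

# Strongly positive gate bias: T is close to 1, so the output is close to H(X).
transformed = highway_layer(x, w_h, b_h, w_t, [20.0, 20.0])
```

A residual connection always adds the input back at full strength; here the network learns, per unit, how much of the input to carry through and how much to transform.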

DenseNets

• If ResNet works by connecting only to the previous layer, why not connect to all of them?

• $Y = F(X_n, X_{n-1}, \ldots, X_0)$

• Improvements in both the forward and backward passes
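The dense connectivity pattern can be sketched as follows: every layer receives the concatenation of the block input and all earlier layer outputs. This is a minimal sketch; `dense_block` and the toy layers are hypothetical, and real DenseNets use convolutional layers with batch normalization.

```python
# Minimal dense connectivity: layer n sees X_0, X_1, ..., X_{n-1} concatenated.
# Layers are plain callables on lists of floats (illustrative stand-ins).

def dense_block(x, layers):
    features = [x]  # X_0
    for layer in layers:
        # Concatenate every feature vector produced so far.
        concatenated = [v for feat in features for v in feat]
        features.append(layer(concatenated))
    # The block output is the concatenation of all feature vectors.
    return [v for feat in features for v in feat]


def sum_layer(inputs):
    """Toy layer that emits one new feature: the sum of its inputs."""
    return [sum(inputs)]


block_output = dense_block([1.0, 2.0], [sum_layer, sum_layer])
```

The first layer sees `[1.0, 2.0]` and the second sees `[1.0, 2.0, 3.0]`, so shallow features reach deeper layers directly, and gradients flow back along the same connections.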

Now what if we use the idea of propagating data and gradients between shallow and deep layers in video-based networks?

Up to here, everything was supervised. But there is a lot of data across the Internet with weak labels … Let’s go through weakly supervised methods

Weakly Supervised Learning

Weakly Supervised Learning with CNNs

• Multiple Labeling

• Weak Localization

• Data can be crawled from the Internet

• Can be adapted to Video

Oquab et al., 2015
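One way to get weak localization from image-level labels alone is global max pooling over per-class score maps: the maximum gives the image-level class score, and its position gives a rough object location. The sketch below follows that idea under stated assumptions; the score maps and the function name are hypothetical, and in practice a CNN would produce the maps.

```python
# Global max pooling over a class score map (illustrative sketch).
# Returns the image-level score and the position where it occurs.

def global_max_pool(score_map):
    best_score, best_pos = None, None
    for r, row in enumerate(score_map):
        for c, score in enumerate(row):
            if best_score is None or score > best_score:
                best_score, best_pos = score, (r, c)
    return best_score, best_pos


# Hypothetical 2x2 score maps for two classes on one image.
score_maps = {
    "cat": [[0.1, 0.2],
            [0.9, 0.3]],
    "dog": [[-0.5, -0.2],
            [-0.1, -0.4]],
}

predictions = {label: global_max_pool(m) for label, m in score_maps.items()}

# Multiple labels: keep every class whose image-level score is positive.
present = [label for label, (score, _) in predictions.items() if score > 0]
```

Only the pooled score needs an image-level label during training, which is why such data can be gathered at scale from weakly labeled sources.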

How about some Unsupervised methods …

Unsupervised Learning

Anticipating Visual Representations From Unlabeled Video

• Training on a huge amount of unlabeled video from across the Internet

• Training classifiers on the final output

Vondrick et al., 2016

Practical considerations

What Hardware do I use?

• NVIDIA GPU + SSD + HDD

• More info: http://www.DeepLearning.ir

What framework do I use?

• TensorFlow

• Theano

• Microsoft CNTK

• Deeplearning4j

What framework do I use?

TensorFlow, Torch, Theano

From Karpathy’s slides

Distributed Training:

To be covered in my next presentation at Sharif University of Technology on 22 Dey 1395 (11 Jan 2017)

From Karpathy’s slides

Thank You

Fayyaz@Sensifai.com
