Upload
nanoterach
View
218
Download
0
Embed Size (px)
Citation preview
8/10/2019 Emerging Methods of Computing
1/55
Emerging Methods of Computing
Luca Benini ETHZ
Wayne Burleson University of Massachusetts
Fabien Clermidy CEA-LETI
Enrico Macii Politecnico di Torino
Angel Rodriguez-Vazquez University of Sevilla
Chair:Yusuf Leblebici, EPFL
8/10/2019 Emerging Methods of Computing
2/55
Emerging Methods of Computing
?
8/10/2019 Emerging Methods of Computing
3/55
A million spiking-neuron integrated circuit with a scalable communication network and interface
Science 8 August 2014: Vol. 345 no. 6197 pp. 668-673
8/10/2019 Emerging Methods of Computing
4/55
Emerging Methods of Computing
8/10/2019 Emerging Methods of Computing
5/55
Architectures
Parallelism
Memory Bandwidth
3D Integration
Heat Management
8/10/2019 Emerging Methods of Computing
6/55
Luca Benini
ETHZ
8/10/2019 Emerging Methods of Computing
7/55
Ultra-low power computational sensing for next generation
"Internet of Everything" platformsLuca Benini
IIS-ETHZ & DEI-UNIBO
8/10/2019 Emerging Methods of Computing
8/55
How much energy to process (1 op. per Byte) one BB?
Computing for the IoT
8/10/2019 Emerging Methods of Computing
9/55
How efficient?
9
[RuchIBM11]
1012ops/J
1pJ/op
1GOPS/mW
1Brontobyte (1027)/Year (single op) with ''just'' 10MW!
CNN scene analysis (SoA)
127 kOp/pixel 4k needs 1TOp
8/10/2019 Emerging Methods of Computing
10/55
pJ/op Parallel ULP Computing
10
pJ/op is traditionally the target of ASIC + uCntr
Scalable: many-core + heterogeneity
Programmable: OpenMP, OpenCL, OpenVX
Open: Software& HW
Best-in-class LP silicon technology (partner foundries!)
PULP
1.Near-threshold operation
2.Instant-on-circuits3.Heterogeneous Architecture
4.Volume (3D) integration
4 Strategic areas
8/10/2019 Emerging Methods of Computing
11/55
PULP v1First Demonstrator
11
First Multi-core
chip in28nmRVTFDSOI (STM free silicon
program)
4 Open RISC Cores
6 Dynamicallycontrollable Body Bias
islands.
Both Reverse and
Forward Body Bias
Functional 0.45V
1.20V
Tested and
Characterized (July-
August 14)
Core area: 1.5mm2
First-time Good silicon
8/10/2019 Emerging Methods of Computing
12/55
PulpV2 - Getting closer
PULP
commercial ARM
processors
commercial
low-power MCUs
multicore ULP
platforms
sub-threshold
MCUs
near-threshold
MCUs
211 GOPS/W
power scalable by ~2000x many operatingpoints provided by
VBB and VDDknobs
perform
ancescalableby~3
00x
How to close the
x5 gap?
How to extend
the range to sub
MOPS?
8/10/2019 Emerging Methods of Computing
13/55
Hybrid + Approximate Memory
13
8/10/2019 Emerging Methods of Computing
14/55
Neural Processing Accelerator
14Departement Informationstechnologie und Elektrotechnik
8/10/2019 Emerging Methods of Computing
15/55
1GBps @ 2mW one IO word every cycle at
250MHz SIP+die stacking option for large+low
cost memories + sensors becomes viable
ULP Phy for 3D Integration
A 0.45-0.7V 1-6Gb/s 0.29-0.58pJ/bit Source Synchronous Transceiver
Using Automatic Phase Calibration in 65nm CMOS (0.15mm2)
15Departement Informationstechnologie und Elektrotechnik
On 36-inch SMA cable
8/10/2019 Emerging Methods of Computing
16/55
Multithermand AdG
Multiscale Thermal Management of Computing Systems
8/10/2019 Emerging Methods of Computing
17/55
Wayne Burleson
University of Massachusetts
8/10/2019 Emerging Methods of Computing
18/55
From Nano to Exa:Re-thinkingData Representations and Data Movement
Wayne Burleson
U. Massachusetts, Amherst
AMD Research, Boston
Image courtesy US Dept of Energy
Image courtesy nlp.stanford.edu
8/10/2019 Emerging Methods of Computing
19/55
Energy-efficient computing
across scales, DARPA
8/10/2019 Emerging Methods of Computing
20/55
Exascale High Performance Computing
Images courtesy US Dept of Energy, Advanced Scientific Computing
Scientific computation Weather
Combustion
Materials
Energy
Genetics Evolving codebase
Big-data
Big-compute
Future computations
Graphs Multimedia
Data Analytics
8/10/2019 Emerging Methods of Computing
21/55
Public-private partnerships
for Exascale Research (2014)
8/10/2019 Emerging Methods of Computing
22/55
Data representations Modeling the analogworld
Physics
Chemistry
Biology
Modeling the virtual world Graphs, Relationships,
Social, Behavioral, Economic
Metrics, Costs, Risk
Modeling the observedworld Media and Signal Processing
Neuromorphic
Approximate
Data representations Data types, Floating point, Graphics, Integer,
Standards: IEEE 754
Sampling: Adaptive Mesh, Compressive sensing
Arithmetic and Numerical effects
Software development and validation
Libraries: BLAS, LINPACK, MATLAB, Speech processing
Combustion turbulence
Financial Risks (2008)
8/10/2019 Emerging Methods of Computing
23/55
2011
, 2011
Data movement
8/10/2019 Emerging Methods of Computing
24/55
Data movement
On-chip interconnect Caches: L2 and L3 on-die, CPU-GPU shared memory,
coherence,
Networks on Chip (circuits, architecture, protocols)
Off-chip
Fast interfaces: electrical/optical Non-volatile memory, Multi-level memory
Die-stacked memory, 2.5D, 3D
Compression, encryption, coding,
Memory allocation, management Interprocessor communication - MPI
Work-flow management, avoid disk/file system wherepossible eg HPC: combine simulation, analysis, visualization
8/10/2019 Emerging Methods of Computing
25/55
Vision: Heterogeneous Systems Era
8/10/2019 Emerging Methods of Computing
26/55
Heterogeneous System Architecture
(HSA) OpenEcosystem!
8/10/2019 Emerging Methods of Computing
27/55
The HSA Vision
8/10/2019 Emerging Methods of Computing
28/55
CPUs, GPUs, and Accelerators:Apple A8 SoC
28
Out-of-Core
Accelerators
Maltiel Consulting
estimates
Our estimates
From David Brooks, Harvard, 2014
8/10/2019 Emerging Methods of Computing
29/55
Lessons Learned
From Exa to Nano
Heterogeneous Compute, GPU-compute, Co-design
Data Movement: New Memory archs, NoC, Resiliency
Open SW Systems, MPI + X,
From Nano to Exa
Dedicated DSP, GPU and other accelerators
Customized data representations
Low-power design: Near-threshold, Leakage control, DVFS
Software development (HW/SW contracts) Data representations: types/arithmetic
Memory models, Parallel programming
8/10/2019 Emerging Methods of Computing
30/55
Fabien Clermidy
CEA-LETI
8/10/2019 Emerging Methods of Computing
31/55
&
www.cea.fr
Fabien Clermidy, PhD
Head of Digital Architecture &Design Lab
Computing goes 3D
8/10/2019 Emerging Methods of Computing
32/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |32& Dec. 2014
Technology versus application
Energy efficient
High performance
Small area
General
purpose
Dedicated
Units
Scalable
Flexible
AdaptableLow NRE cost
8/10/2019 Emerging Methods of Computing
33/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |33& Dec. 2014
3D Technologies
3D stacking
Heterogeneous integrationYield improvement
Servers, big data
Monolithic 3D IntegrationReduce Silicon footprint
Reduce routing wirelength
Low-power & high performance
BEOL NVMLogic in memory designs
Non-volatility for normally-off IoT
8/10/2019 Emerging Methods of Computing
34/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |34& Dec. 2014
3D Stacking
$0
$50
$100
$150
$200
$250
Cost
Defect Density (28nm)
Cost Comparison
2D die
3D-CHIPLET (Passive
65nm)
3D-CHIPLET (Active 65nm)
3D-partitionning (Active
65nm-20%)
8/10/2019 Emerging Methods of Computing
35/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |35& Dec. 2014
3D many-core interposer
8/10/2019 Emerging Methods of Computing
36/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |36& Dec. 2014
3DMI vs. Traditional Scaling
Case study: FPGA partitioning on 2/3/4 layers
Comparison between 3DMI FDSOI 14nm (available) and ITRS
FinFET 10 & 7 nm (predictive)
8/10/2019 Emerging Methods of Computing
37/55
Cliquez pour modifier le style du titre
Computing goes 3D CEA. All rights reserved |37& Dec. 2014
RRAM
Features:
BEOL technology => low footprintNon-volatile => no leakage
ApplicationsMemory-in-logic => IoT
High density crossbars => Big data
0
20
40
60
80
100
020406080100
PowerGain(%)
Activity rate (%)
alu4apex2
apex4ex1010
ex5pmisex3
pdcseqAverage
OxRAM[Chen.IEDM 2009]
PCRAM[Servalli.IEDM 2009]
CBRAM[Palma.TED 2014]
Ron 1k 10k 4.2k
Roff 100k 2M 10G
Ileakage 15A 0.75A 0.15nA
8/10/2019 Emerging Methods of Computing
38/55
Centre de Grenoble
17 rue des Martyrs
38054 Grenoble Cedex
Centre de Saclay
Nano-Innov PC 17291191 Gif sur Yvette [email protected]
8/10/2019 Emerging Methods of Computing
39/55
Enrico Macii
Politecnico di Torino
8/10/2019 Emerging Methods of Computing
40/55
Neuromorphic Computing Models:Optimization of Multicore Neuromorphic Computing Platforms
Enrico Macii
Dip. di Automatica e Informatica
Brain like Computing Systems
8/10/2019 Emerging Methods of Computing
41/55
41Enrico Macii ([email protected])
Neuromorphic HW devices/platforms (artificial brains): Densely interconnected multicoresbrain emulation (NM-MC)
New devices implementing HW neuron models(e.g., memristors)
Potential: Innovative brain-inspired computing methods
Real-time emulation of (part of) the human brain(not achievable with SW neuron simulators)
Challenges: Brain simulation/emulation capability is currently limited to
a few thousands of neurons (flys brain)
Support neural network features (e.g., plasticity)
Focus on NeuroMorphic-MultiCores (NM-MC) NM HW @ HBP:
BrainScaleS (U. Heidelberg)
SpiNNaker (U. Manchester)
Brain-like Computing Systems
NM MC
8/10/2019 Emerging Methods of Computing
42/55
42Enrico Macii ([email protected])
Define neuron models
Define neuron networks and populations
Perform neuron activity simulationAllocate populations to cores
(max ~100 per core)
NM-MC
Conceptual Scheme
8/10/2019 Emerging Methods of Computing
43/55
43Enrico Macii ([email protected])
Spiking neural network
Splitting and Mapping Engine
Computation (fascicle) Communication (spikes)
Mapping neurons to maximize
neurons per core
Profile neurons communication
activities (neurons connections)optimization
We are currently looking at optimizing:
Mapping of neurons on NM-cores depending on their activity (spiking rate)
Network usage by communicationaware neuron population splitting and allocation
Profile neurons activity
(e.g. spiking rate)
Mapping neurons to reduce
packet transmission
Conceptual Scheme
Discussion
8/10/2019 Emerging Methods of Computing
44/55
44Enrico Macii ([email protected])
Existing knowledge of brain functions is used to design an affordable
supercomputer that can itself serve as a tool to investigate brain
behavior
and that it can contribute to a fundamental, biological
understanding of how the brain works.
However, this research has possible impact outside brain simulation:
Design brain interfaces for robotics
Provide insights into specific properties of the different hardware architectures
Explore non-von Neumann computing outside the realm of brain-science
Develop resource optimization techniques (efficient on-chip/off-chip, energy
efficient computation) for densely interconnected multi-core systems
Discussion
8/10/2019 Emerging Methods of Computing
45/55
Angel Rodriguez-Vazquez
University of Sevilla
VISION BEYOND IMAGING
8/10/2019 Emerging Methods of Computing
46/55
ANAFOCUS
VISIONBEYONDIMAGING
Hybrid Cellular Architectures
for
High-Sensitivity, High-Speed, High-ResolutionVision Systems
with
Reduced SWaP
Angel Rodrguez-VzquezInstitute of Microelectronics of Seville-SPAIN
University of Seville
Spanish Council of Research (CSIC)[email protected], [email protected]
Trends of MicroElectronic Imaging Systems
8/10/2019 Emerging Methods of Computing
47/55
Reducing the PIXELSIZE
even despite diffraction limits
< 1mm @ 4T-pixels with transistor sharing
Increasing the pixel FILLFACTOR
Use of Back Side Illumination technologies
Increasing the PIXELCOUNT
> 10Mpixels
Increasing READOUTSPEEDANDDATATRANSFER
> 10Gpixel/sec
Increasing READ-OUTACCURACY< 1e-noise > 150dB DR Image acquisition
IncreasingSYSTEMEMBEDDING
Single Chip Camera
Trends of MicroElectronic Imaging Systems
Typ ical CMOS Image Sensor A rch itectu re
8/10/2019 Emerging Methods of Computing
48/55
Typ ical CMOS Image Sensor A rch itectu re
LensDigital Image Sensor Chip
DataProcessing
Memorybank
Camera Control block ControlInterface
DataInterfa
ce
clk
_ext
Power
Image
Interface
ControlInterface
ADC
Image
Processor
Column Amplifiers
Pixel ArraySensor
Control
PLLAnalogue References
FRAME-BASEDCONCEPT; clear border between sensing and processing
IS THIS ADEQUATE FOR VISION ??
Single chip embedding
Vision and Imag ing have Different Goals !!
8/10/2019 Emerging Methods of Computing
49/55
VISION MAINOBJECTIVE: scene understanding
IMAGING MAINOBJECTIVE: image quality
49
Capture Demosaicing Exposure ctrl. Noise removal Output
Capture Noise removal Edge filteringObjectrecognition
Feature
description
Vision and Imag ing have Different Goals !!
Data Reduct ion in the Vision Processing Chain
8/10/2019 Emerging Methods of Computing
50/55
Abstraction level(Data structure complexity)
D
atadimensionality
(
No.ofobjects)
Image capture
Spatial filtering
Edge/Motiondetection
Regular flow
High computationaldemand
Decision making
Algorithm control
Conditionaljumps
Irregular flow
Moderate demand
Image segmentation
Object labeling
Feature extraction
Less regular flow
Lower demand
50
Data Reduct ion in the Vision Processing Chain
Conventional sensor border
Vision sensor
border
ImagerSmart
8/10/2019 Emerging Methods of Computing
51/55
F
Imager
Low-leveltasksADC
F
Mid-leveltasks
High-leveltasks
Microprocessor
f f
Vision co-processor
F
. . .
. . .
Smart
f
Information Flow:F >> f > f
Sensor Memory
Mixed-signal
processor
SIMDCellular Nonlinear Networks provide a
computation framework for that
Eye-RIS: A Representative Industrial
8/10/2019 Emerging Methods of Computing
52/55
ANAFOCUS
Eye RIS: A Representative IndustrialSIMD-CNN Vision Systems
ImageAcquisition
In-pixel ImageProcessing
In-pixel ImageMemory
ImageProcessor
Coordinates
Processor
Image post-processing
and control
Data andprogrammemory
MLP
SIS Q-Eye DIPRISC Processor
DDR
I/O and
Communications
Q-eye
Controller
I/O
Interface
8/10/2019 Emerging Methods of Computing
53/55
Some Open Points for Discussion
How to achieve convergence with mainstreamindustrial approaches ??
Hardware-software standards
Which can kind of data encoding will be used for
sensor-processor outcomes ??
only spatial-temporal events
data features
Is it possible to derive a reduced set of data features
which constitute an universal basis for vision tasks ??
ad-hoc sensor design constraint industrialdeployment
May conventional frame-based architectures confront
the challenges of vision ??
ANALOG
8/10/2019 Emerging Methods of Computing
54/55
Digital
VLSI DataTiming
Supply, etc
ANALOG
FRONT-END
GRAY SCONCEPT
Driven by Speed, Accuracy
Calibration and
Correction
MEAD SCONCEPT, BIO-INSPIREDDriven by Power Consumption
Analog
Processing
AnalogVLSI
MASSIVE DATA
SENSORS
Analog
VLSI
ANALOG
FRONT-END
Mixed-Signal
VLSI
8/10/2019 Emerging Methods of Computing
55/55
Emerging Methods of Computing
Luca Benini ETHZ
Wayne Burleson University of Massachusetts
Fabien Clermidy CEA-LETI
Enrico Macii Politecnico di TorinoAngel Rodriguez-Vazquez University of Sevilla
Chair:Yusuf Leblebici, EPFL