Emerging Methods of Computing

Embed Size (px)

Citation preview

  • 8/10/2019 Emerging Methods of Computing

    1/55

    Emerging Methods of Computing

    Luca Benini ETHZ

    Wayne Burleson University of Massachusetts

    Fabien Clermidy CEA-LETI

    Enrico Macii Politecnico di Torino

    Angel Rodriguez-Vazquez University of Sevilla

    Chair:Yusuf Leblebici, EPFL

  • 8/10/2019 Emerging Methods of Computing

    2/55

    Emerging Methods of Computing

    ?

  • 8/10/2019 Emerging Methods of Computing

    3/55

    A million spiking-neuron integrated circuit with a scalable communication network and interface

    Science 8 August 2014: Vol. 345 no. 6197 pp. 668-673

  • 8/10/2019 Emerging Methods of Computing

    4/55

    Emerging Methods of Computing

  • 8/10/2019 Emerging Methods of Computing

    5/55

    Architectures

    Parallelism

    Memory Bandwidth

    3D Integration

    Heat Management

  • 8/10/2019 Emerging Methods of Computing

    6/55

    Luca Benini

    ETHZ

  • 8/10/2019 Emerging Methods of Computing

    7/55

    Ultra-low power computational sensing for next generation

    "Internet of Everything" platformsLuca Benini

    IIS-ETHZ & DEI-UNIBO

  • 8/10/2019 Emerging Methods of Computing

    8/55

    How much energy to process (1 op. per Byte) one BB?

    Computing for the IoT

  • 8/10/2019 Emerging Methods of Computing

    9/55

    How efficient?

    9

    [RuchIBM11]

    1012ops/J

    1pJ/op

    1GOPS/mW

    1Brontobyte (1027)/Year (single op) with ''just'' 10MW!

    CNN scene analysis (SoA)

    127 kOp/pixel 4k needs 1TOp

  • 8/10/2019 Emerging Methods of Computing

    10/55

    pJ/op Parallel ULP Computing

    10

    pJ/op is traditionally the target of ASIC + uCntr

    Scalable: many-core + heterogeneity

    Programmable: OpenMP, OpenCL, OpenVX

    Open: Software& HW

    Best-in-class LP silicon technology (partner foundries!)

    PULP

    1.Near-threshold operation

    2.Instant-on-circuits3.Heterogeneous Architecture

    4.Volume (3D) integration

    4 Strategic areas

  • 8/10/2019 Emerging Methods of Computing

    11/55

    PULP v1First Demonstrator

    11

    First Multi-core

    chip in28nmRVTFDSOI (STM free silicon

    program)

    4 Open RISC Cores

    6 Dynamicallycontrollable Body Bias

    islands.

    Both Reverse and

    Forward Body Bias

    Functional 0.45V

    1.20V

    Tested and

    Characterized (July-

    August 14)

    Core area: 1.5mm2

    First-time Good silicon

  • 8/10/2019 Emerging Methods of Computing

    12/55

    PulpV2 - Getting closer

    PULP

    commercial ARM

    processors

    commercial

    low-power MCUs

    multicore ULP

    platforms

    sub-threshold

    MCUs

    near-threshold

    MCUs

    211 GOPS/W

    power scalable by ~2000x many operatingpoints provided by

    VBB and VDDknobs

    perform

    ancescalableby~3

    00x

    How to close the

    x5 gap?

    How to extend

    the range to sub

    MOPS?

  • 8/10/2019 Emerging Methods of Computing

    13/55

    Hybrid + Approximate Memory

    13

  • 8/10/2019 Emerging Methods of Computing

    14/55

    Neural Processing Accelerator

    14Departement Informationstechnologie und Elektrotechnik

  • 8/10/2019 Emerging Methods of Computing

    15/55

    1GBps @ 2mW one IO word every cycle at

    250MHz SIP+die stacking option for large+low

    cost memories + sensors becomes viable

    ULP Phy for 3D Integration

    A 0.45-0.7V 1-6Gb/s 0.29-0.58pJ/bit Source Synchronous Transceiver

    Using Automatic Phase Calibration in 65nm CMOS (0.15mm2)

    15Departement Informationstechnologie und Elektrotechnik

    On 36-inch SMA cable

  • 8/10/2019 Emerging Methods of Computing

    16/55

    Multithermand AdG

    Multiscale Thermal Management of Computing Systems

  • 8/10/2019 Emerging Methods of Computing

    17/55

    Wayne Burleson

    University of Massachusetts

  • 8/10/2019 Emerging Methods of Computing

    18/55

    From Nano to Exa:Re-thinkingData Representations and Data Movement

    Wayne Burleson

    U. Massachusetts, Amherst

    AMD Research, Boston

    Image courtesy US Dept of Energy

    Image courtesy nlp.stanford.edu

  • 8/10/2019 Emerging Methods of Computing

    19/55

    Energy-efficient computing

    across scales, DARPA

  • 8/10/2019 Emerging Methods of Computing

    20/55

    Exascale High Performance Computing

    Images courtesy US Dept of Energy, Advanced Scientific Computing

    Scientific computation Weather

    Combustion

    Materials

    Energy

    Genetics Evolving codebase

    Big-data

    Big-compute

    Future computations

    Graphs Multimedia

    Data Analytics

  • 8/10/2019 Emerging Methods of Computing

    21/55

    Public-private partnerships

    for Exascale Research (2014)

  • 8/10/2019 Emerging Methods of Computing

    22/55

    Data representations Modeling the analogworld

    Physics

    Chemistry

    Biology

    Modeling the virtual world Graphs, Relationships,

    Social, Behavioral, Economic

    Metrics, Costs, Risk

    Modeling the observedworld Media and Signal Processing

    Neuromorphic

    Approximate

    Data representations Data types, Floating point, Graphics, Integer,

    Standards: IEEE 754

    Sampling: Adaptive Mesh, Compressive sensing

    Arithmetic and Numerical effects

    Software development and validation

    Libraries: BLAS, LINPACK, MATLAB, Speech processing

    Combustion turbulence

    Financial Risks (2008)

  • 8/10/2019 Emerging Methods of Computing

    23/55

    2011

    , 2011

    Data movement

  • 8/10/2019 Emerging Methods of Computing

    24/55

    Data movement

    On-chip interconnect Caches: L2 and L3 on-die, CPU-GPU shared memory,

    coherence,

    Networks on Chip (circuits, architecture, protocols)

    Off-chip

    Fast interfaces: electrical/optical Non-volatile memory, Multi-level memory

    Die-stacked memory, 2.5D, 3D

    Compression, encryption, coding,

    Memory allocation, management Interprocessor communication - MPI

    Work-flow management, avoid disk/file system wherepossible eg HPC: combine simulation, analysis, visualization

  • 8/10/2019 Emerging Methods of Computing

    25/55

    Vision: Heterogeneous Systems Era

  • 8/10/2019 Emerging Methods of Computing

    26/55

    Heterogeneous System Architecture

    (HSA) OpenEcosystem!

  • 8/10/2019 Emerging Methods of Computing

    27/55

    The HSA Vision

  • 8/10/2019 Emerging Methods of Computing

    28/55

    CPUs, GPUs, and Accelerators:Apple A8 SoC

    28

    Out-of-Core

    Accelerators

    Maltiel Consulting

    estimates

    Our estimates

    From David Brooks, Harvard, 2014

  • 8/10/2019 Emerging Methods of Computing

    29/55

    Lessons Learned

    From Exa to Nano

    Heterogeneous Compute, GPU-compute, Co-design

    Data Movement: New Memory archs, NoC, Resiliency

    Open SW Systems, MPI + X,

    From Nano to Exa

    Dedicated DSP, GPU and other accelerators

    Customized data representations

    Low-power design: Near-threshold, Leakage control, DVFS

    Software development (HW/SW contracts) Data representations: types/arithmetic

    Memory models, Parallel programming

  • 8/10/2019 Emerging Methods of Computing

    30/55

    Fabien Clermidy

    CEA-LETI

  • 8/10/2019 Emerging Methods of Computing

    31/55

    &

    www.cea.fr

    Fabien Clermidy, PhD

    Head of Digital Architecture &Design Lab

    Computing goes 3D

  • 8/10/2019 Emerging Methods of Computing

    32/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |32& Dec. 2014

    Technology versus application

    Energy efficient

    High performance

    Small area

    General

    purpose

    Dedicated

    Units

    Scalable

    Flexible

    AdaptableLow NRE cost

  • 8/10/2019 Emerging Methods of Computing

    33/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |33& Dec. 2014

    3D Technologies

    3D stacking

    Heterogeneous integrationYield improvement

    Servers, big data

    Monolithic 3D IntegrationReduce Silicon footprint

    Reduce routing wirelength

    Low-power & high performance

    BEOL NVMLogic in memory designs

    Non-volatility for normally-off IoT

  • 8/10/2019 Emerging Methods of Computing

    34/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |34& Dec. 2014

    3D Stacking

    $0

    $50

    $100

    $150

    $200

    $250

    Cost

    Defect Density (28nm)

    Cost Comparison

    2D die

    3D-CHIPLET (Passive

    65nm)

    3D-CHIPLET (Active 65nm)

    3D-partitionning (Active

    65nm-20%)

  • 8/10/2019 Emerging Methods of Computing

    35/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |35& Dec. 2014

    3D many-core interposer

  • 8/10/2019 Emerging Methods of Computing

    36/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |36& Dec. 2014

    3DMI vs. Traditional Scaling

    Case study: FPGA partitioning on 2/3/4 layers

    Comparison between 3DMI FDSOI 14nm (available) and ITRS

    FinFET 10 & 7 nm (predictive)

  • 8/10/2019 Emerging Methods of Computing

    37/55

    Cliquez pour modifier le style du titre

    Computing goes 3D CEA. All rights reserved |37& Dec. 2014

    RRAM

    Features:

    BEOL technology => low footprintNon-volatile => no leakage

    ApplicationsMemory-in-logic => IoT

    High density crossbars => Big data

    0

    20

    40

    60

    80

    100

    020406080100

    PowerGain(%)

    Activity rate (%)

    alu4apex2

    apex4ex1010

    ex5pmisex3

    pdcseqAverage

    OxRAM[Chen.IEDM 2009]

    PCRAM[Servalli.IEDM 2009]

    CBRAM[Palma.TED 2014]

    Ron 1k 10k 4.2k

    Roff 100k 2M 10G

    Ileakage 15A 0.75A 0.15nA

  • 8/10/2019 Emerging Methods of Computing

    38/55

    Centre de Grenoble

    17 rue des Martyrs

    38054 Grenoble Cedex

    Centre de Saclay

    Nano-Innov PC 17291191 Gif sur Yvette [email protected]

  • 8/10/2019 Emerging Methods of Computing

    39/55

    Enrico Macii

    Politecnico di Torino

  • 8/10/2019 Emerging Methods of Computing

    40/55

    Neuromorphic Computing Models:Optimization of Multicore Neuromorphic Computing Platforms

    Enrico Macii

    Dip. di Automatica e Informatica

    Brain like Computing Systems

  • 8/10/2019 Emerging Methods of Computing

    41/55

    41Enrico Macii ([email protected])

    Neuromorphic HW devices/platforms (artificial brains): Densely interconnected multicoresbrain emulation (NM-MC)

    New devices implementing HW neuron models(e.g., memristors)

    Potential: Innovative brain-inspired computing methods

    Real-time emulation of (part of) the human brain(not achievable with SW neuron simulators)

    Challenges: Brain simulation/emulation capability is currently limited to

    a few thousands of neurons (flys brain)

    Support neural network features (e.g., plasticity)

    Focus on NeuroMorphic-MultiCores (NM-MC) NM HW @ HBP:

    BrainScaleS (U. Heidelberg)

    SpiNNaker (U. Manchester)

    Brain-like Computing Systems

    NM MC

  • 8/10/2019 Emerging Methods of Computing

    42/55

    42Enrico Macii ([email protected])

    Define neuron models

    Define neuron networks and populations

    Perform neuron activity simulationAllocate populations to cores

    (max ~100 per core)

    NM-MC

    Conceptual Scheme

  • 8/10/2019 Emerging Methods of Computing

    43/55

    43Enrico Macii ([email protected])

    Spiking neural network

    Splitting and Mapping Engine

    Computation (fascicle) Communication (spikes)

    Mapping neurons to maximize

    neurons per core

    Profile neurons communication

    activities (neurons connections)optimization

    We are currently looking at optimizing:

    Mapping of neurons on NM-cores depending on their activity (spiking rate)

    Network usage by communicationaware neuron population splitting and allocation

    Profile neurons activity

    (e.g. spiking rate)

    Mapping neurons to reduce

    packet transmission

    Conceptual Scheme

    Discussion

  • 8/10/2019 Emerging Methods of Computing

    44/55

    44Enrico Macii ([email protected])

    Existing knowledge of brain functions is used to design an affordable

    supercomputer that can itself serve as a tool to investigate brain

    behavior

    and that it can contribute to a fundamental, biological

    understanding of how the brain works.

    However, this research has possible impact outside brain simulation:

    Design brain interfaces for robotics

    Provide insights into specific properties of the different hardware architectures

    Explore non-von Neumann computing outside the realm of brain-science

    Develop resource optimization techniques (efficient on-chip/off-chip, energy

    efficient computation) for densely interconnected multi-core systems

    Discussion

  • 8/10/2019 Emerging Methods of Computing

    45/55

    Angel Rodriguez-Vazquez

    University of Sevilla

    VISION BEYOND IMAGING

  • 8/10/2019 Emerging Methods of Computing

    46/55

    ANAFOCUS

    VISIONBEYONDIMAGING

    Hybrid Cellular Architectures

    for

    High-Sensitivity, High-Speed, High-ResolutionVision Systems

    with

    Reduced SWaP

    Angel Rodrguez-VzquezInstitute of Microelectronics of Seville-SPAIN

    University of Seville

    Spanish Council of Research (CSIC)[email protected], [email protected]

    Trends of MicroElectronic Imaging Systems

  • 8/10/2019 Emerging Methods of Computing

    47/55

    Reducing the PIXELSIZE

    even despite diffraction limits

    < 1mm @ 4T-pixels with transistor sharing

    Increasing the pixel FILLFACTOR

    Use of Back Side Illumination technologies

    Increasing the PIXELCOUNT

    > 10Mpixels

    Increasing READOUTSPEEDANDDATATRANSFER

    > 10Gpixel/sec

    Increasing READ-OUTACCURACY< 1e-noise > 150dB DR Image acquisition

    IncreasingSYSTEMEMBEDDING

    Single Chip Camera

    Trends of MicroElectronic Imaging Systems

    Typ ical CMOS Image Sensor A rch itectu re

  • 8/10/2019 Emerging Methods of Computing

    48/55

    Typ ical CMOS Image Sensor A rch itectu re

    LensDigital Image Sensor Chip

    DataProcessing

    Memorybank

    Camera Control block ControlInterface

    DataInterfa

    ce

    clk

    _ext

    Power

    Image

    Interface

    ControlInterface

    ADC

    Image

    Processor

    Column Amplifiers

    Pixel ArraySensor

    Control

    PLLAnalogue References

    FRAME-BASEDCONCEPT; clear border between sensing and processing

    IS THIS ADEQUATE FOR VISION ??

    Single chip embedding

    Vision and Imag ing have Different Goals !!

  • 8/10/2019 Emerging Methods of Computing

    49/55

    VISION MAINOBJECTIVE: scene understanding

    IMAGING MAINOBJECTIVE: image quality

    49

    Capture Demosaicing Exposure ctrl. Noise removal Output

    Capture Noise removal Edge filteringObjectrecognition

    Feature

    description

    Vision and Imag ing have Different Goals !!

    Data Reduct ion in the Vision Processing Chain

  • 8/10/2019 Emerging Methods of Computing

    50/55

    Abstraction level(Data structure complexity)

    D

    atadimensionality

    (

    No.ofobjects)

    Image capture

    Spatial filtering

    Edge/Motiondetection

    Regular flow

    High computationaldemand

    Decision making

    Algorithm control

    Conditionaljumps

    Irregular flow

    Moderate demand

    Image segmentation

    Object labeling

    Feature extraction

    Less regular flow

    Lower demand

    50

    Data Reduct ion in the Vision Processing Chain

    Conventional sensor border

    Vision sensor

    border

    ImagerSmart

  • 8/10/2019 Emerging Methods of Computing

    51/55

    F

    Imager

    Low-leveltasksADC

    F

    Mid-leveltasks

    High-leveltasks

    Microprocessor

    f f

    Vision co-processor

    F

    . . .

    . . .

    Smart

    f

    Information Flow:F >> f > f

    Sensor Memory

    Mixed-signal

    processor

    SIMDCellular Nonlinear Networks provide a

    computation framework for that

    Eye-RIS: A Representative Industrial

  • 8/10/2019 Emerging Methods of Computing

    52/55

    ANAFOCUS

    Eye RIS: A Representative IndustrialSIMD-CNN Vision Systems

    ImageAcquisition

    In-pixel ImageProcessing

    In-pixel ImageMemory

    ImageProcessor

    Coordinates

    Processor

    Image post-processing

    and control

    Data andprogrammemory

    MLP

    SIS Q-Eye DIPRISC Processor

    DDR

    I/O and

    Communications

    Q-eye

    Controller

    I/O

    Interface

  • 8/10/2019 Emerging Methods of Computing

    53/55

    Some Open Points for Discussion

    How to achieve convergence with mainstreamindustrial approaches ??

    Hardware-software standards

    Which can kind of data encoding will be used for

    sensor-processor outcomes ??

    only spatial-temporal events

    data features

    Is it possible to derive a reduced set of data features

    which constitute an universal basis for vision tasks ??

    ad-hoc sensor design constraint industrialdeployment

    May conventional frame-based architectures confront

    the challenges of vision ??

    ANALOG

  • 8/10/2019 Emerging Methods of Computing

    54/55

    Digital

    VLSI DataTiming

    Supply, etc

    ANALOG

    FRONT-END

    GRAY SCONCEPT

    Driven by Speed, Accuracy

    Calibration and

    Correction

    MEAD SCONCEPT, BIO-INSPIREDDriven by Power Consumption

    Analog

    Processing

    AnalogVLSI

    MASSIVE DATA

    SENSORS

    Analog

    VLSI

    ANALOG

    FRONT-END

    Mixed-Signal

    VLSI

  • 8/10/2019 Emerging Methods of Computing

    55/55

    Emerging Methods of Computing

    Luca Benini ETHZ

    Wayne Burleson University of Massachusetts

    Fabien Clermidy CEA-LETI

    Enrico Macii Politecnico di TorinoAngel Rodriguez-Vazquez University of Sevilla

    Chair:Yusuf Leblebici, EPFL