20
The Eurolab4HPC project has received funding from the European Union Horizon 2020 Framework Programme (H2020-EU.1.2.2. - FET Proactive) under grant agreement number 800962 . 1

The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

  • Upload
    others

  • View
    14

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

The Eurolab4HPC project has received funding from the European Union Horizon 2020 Framework Programme (H2020-EU.1.2.2. - FET Proactive) under grant agreement number 800962.1

Page 2: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Eurolab4HPC• Eurolab4HPC: 2015–2017 (CSA)• Eurolab4HPC2: 2018–2020 (CSA)

2

Page 3: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Structure the HPC Community (1/2)

3

Thematic workshops• Dedicated and focussed workshops where

the different stakeholders for a particular theme

• can meet, brainstorm, work on a vision, and make plans for collaborations

Oct 2018 (Heraklion): Eurolab4HPC Thematic workshop on Accelerators

Apr 2019 (Edinburgh): Eurolab4HPC Thematic workshop on multicore programming

Oct 2019 (Bilbao): Eurolab4HPC Thematic Session onprogramming models for upcoming supercomputers

Eurolab school• Colocated with HiPEAC ACACES summer school• Three courses on HPC• Eurolab offered travel grants for students

Page 4: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Short collaborations

4

• Travel costs for short stays covered up to €5,000

• Duration approx. 3 months• Foster tight integration among

research teams at multiple levels• Hardware, architectures,

programming and applications• PhD students, post-doctoral

researchers and academics

Structure the HPC Community (2/2)

Name Title To From

Alejandro Soba

HPC for Neutron Transport in a Fusion Reactors Core

CNEA, Argentina BSC

Albert Kahira Scaling Large Machine Learning Models for HPC

BSC Tokyo Institute of Technology

Sandra Catalan

Math Libraries Migration and Optimization on ARM-based processors

BSC Fujitsu

IvanFernandezVega

Processing in Memory for Data Intensive Applications

University of Malaga

ETHZ

Christopher Eldred

Themis Univ. GrenobleAlpes/Inria

Imperial College

Page 5: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

5

EuroLab-4-HPC 2017 Vision• Long-term vision for excellence in European HPC research• Written by expert working group

• Luca Benini, Babak Falsafi, Paul Carpenter, Avi Mendelson, Theo Ungerer• Sandro Bartolini, François Bodin, Dietmar Fey, Marc Duranton,

Igor Zacharov

• Complements HiPEAC Vision• Broad scope of computing and society: HPC, AI, embedded• 2017–2023 research horizon

• Complements ETP4HPC SRA – Strategic Research Agenda• Industry-driven roadmap to build competitive European HPC technology value chain.• 6-year horizon

• Released August 2017• 2020 update under preparation• BoF: Challenges and Opportunities in Academic HPC Systems

Research in 203017:15 to 18:45 on Wednesday 20 November (today!)

Page 6: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

6

FETHPCEuroHPC

R&D projects

EuroLab4HPCvision

EuroLab4HPC

EXDCI

HPC applications

ETP4HPCSRA

HPC infrastructure

PRACE

HPC industryETP4HPC

HPC systems academia

Codesign

CoEs FOCUSCOE

Page 7: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

7

Promote entrepreneurship (1/2)

Idea-to-business training• First held in Gothenburg, November 2018• Very early stages, i.e. when founders only have a piece

of technology and an idea of the potential market. • Business Model Canvas• Customer Development Process

Open source innovation camp⇒ Open source natural forum for:• Testing ideas and innovate,• Promoting entrepreneurship,• Fostering industry take-up `• Training more experts in HW and SW

• Supported RISC-V Workshop, June 2019, Zurich

• Eurolab4HPC extends it with additional events together with the FOSSI foundation

• Eurolab also presented prizes to winners of call for contributions for open source HPC projects

Page 8: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Business prototyping projects (BPPs)

8

• Successful projects are mentored using the business model canvas.

• Business model hypotheses iteratively verified or adjusted to come up with business plan

• Eurolab4HPC grant of € 15,000• Typically lasts 3 months

Promote entrepreneurship (2/2)

Name Title Start date

Josep de La PuenteJose María CelaAlejandro Martí

USER - Urgent Simulation of Earthquakes for Resilience

February 2019

Monica de MierStephan Mohr Gerhard Goldbeck

ATOMSIM - Atomistic simulations for materials design

December 2018

Petar RadojkovicPaul CarpenterMario Nemirovsky

PREDICT: Prediction of HPC failures in the field

June 2019

Page 9: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Industry days

9

• First Eurolab4HPC Industry Day, November 2018 (Multicore Day, Sweden)

• Second Eurolab4HPC Industry Day, February 2019 (University of Edinburgh)

Stimulate technology transfer (1/2)

Technology transfer training• First Eurolab4HPC Technology Transfer Training

Course, HLRS, Stuttgart, March 2019• Two-day course intended to equip academic

researchers• Basic understanding of the entrepreneurial process• Generation of business models• Managerial challenges

Page 10: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Technology transfer projects (TTPs)

10

• Bilateral technology transfer between• one academic partner as technology provider and• one industry partner as technology receiver

• Funded TTP typically lasts 6 months • One TTP project was granted• Total budget of the TTP is €90,000, of which Eurolab4HPC

pays up to 50% (€45,000)

Stimulate technology transfer (2/2)

Page 11: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

BoF: Challenges and Opportunities in Academic HPC Systems Research in 2030

17:15 to 18:45 on Wednesday 20 November (today!)Room 708 on ground floor

Public workshop in process of preparing 2020 vision update

John Davis (BSC)Martin Schulz (LRZ + TUM)

https://sc19.supercomputing.org/presentation/?id=bof197&sess=sess337

11

Page 12: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Contact us

mail: [email protected]: @eurolab4HPCwebsite: eurolab4HPC.eu

12

Page 13: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Backup

13

Page 14: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

14

Comparison of HPC RoadmapsGoal Timespan SWOT /

political Scope Num.pages

HiPEAC vision Steer European academic research (driven by industry)

“short”: 3 years,“mid”: 6 years,long: >2020

Y HPC + embedded 72

ETP4HPC SRA / EXDCI

Strengthening European [industrial] HPC ecosystem

6 years(2014 to 2020) Y HPC except

applications 92

PRACE Scientific Case

[Academic] need for European HPC infrastructure

8 years(2012 to 2020) Y HPC applications 159

EESI (European Exascale Software Initiative)

Development of efficient exascale applications 5 to 10 years N Exascale applications 34

BDVA (Big Data Value Association) Big Data technologies roadmap 2020 - Big data 45

Rethink BigRoadmap for European Technologies in Hardware and Networking for Big Data

- Big data

ECSEL MASRIAEuropean leadership in enabling and industrial technologies. Competitive EU ECS industry.

2015 roadmap to about 2025 Y Electronic components

and systems (ECS) 91

Eurolab4HPC Academic excellence in HPC 8 to 10 years N Whole HPC stack 20—25

Page 15: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

15

Page 16: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

New application and software technologies pull

• Strong scaling evolutionary applications• Importance of legacy codes• Need to adapt for scalability and heterogeneity

• Potential new HPC applications• HPDA: High-performance Data Analytics• Deep learning/neuromorphic• Industrial Internet of Things; Industry 4.0• New expert programming (DSLs)• Approximate computing (concerns SW and HW)• ...

• Need for co-design• Interactivity, real-time and bursting to cloud

16

Page 17: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

System software and programming environment

• Sustained increase in system complexity, specialization and heterogeneity => intelligent programming environments

• Complex application performance analysis and debugging• Too many threads to analyse• Big gap between anomalous behaviour and source code changes

• Disruptive models of computation• Will Fortran, C, sockets, POSIX, MPI, OpenMP be replaced? (probably not)

• Convergence between storage and memory• Neuromorphic, resistive and quantum computing

• Task-based, or other programming abstraction?

17

Page 18: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Vertical Challenges: Green ICT, Energy and Resiliency• Green ICT

• Computer systems already consume a large part of our natural resources • Improving PUE (power usage effectiveness) and DCIE (data centre

infrastructure efficiency)• Heat reuse• Environmental impact: reducing CO2, life cycle assessment

• Energy• Energy efficiency: efficient software stacks, libraries• NVM, new materials

• Resiliency• Shrinking increases failure rates: 10s atoms per transistor• Enormous number of components• Compiler level resilience, reliability in runtime and programming models, ABFT

(Application/Algorithm Based Fault Tolerance)

18

Page 19: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Data centre and cloud computing

• Convergence of HPC and cloud computing• Data analytics complement simulation• Science entering 4th paradigm

• 1: theory, 2: empirical, 3: simulation• instrument data (LHC/SKA), simulation data, sensor data, human data

• Warehouse-scale computers• Commodity systems and very low prices• Increasingly shared concerns: enterprise – HPC

• Scale out, dynamic resource management• High utilization, parallelization (multicore, GPU,FPGA)• Low latency interconnect, application resiliency• Infrastructure costs, economies of scale

19

Page 20: The Eurolab4HPC project has received funding from the ... · Fusion Reactors Core; CNEA, Argentina : BSC. Albert Kahira; Scaling Large Machine Learning Models for HPC . BSC; TokyoInstitute

Disruptive Technologies will Push Future HPC

• Sustaining Technology (improving HW in ways generally expected) • Continuous CMOS Scaling • Die Stacking - 3D-Chip

• Disruptive Technology in Hardware/VLSI (innovation that creates a new line of HPC HW superseding existing HPC techniques):

• NVM Technologies (Memristors, STT-RAM)• Photonics

• Disruptive Technology (alternative ways of computing)• Resistive Computing• Neuromorphic Computing • Quantum Computing

• Beyond CMOS• Nanotubes• Graphene• Diamond

20