16
Issues of HPC software ——From the experience of TH-1A Lu Yutong NUDT

Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Embed Size (px)

Citation preview

Page 1: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Issues of HPC software——From the experience of TH-1A

Lu Yutong

NUDT

Page 2: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

TH-1A system Installed in NSCC-TJ, Aug. 2010 Hybrid MPP structure: CPU & GPU Custom software stack Peak performance 4.7Pflop/s, Linpack 2.566PFlop/s, No.1Top500

Power consumption 4.04MW(635.15MF/W), No.11Green500

Items Configuration

Processors 14336 Intel CPUs + 7168 nVIDIA GPUs + 2048FT CPUs

Memory 262TB in total

Interconnect Proprietary high-speed interconnecting network

Storage Global shared parallel storage system, 2PB

Cabinets120 Compute / service Cabinets

14 Storage Cabinets

6 Communication Cabinets

Page 3: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

3 kinds of LSI chips CPU: FT-1000(PSoC) High radix router ASIC : NRC Network interface ASIC : NIC

15 kinds of PCB boards

Hardware

4 kinds of nodes, 2 sets of network Custom cabinet Water cooling system

Page 4: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

TH-1A software Software stack

Page 5: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

What’s the point

Customization

Optimization

Oriented to the system architecture

Page 6: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Operating systemOperating system

Kylin Linux compute node kernel Provide virtual running environment

Isolated running environments for different users Custom software package installation

QoS support Power aware computing

Page 7: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Proprietary Interconnection based on high radix router High bandwidth packet and RDMA communication

Zero copy user space RDMA MPI base on GLEX: Bandwidth 6.3GB/s Accelerate collective operation with hardware support in

communication interface Fault tolerance

Rapid error detection in large scale interconnection Rebuild communication links

Glex communicating system

Page 8: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Resource management, job scheduler Heterogenous resources management and

topology-aware scheduling Large scale parallel job launcher (custom)

Improve system structure, optimized protocol, network performance, file system performance

logN, klogN System power management Automatic CR supporting Accounting Enhance

Resource management system

Page 9: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Global parallel file system Object storage architecture (Lustre based) Capacity: 2 PB, Scalability Performance: Collective BW

Optimized file system protocol over proprietary interconnection network

Confliction release for concurrency accessing Fine-grain distributed file lock mechanism Optimized file cache policy

Reliability enhancement Fault tolerance of network protocol Data objects placement Soft-raid

Page 10: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Compiler systemCompiler system C, C++, Fortran, Java OpenMP, MPI, OpenMP/MPI CUDA, OpenCL Heterogeneous programming framework

Accelerate the large scale, complex applications, Use the computing power of CPUs and GPUs, hide the

GPU programming to users Inter-node homogeneous parallel programming (users) Intra-node heterogeneous parallel computing (experts)

Page 11: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Accelerating HPL Accelerating HPL (MPI(custom)+OpenMP+Cuda)(MPI(custom)+OpenMP+Cuda) Adaptive partition ● Asynchronous data transfer Software pipeline ● Affinity scheduling ● Zero-copy

Compiler Optimized

Page 12: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Programming environmentProgramming environment

Virtual running environments Provide services on demand

Parallel toolkits Based on Eclipse To integrate all kinds of tools Editor, debugger, profiler

Work flow support Support QoS negotiate Reservation

Page 13: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Nowadays

Our principle Practicality and Usability

HPC system software Mature technology , correctness and functionality

Optimizing technology , increase performance ,

scalarbility and reliability

Needs of new technology and theory

Page 14: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Programming Language

Productive language X10, Chapel, Fortress

Global Arrays, ……

MPI is still vitality Compiler independence

Distributed features

MPI + sth

Page 15: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Exascale computing

Hardware architecture Many cores system Nano-Photonics

System Software-- (Adaptive + Intelligent) Extreme parallelism Heterogenous computing Fault-tolerant computing Power management

Application software Legacy codes supporting Creative model

Page 16: Issues of HPC software From the experience of TH-1A Lu Yutong NUDT

Thank you!