17
Pros and Cons of HPC Cloud Computing Nils gentschen Felde CloudStat 2011

Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons ofHPC Cloud Computing

Nils gentschen Felde

CloudStat 2011

Page 2: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 2

Motivation - Idea

HPC Cluster

HPC Cloud Cluster

Management benefits of virtual HPC• Dynamical sizing / partitioning• Loadbalancing• Automation / scripting of management tasks• Fault tolerance without „Checkpoint and restart“• Better security due to sandboxing of processes• …

Page 3: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 3

Motivation - Challenges

But: What about performance?

•Effects expected:-Virtualization layer will induce overhead-Concurrent effects appear when running several VMs in parallel; e.g. effects on storage, network, memory access

-Effects will depend on virtualization architecture, implementations, OS, …

•What scale will that effects be?•How to measure?

Page 4: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 4

Outline

High-performance aspects in virtualized infrastructuresVitalian Danciu, Nils gentschen Felde, Dieter Kranzlmüller, Tobias Lindinger

Content:-Designing a Benchmark Suite-Applying the Benchmark Suite-Impacts to HEC Problem Classes-Conclusion-Outlook & Further Research

Page 5: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching

Benchmark Suite – Design Goals•Measure core components of virtualized systems

-CPU-Main Memory-Network-Disk

•Compare results of different hypervisors-Architectures (full-, native-, … , para-virtualization) -Implementations (VMware ESXi, Xen, MS Hyper-V, …)

•Analyze impact of virtualization on different… -Operating Systems (Windows, Linux, …)-System architectures (32bit, 64bit, …)

•Measure-Overhead of virtualization-Effects of concurrency

•Take care about special time measurement in VMs5

Page 6: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 6

Benchmark Suite – Time MeasurementMeasurement of time difficult in virtualized environments:

-Definition of time

-Time synchronizationExample: Linpack-Benchmarkstatic REAL second(void){

return ((REAL)((REAL)clock()/(REAL)CLOCKS_PER_SEC));}# clock() : Wall-Clock-Time in time ticks since process start# CLOCKS_PER_SEC: Frequency of the hardware clock

External clocks may be necessary, depending on counters used by the benchmark tool

VM1

Hypervisor

VM1

CPU1CPU2

ZeitI/O VM1

Real durationDuration as seen by VM1

Page 7: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 7

Benchmark Suite - Implementation•Benchmarks used:

•Run combinations of single benchmarks to test for concurrent effects, e.g.

-CPU + Network-Parallel network access-Different network setups (VM2VM, VM2PM, …)-…

Component BenchmarkCPU LinpackMain Memory RamspeedDisk IometerNetwork Iometer

Page 8: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 8

Benchmark Suite - Implementation•Benchmarks used:

•Run combinations of single benchmarks to test forconcurrent effects, e.g.

-CPU + Network-Parallel network access-Different network setups (VM2VM, VM2PM, …)-…

Component BenchmarkCPU LinpackMain Memory RamspeedDisk IometerNetwork Iometer

Page 9: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 9

Applying the Test Suite

Absolute runtime Relative runtime

Physical Linux 450,58

Xen Para 448,79

OpenVZ 452,53

MS Hyper-V 485,76

VMware ESXi 472,00

0,00100,00200,00300,00400,00500,00600,00

Run-

tim

e in

sec

onds

Physical Linux 100,00%

Xen Para 99,60%

OpenVZ 100,43%

MS Hyper-V 107,81%

VWware ESXi 104,75%

0,00%20,00%40,00%60,00%80,00%

100,00%120,00%

Effe

ctiv

enes

s

Linpack – Absolute/Relative runtime

Page 10: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 10

Applying the Test Suite

Xen

Hyper-V ESXi

Virtuozzo

CPU – effects of concurrency

single VM two VMs three VMs

VM3 466 481 728

VM2 481 739

VM1 728

0

500

1.000

1.500

2.000

2.500

Runt

ime

in s

econ

ds

single VM two VMs three VMs

VM3 486 487 938

VM2 472 470

VM1 939

0

500

1.000

1.500

2.000

2.500

Runt

ime

in s

econ

ds

single VM two VMs three VMs

VM3 213,21 331,18 519,7

VM2 333,2 518,74

VM1 524,77

0

500

1.000

1.500

2.000

2.500

Runt

ime

in s

econ

ds

single VM two VMs three VMs

VM3 214,2 293,55 451,96

VM2 295,59 452,52

VM1 455,63

0

500

1.000

1.500

2.000

2.500

Runt

ime

in s

econ

ds

Page 11: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 11

Xen – receive data from network Xen – send data to network

Virtuozzo – receive data from network Virtuozzo – send data to network

Network – effects of concurrency

single VM two VMs three VMs

VM 3 17,17

VM 2 25,89 17,03

VM 1 49,26 25,41 16,93

010203040506070

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 20,85

VM 2 30,85 20,46

VM 1 60,00 30,40 20,27

010203040506070

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 17,511341

VM 2 24,436174 17,528381

VM 1 51,876715 25,291585 17,512457

010203040506070

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 3,769137

VM 2 5,652343 3,767896

VM 1 60,712871 5,651204 3,765671

010203040506070

Thro

ughp

ut in

MBy

te/s

Applying the Test Suite

Page 12: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 12

Hyper-V – receive data from network Hyper-V– send data to network

ESXi– receive data from network ESXi– send data to network

single VM two VMs three VMs

VM 3 17,61

VM 2 25,94 17,61

VM 1 51,79 25,94 17,61

010203040506070

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 3,77

VM 2 31,97 3,77

VM 1 63,98 32,84 3,77

010203040506070

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 15,36

VM 2 30,55 15,34

VM 1 46,02 30,24 15,31

0

10

20

30

40

50

60

70

Thro

ughp

ut in

MBy

te/s

single VM two VMs three VMs

VM 3 21,44

VM 2 22,87 21,31

VM 1 62,26 22,87 21,33

0

10

20

30

40

50

60

70

Thro

ughp

ut in

MBy

te/s

Network – effects of concurrency

Applying the Test Suite

Page 13: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 13

Xen

Hyper-V ESXi

Virtuozzo

Network – Physical vs. virtual communication peers

Send Receive

VM to PM 51,79 63,98

VM to VM 108,29 112,23

0,00

50,00

100,00

150,00

200,00

250,00

Thro

ughp

ut in

MBy

te/s

Send Receive

VM to PM 46,02 62,26

VM to VM 129,31 129,01

0,00

50,00

100,00

150,00

200,00

250,00

Thro

ughp

ut in

MBy

te/s

Send Receive

VM to PM 49,26 60,00

VM to VM 233,85 210,78

0,00

50,00

100,00

150,00

200,00

250,00

Thro

ughp

ut in

MBy

te/s

Send Receive

VM to PM 51,88 60,71

VM to VM 12,23 9,26

0,00

50,00

100,00

150,00

200,00

250,00

Thro

ughp

ut in

MBy

te/s

Applying the Test Suite

Page 14: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 14

Performance Requirements

Criterion High-Performance High-Throughput

Coupling tight loose CPU impact co-determinant critical Interconnect impact critical less Main memory addressing

sensitive; prefers contiguous, uniform latency addressing

less sensitive

Program structures inter-communicating program replicas

workflows; pipelining of computing tasks

Examples fluid dynamics problems, crash codes

high-energy physics (e.g. CERN LHC experiments), generalparameter variation studies

Page 15: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 15

Conclusion•CPU virtualization is efficient (close to 100%)•RAM access is fast

High Throughput Computing (HTC) Tasks can bevirtualized efficiently

•Network Access efficiency depends on-flow direction-concurrent use-Available CPU power

•Topology changes (e.g. live migration) “confuses” MPI High Performance Computing (HPC) Tasks should not be virtualized yet

Page 16: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching

Outlook•Technical realization of virtual High Throughput Computing•More research for virtualized High Performance Computing needed:

-Adapt MPI to detect topology changes caused by live migrations

-Calculate distributions of VMs with minimal communication overhead

-Allocate physical NICs to dedicated VM eventually-Use additional I/O virtualization approaches

16

Page 17: Pros and Cons of HPC Cloud Computing€¦ · • Fault tolerance without „Checkpoint and restart“ • Better security due to sandboxing of processes • … Pros and Cons of HPC

Pros and Cons of HPC Cloud Computing – 23.03.2011 – Nils gentschen Felde – TUM Garching 17

Questions?