Introduction to HPC Workshop October 9 2014

Introduction to HPC Workshop October 9 2014. Introduction Rob Lane HPC Support Research Computing Services CUIT


Introduction to HPC Workshop

October 9 2014


Introduction

Rob Lane

HPC Support

Research Computing Services

CUIT


Introduction

HPC Basics


Introduction

First HPC Workshop


Yeti

• 2 head nodes

• 101 execute nodes

• 200 TB storage


Yeti

• 101 execute nodes
  – 38 x 64 GB
  – 8 x 128 GB
  – 35 x 256 GB
  – 16 x 64 GB + Infiniband
  – 4 x 64 GB + nVidia K20 GPU


Yeti

• CPU
  – Intel E5-2650L
  – 1.8 GHz
  – 8 Cores
  – 2 per Execute Node
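The hardware figures on these slides can be multiplied out as a quick sanity check. The following is only a back-of-the-envelope sketch in POSIX shell arithmetic; the variable names are made up for illustration, and the core total assumes every execute node carries two 8-core CPUs as listed.

```shell
#!/bin/sh
# Node counts per configuration, taken from the Yeti slides.
n_64=38      # 64 GB nodes
n_128=8      # 128 GB nodes
n_256=35     # 256 GB nodes
n_ib=16      # 64 GB + Infiniband nodes
n_gpu=4      # 64 GB + nVidia K20 GPU nodes

total_nodes=$((n_64 + n_128 + n_256 + n_ib + n_gpu))

# Two CPUs per execute node, eight cores per CPU.
cores_per_node=$((2 * 8))
total_cores=$((total_nodes * cores_per_node))

# Aggregate memory in GB across all execute nodes.
total_mem_gb=$(( (n_64 + n_ib + n_gpu) * 64 + n_128 * 128 + n_256 * 256 ))

echo "nodes=$total_nodes cores=$total_cores mem_gb=$total_mem_gb"
```

The script prints `nodes=101 cores=1616 mem_gb=13696`, which agrees with the 101 execute nodes quoted earlier.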


Yeti

• Expansion Round
  – 66 new systems
  – Faster CPU
  – More Infiniband
  – More GPU (nVidia K40)
  – ETA January 2015


HP S6500 Chassis


HP SL230 Server


Job Scheduler

• Manages the cluster

• Decides when a job will run

• Decides where a job will run

• We use Torque/Moab


Job Queues

• Jobs are submitted to a queue

• Jobs sorted in priority order

• Not a FIFO


Access

Mac Instructions

1. Run terminal


Access

Windows Instructions

1. Search for PuTTY on the Columbia home page

2. Select the first result

3. Follow the link to the PuTTY download page

4. Download putty.exe

5. Run putty.exe


Access

Mac (Terminal)

$ ssh UNI@yetisubmit.cc.columbia.edu

Windows (PuTTY)

Host Name: yetisubmit.cc.columbia.edu


Work Directory

$ cd /vega/free/users/your UNI

• Replace “your UNI” with your UNI

$ cd /vega/free/users/hpc2108


Copy Workshop Files

• Files are in /tmp/workshop

$ cp /tmp/workshop/* .


Editing

No single obvious choice for editor

• vi – simple but difficult at first

• emacs – powerful but complex

• nano – simple but not really standard


nano

$ nano hellosubmit

“^” means “hold down control”

^a : go to beginning of line

^e : go to end of line

^k: delete line

^o: save file

^x: exit


hellosubmit

#!/bin/sh

# Directives

#PBS -N HelloWorld
#PBS -W group_list=yetifree
#PBS -l nodes=1:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M [email protected]
#PBS -m abe
#PBS -V

# Set output and error directories

#PBS -o localhost:/vega/free/users/UNI
#PBS -e localhost:/vega/free/users/UNI

# Print "Hello World"

echo "Hello World"

# Sleep for 10 seconds

sleep 10

# Print date and time

date
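One way to avoid hand-editing every UNI placeholder in the script above is to generate the file from a variable. The sketch below is illustrative only: `abc1234` is a made-up UNI, the output file name is arbitrary, and the `-M` line is left out (with `-m n` set) because the mail address is site-specific.

```shell
#!/bin/sh
# Write a copy of the workshop submit script with UNI filled in.
# "abc1234" is a made-up UNI used only for illustration; change it to yours.
uni="abc1234"
out="hellosubmit.$uni"

cat > "$out" <<EOF
#!/bin/sh

# Directives
#PBS -N HelloWorld
#PBS -W group_list=yetifree
#PBS -l nodes=1:ppn=1,walltime=00:01:00,mem=20mb
#PBS -m n
#PBS -V

# Set output and error directories
#PBS -o localhost:/vega/free/users/$uni
#PBS -e localhost:/vega/free/users/$uni

echo "Hello World"
sleep 10
date
EOF

# Check the generated file for shell syntax errors without running it.
sh -n "$out" && echo "wrote $out"
```

Because the `#PBS` lines are ordinary shell comments, `sh -n` only checks the commands at the bottom; the scheduler is what interprets the directives at submit time.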


hellosubmit

#!/bin/sh

# Directives

#PBS -N HelloWorld
#PBS -W group_list=yetifree
#PBS -l nodes=1:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M [email protected]
#PBS -m n
#PBS -V
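The only change in this version is the `-m` directive, which controls when Torque sends mail. A small helper can make that switch without opening an editor. This is a sketch: `set_mail_option` and the demo file name are hypothetical, not part of the workshop files.

```shell
#!/bin/sh
# Torque/PBS mail options for "#PBS -m":
#   a = mail when the job is aborted
#   b = mail when the job begins
#   e = mail when the job ends
#   n = never send mail

# set_mail_option FILE OPTS: print FILE with its "#PBS -m" line set to OPTS.
set_mail_option() {
    sed "s/^#PBS -m .*/#PBS -m $2/" "$1"
}

# Demonstration on a throwaway file (hypothetical name).
demo=mail_demo.sub
printf '%s\n' '#!/bin/sh' '#PBS -N HelloWorld' '#PBS -m abe' 'date' > "$demo"
set_mail_option "$demo" n
rm -f "$demo"
```

The function writes to standard output rather than editing in place, so the original file stays untouched until the result is redirected somewhere.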



hellosubmit

$ qsub hellosubmit
298151.elk.cc.columbia.edu
$ qstat 298151
Job ID     Name         User       Time Use S Queue
---------- ------------ ---------- -------- - -----
298151.elk HelloWorld   hpc2108           0 Q batch1
$ qstat 298151
qstat: Unknown Job Id Error 298151.elk.cc.columbia.edu
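Instead of re-running `qstat` by hand until the job disappears, the check can be put in a loop. The helper below is a sketch under one assumption: that `qstat` exits with a non-zero status once the job ID is unknown, as the error message above suggests. The name `wait_for_job` and the `QSTAT` override are hypothetical.

```shell
#!/bin/sh
# wait_for_job JOBID [INTERVAL]
# Poll the scheduler until it no longer knows about JOBID.
# Assumption: the status command exits non-zero for an unknown job ID.
# QSTAT can be overridden for testing; it defaults to the real qstat.
wait_for_job() {
    jobid=$1
    interval=${2:-5}
    while "${QSTAT:-qstat}" "$jobid" >/dev/null 2>&1; do
        sleep "$interval"
    done
    echo "job $jobid is no longer in the queue"
}

# Typical use on a submit node (not run here):
#   id=$(qsub hellosubmit)
#   wait_for_job "$id"
```

On clusters configured to keep finished jobs visible for a while, the loop would end somewhat after the job actually completes, so a long polling interval is kinder to the scheduler than a short one.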


hellosubmit

$ ls -l
total 4
-rw------- 1 hpc2108 yetifree 398 Oct  8 22:13 hellosubmit
-rw------- 1 hpc2108 yetifree   0 Oct  8 22:44 HelloWorld.e298151
-rw------- 1 hpc2108 yetifree  41 Oct  8 22:44 HelloWorld.o298151


hellosubmit

$ cat HelloWorld.o298151
Hello World
Thu Oct 9 12:44:05 EDT 2014
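The output and error file names follow a visible pattern: the job name, then `.o` or `.e`, then the numeric part of the job ID. The sketch below derives them from the string that `qsub` prints; `output_files` is a hypothetical helper name, and the pattern is inferred from the listing shown here rather than from scheduler documentation.

```shell
#!/bin/sh
# output_files JOBNAME FULL_JOB_ID
# Print the stdout and stderr file names suggested by the listing:
# <job name>.o<number> and <job name>.e<number>, where <number> is the
# part of the job ID before the first dot.
output_files() {
    name=$1
    number=${2%%.*}    # strip ".elk.cc.columbia.edu"
    echo "$name.o$number"
    echo "$name.e$number"
}

output_files HelloWorld 298151.elk.cc.columbia.edu
```

This prints `HelloWorld.o298151` followed by `HelloWorld.e298151`, matching the two files in the directory listing.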


Any Questions?


Interactive

• Most jobs run as “batch”

• Can also run interactive jobs

• Get a shell on an execute node

• Useful for development, testing, troubleshooting


Interactive

$ cat interactive
qsub -I -W group_list=yetifree -l walltime=5:00,mem=100mb


Interactive

$ qsub -I -W group_list=yetifree -l walltime=5:00,mem=100mb
qsub: waiting for job 298158.elk.cc.columbia.edu to start


Interactive

qsub: job 298158.elk.cc.columbia.edu ready

[ASCII-art yeti banner]

+--------------------------------+
|                                |
| You are in an interactive job. |
|                                |
| Your walltime is 00:05:00      |
|                                |
+--------------------------------+
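The request used `walltime=5:00` while the banner reports `00:05:00`; both mean five minutes, since a walltime is read as seconds, minutes:seconds, or hours:minutes:seconds. A small conversion makes the equivalence explicit. `walltime_seconds` is a hypothetical helper written for this illustration.

```shell
#!/bin/sh
# walltime_seconds SPEC
# Convert a walltime given as SS, MM:SS or HH:MM:SS into seconds.
walltime_seconds() {
    echo "$1" | awk -F: '{
        secs = 0
        for (i = 1; i <= NF; i++) secs = secs * 60 + $i
        print secs
    }'
}

walltime_seconds 5:00        # the form used in the request
walltime_seconds 00:05:00    # the form shown in the banner
```

Both calls print `300`.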


Interactive

$ hostname
charleston.cc.columbia.edu


Interactive

$ exit
logout

qsub: job 298158.elk.cc.columbia.edu completed
$


GUI

• Can run GUIs in interactive jobs

• Need X Server on your local system

• See user documentation for more information


User Documentation

• hpc.cc.columbia.edu

• Go to “HPC Support”

• Click on Yeti user documentation


Job Queues

• Scheduler puts all jobs into a queue

• Queue selected automatically

• Queues have different settings


Job Queues

Queue        Time Limit  Memory Limit  Max. User Run
Batch 1      12 hours    4 GB          512
Batch 2      12 hours    16 GB         128
Batch 3      5 days      16 GB         64
Batch 4      3 days      None          8
Interactive  4 hours     None          4


qstat -q

$ qstat -q

server: elk.cc.columbia.edu

Queue            Memory CPU Time Walltime Node Run Que Lm State
---------------- ------ -------- -------- ---- --- --- -- -----
batch1           4gb    --       12:00:00 --    42  15 -- E R
batch2           16gb   --       12:00:00 --   129  73 -- E R
batch3           16gb   --       120:00:0 --   148 261 -- E R
batch4           --     --       72:00:00 --    11  12 -- E R
interactive      --     --       04:00:00 --     0   1 -- E R
interlong        --     --       48:00:00 --     0   0 -- E R
route            --     --       --       --     0   0 -- E R
                                             ----- -----
                                               330   362
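To make "queue selected automatically" concrete, here is a toy selection function built from the published limits. It is illustrative only: the slides do not show the scheduler's real routing rules, so the first-fit order below is an assumption, and `pick_queue` is a hypothetical name. It takes whole hours and whole gigabytes.

```shell
#!/bin/sh
# pick_queue HOURS MEM_GB
# Illustrative only: choose the first batch queue whose time and memory
# limits fit the request. The real routing rules may differ.
pick_queue() {
    hours=$1
    mem_gb=$2
    if   [ "$hours" -le 12 ]  && [ "$mem_gb" -le 4 ];  then echo batch1
    elif [ "$hours" -le 12 ]  && [ "$mem_gb" -le 16 ]; then echo batch2
    elif [ "$hours" -le 120 ] && [ "$mem_gb" -le 16 ]; then echo batch3
    elif [ "$hours" -le 72 ];                          then echo batch4
    else echo "no matching queue" >&2; return 1
    fi
}

pick_queue 1 1      # small, short job
pick_queue 48 8     # long job, moderate memory
pick_queue 24 64    # large-memory job
```

The three calls print `batch1`, `batch3` and `batch4`. The first result is consistent with the Hello World job (one minute, 20 MB) appearing in `batch1` in the earlier `qstat` output.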


yetifree

• Maximum processors limited
  – Currently 4 maximum

• Storage quota
  – 16 GB

• No email support


yetifree

$ quota -s
Disk quotas for user hpc2108 (uid 242275):
     Filesystem  blocks   quota   limit   grace   files   quota   limit   grace
hpc-cuit-storage-2.cc.columbia.edu:/free/
                   122M  16384M  16384M               8   4295m   4295m


email

from: root <[email protected]>
to: [email protected]
date: Wed, Oct 8, 2014 at 11:41 PM
subject: PBS JOB 298161.elk.cc.columbia.edu

PBS Job Id: 298161.elk.cc.columbia.edu
Job Name: HelloWorld
Exec host: dublin.cc.columbia.edu/4
Execution terminated
Exit_status=0
resources_used.cput=00:00:02
resources_used.mem=8288kb
resources_used.vmem=304780kb
resources_used.walltime=00:02:02
Error_Path: localhost:/vega/free/users/hpc2108/HelloWorld.e298161
Output_Path: localhost:/vega/free/users/hpc2108/HelloWorld.o298161


Intern

• Research Computing Services (RCS) is looking for an intern

• Paid position

• ~10 hours a week

• Will be on LionShare next week


MPI

• Message Passing Interface

• Allows applications to run across multiple computers


MPI

• Edit MPI submit file

• Load MPI environment module

• Compile sample program


MPI

#!/bin/sh

# Directives

#PBS -N MpiHello
#PBS -W group_list=yetifree
#PBS -l nodes=3:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M [email protected]
#PBS -m abe
#PBS -V

# Set output and error directories

#PBS -o localhost:/vega/free/users/UNI
#PBS -e localhost:/vega/free/users/UNI

# Load mpi module.

module load openmpi

# Run mpi program.

mpirun mpihello
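The script runs `mpirun` without a process count, which suggests that the number of MPI processes comes from the `nodes` and `ppn` values in the resource request. That is an assumption about how Open MPI is integrated with the scheduler on this cluster, not something the slides state. Under that assumption, the sketch below computes how many slots a request asks for; `slots_requested` is a hypothetical helper.

```shell
#!/bin/sh
# slots_requested SPEC
# Given a request such as "nodes=3:ppn=1", print nodes * ppn, the number
# of processor slots asked for. Either value defaults to 1 when omitted.
slots_requested() {
    nodes=$(echo "$1" | sed -n 's/.*nodes=\([0-9][0-9]*\).*/\1/p')
    ppn=$(echo "$1" | sed -n 's/.*ppn=\([0-9][0-9]*\).*/\1/p')
    echo $(( ${nodes:-1} * ${ppn:-1} ))
}

slots_requested "nodes=3:ppn=1"
```

For the request in the MPI submit file this prints `3`, one slot on each of three nodes.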


MPI

$ module load openmpi
$ which mpicc
/usr/local/openmpi/bin/mpicc
$ mpicc -o mpihello mpihello.c


MPI

$ qsub mpisubmit
298501.elk.cc.columbia.edu


Questions?

Any questions?