13

Microbenchmarking the GT200 GPU

Micro Benchmark

Download PDF Report

Upload
malliwi88
View
2
Download
1

Embed Size (px)

DESCRIPTION

cuda bench

Citation preview

Microbenchmarking the GT200 GPU
Objective

Measure low-level performance

Speculate about the microarchitecture of the GT200 GPU
Objective

Pipeline

Several pipelines

Pipeline latency and throughput

Memory hierarchy

Caches

Off-chip DRAM latency and throughput
Pipeline Benchmarks

Use clock() for timing

Two instructions: move and left shift

Unrolled sequence of arithmetic ops

One thread for latency measurement

512 threads for throughput

uint start_time = clock();repeat256(a = a + b;)uint stop_time = clock();

[Save timers to global memory]
Pipeline Benchmark Caveats

Unrolled loop must fit in 8 KB icache

Variable instruction length (32- and 64-bit)

Instruction cache cold misses

Measure only second iteration of loop

PTX instructions to real instructions mapping not one-to-one

Optimization by nvopencc and ptxas

Need to circumvent optimizations by clever coding
Pipeline Functional Units

SP

SP

SP

SP

SFU

SP

SP

SP

SP

SFU

Instruction Fetch/Dispatch

Instruction L1 Data L1Streaming Multiprocessor

Shared Memory

DPU

SP

SP

SP

SP

SFU

SP

SP

SP

SP

SFU

Instruction Fetch/Dispatch

Instruction L1 Data L1Streaming Multiprocessor

Shared Memory

DPU

Functional unit

Latency (clocks)

Throughput(ops/clock)

SP add, sub,(int, float)

24 8

SFU rsqrt, log2f,(float)

28 2

DPU add, sub,(double)

48 1
SP Pipeline
32-Bit Multiplication

r1 = r1 * r2 r5 = r2.lo * r1.hi (mul24) r5 = (r2.h1 * r1.lo) + r5 (mad24) r5 = r5
Instruction Cache

Sequence of independent 64-bit SP instructions (cvt)

Loop over of varying amount of code

Cache misses increase measured runtime

L2 icache is supposed to be shared with L2 constant cache
Instruction Cache

8KB L1

32KB L2
Register File Allocation
Register File Allocation

Register file has 64 banks

Can not utilize entire register file if not a multiple of 64 threads

No bank conflicts, as alluded to in manual

8 banks per SP allows single-ported memories to be used for register files

bandwidth for ALU

bandwidth for memory fetch
Global Memory

Read Latency 435 - 450 cycles

Chain of dependent reads repeat1024 (j=array[j]) where array[j]=j+stride

TitleSlide 2Slide 3Slide 4Slide 5Slide 6Slide 732-Bit MultiplicationSlide 9Slide 10Slide 11Slide 12Global Memory

Benchmark Connector Corporation WELCOME TO BENCHMARK CONNECTOR CORPORATION WELCOME TO BENCHMARK CONNECTOR CORPORATION Capabilities Presentation

Benchmark Connector Corporation WELCOME TO BENCHMARK CONNECTOR CORPORATION WELCOME TO BENCHMARK CONNECTOR CORPORATION Capabilities Presentation

Documents

Benchmark of Micro Reactor Applications in Organic Synthesis · Benchmark of Micro Reactor Applications in Organic Synthesis Paul Watts Organische Synthesen in Mikrostrukturreaktoren

Benchmark of Micro Reactor Applications in Organic Synthesis · Benchmark of Micro Reactor Applications in Organic Synthesis Paul Watts Organische Synthesen in Mikrostrukturreaktoren

Documents

æ ã...2010 Benchmark 2011 Benchmark 2012 Benchmark 2013 Benchmark 2014 Benchmark Repeat Reporters Repeat Reporters Repeat Reporters Repeat Reporters Repeat Reporters Repeat 24% Repeat

æ ã...2010 Benchmark 2011 Benchmark 2012 Benchmark 2013 Benchmark 2014 Benchmark Repeat Reporters Repeat Reporters Repeat Reporters Repeat Reporters Repeat Reporters Repeat 24% Repeat

Documents

A Micro-benchmark Suite for AMD GPUs

A Micro-benchmark Suite for AMD GPUs

Documents

Benchmark Minerals World Tour –Perth Stop Benchmark ......Sep 17, 2018 · Benchmark Minerals World Tour –Perth Stop. Benchmark Minerals Intelligence. 17 September, 2018. Perth

Benchmark Minerals World Tour –Perth Stop Benchmark ......Sep 17, 2018 · Benchmark Minerals World Tour –Perth Stop. Benchmark Minerals Intelligence. 17 September, 2018. Perth

Documents

5 th Grade Science Physical Science Benchmark A 8; Benchmark B 9, 16; Benchmark C 10, 24; Benchmark D 11; Benchmark E 22; Benchmark F 1,18891610 241122118

5 th Grade Science Physical Science Benchmark A 8; Benchmark B 9, 16; Benchmark C 10, 24; Benchmark D 11; Benchmark E 22; Benchmark F 1,18891610 241122118

Documents

Benchmark Report 2019 1 IACCM Benchmark Report 2019

Benchmark Report 2019 1 IACCM Benchmark Report 2019

Documents

AgieCharmilles CUT 1000 CUT 1000 OilTech - Wire …...This wire cut machine is a benchmark in micro erosion. The benchmark for ultra accurate results in micro wire EDM applications

AgieCharmilles CUT 1000 CUT 1000 OilTech - Wire …...This wire cut machine is a benchmark in micro erosion. The benchmark for ultra accurate results in micro wire EDM applications

Documents

KERALA STATE BENCHMARK PARAMETERS BENCHMARK PARAMETERSOF

KERALA STATE BENCHMARK PARAMETERS BENCHMARK PARAMETERSOF

Documents

Science Benchmark Code Benchmarkdq8tyqszjypss.cloudfront.net/wp-content/uploads/2017/10/Amplify... · Benchmark Code Benchmark LESSONS WHERE STANDARD/BENCHMARK IS DIRECTLY ... an

Science Benchmark Code Benchmarkdq8tyqszjypss.cloudfront.net/wp-content/uploads/2017/10/Amplify... · Benchmark Code Benchmark LESSONS WHERE STANDARD/BENCHMARK IS DIRECTLY ... an

Documents

A Micro-benchmark Suite for AMD GPUs Ryan Taylor Xiaoming Li

A Micro-benchmark Suite for AMD GPUs Ryan Taylor Xiaoming Li

Documents

Analysis of Benchmark Characteristics and Benchmark ......Analysis of Benchmark Characteristics and Benchmark Performance Prediction† Rafael H. Saavedra ‡ Alan Jay Smith ‡‡

Analysis of Benchmark Characteristics and Benchmark ......Analysis of Benchmark Characteristics and Benchmark Performance Prediction† Rafael H. Saavedra ‡ Alan Jay Smith ‡‡

Documents

OGT Social Studies March 2008 Citizenship Rights and Responsibilities: Benchmark A 11, 31, 42; Benchmark B 41131424 Economics: Benchmark A 3, 13; Benchmark

OGT Social Studies March 2008 Citizenship Rights and Responsibilities: Benchmark A 11, 31, 42; Benchmark B 41131424 Economics: Benchmark A 3, 13; Benchmark

Documents

Ohio Graduation Test 2009 – Math Data Analysis and Probability : Benchmark A 31, 38; Benchmark B 1; Benchmark D 16; Benchmark E 22; Benchmark F 35; Benchmark

Ohio Graduation Test 2009 – Math Data Analysis and Probability : Benchmark A 31, 38; Benchmark B 1; Benchmark D 16; Benchmark E 22; Benchmark F 35; Benchmark

Documents

Ohio Graduation Test 2008 – Math Data Analysis and Probability : Benchmark A 18; Benchmark B 39; Benchmark D 15; Benchmark F 6; Benchmark G 3, 22; Benchmark

Ohio Graduation Test 2008 – Math Data Analysis and Probability : Benchmark A 18; Benchmark B 39; Benchmark D 15; Benchmark F 6; Benchmark G 3, 22; Benchmark

Documents

Tsunami benchmark cases: benchmark # 3

Tsunami benchmark cases: benchmark # 3

Documents

benchmark corporate profile - Benchmark Group of … benchmark group The Benchmark Group of Companies is ... The highly successful Benchmark Business ... benchmark_corporate_profile.indd

Documents

Benchmark - Imagine It Login · PDF file100-Point Skills Battery Benchmark 1 Benchmark 2 Benchmark 3 Benchmark 4 ... Therefore, it is important to administer each Benchmark Assessment

Benchmark - Imagine It Login · PDF file100-Point Skills Battery Benchmark 1 Benchmark 2 Benchmark 3 Benchmark 4 ... Therefore, it is important to administer each Benchmark Assessment

Documents

Profitability Benchmark Playbook › ~ › media › ...Profitability Benchmark Playbook Profitability Benchmark Playbook Profitability Benchmark Playbook Page 01 Partners are transforming

Profitability Benchmark Playbook › ~ › media › ...Profitability Benchmark Playbook Profitability Benchmark Playbook Profitability Benchmark Playbook Page 01 Partners are transforming

Documents

Options-Based Benchmark Indexes: Performance, Risk and Premium Capture (June 1986-Dec ... › micro › buywrite › wilshire-options-based... · 2019-03-23 · strategies had superior

Options-Based Benchmark Indexes: Performance, Risk and Premium Capture (June 1986-Dec ... › micro › buywrite › wilshire-options-based... · 2019-03-23 · strategies had superior

Documents

Welcome to Benchmark GroupWelcome to Benchmark · PDF fileWelcome to Benchmark GroupWelcome to Benchmark Group ... services in marketing and social communications planning, ... media

Welcome to Benchmark GroupWelcome to Benchmark · PDF fileWelcome to Benchmark GroupWelcome to Benchmark Group ... services in marketing and social communications planning, ... media

Documents

Interactive Gibson Benchmark: A Benchmark for Interactive

Interactive Gibson Benchmark: A Benchmark for Interactive

Documents

Application and Micro-benchmark ... - Ohio State Universitymug.mvapich.cse.ohio-state.edu › static › media › mug › presentation… · SAN DIEGO SUPERCOMPUTER CENTER at the

Application and Micro-benchmark ... - Ohio State Universitymug.mvapich.cse.ohio-state.edu › static › media › mug › presentation… · SAN DIEGO SUPERCOMPUTER CENTER at the

Documents

Benchmark and Benchmark Platinum Boilers

Benchmark and Benchmark Platinum Boilers

Documents

Macro Needs Micro - UW Faculty Web Servermacro needs more micro than the benchmark setup has been incorporating so far. Speci–cally, ... teaching, research, and policy advice.

Macro Needs Micro - UW Faculty Web Servermacro needs more micro than the benchmark setup has been incorporating so far. Speci–cally, ... teaching, research, and policy advice.

Documents

A Micro-benchmark Suite for Evaluating HDFS Operations on

A Micro-benchmark Suite for Evaluating HDFS Operations on

Documents

The Michigan Benchmark: A Micro-Benchmark for …pages.cs.wisc.edu/~jignesh/publ/chapter.pdfThe Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics Jignesh

The Michigan Benchmark: A Micro-Benchmark for …pages.cs.wisc.edu/~jignesh/publ/chapter.pdfThe Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics Jignesh

Documents

Workloads Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Live workload Benchmark applications Micro- benchmark

Workloads Experimental environment prototype real sys exec- driven sim trace- driven sim stochastic sim Live workload Benchmark applications Micro- benchmark

Documents

Welcome to Benchmark GroupWelcome to Benchmark Group

Welcome to Benchmark GroupWelcome to Benchmark Group

Documents

4th Grade Math Data Analysis and Probability : Benchmark B 7; Benchmark G 4; Benchmark F 12, 26741226 Geometry and Spatial Sense: Benchmark B 1;1 Benchmark

4th Grade Math Data Analysis and Probability : Benchmark B 7; Benchmark G 4; Benchmark F 12, 26741226 Geometry and Spatial Sense: Benchmark B 1;1 Benchmark

Documents