14

1 1 Evaluating the Performance of Integer Sum Reduction in SYCL on GPUs ZHEMING JIN ACKNOWLEDGMENT * Experimental Computing Laboratory at Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. * Intel DevCloud

Evaluating the Performance of Integer Sum Reduction in

Download PDF Report

Upload
others
View
3
Download
0

Embed Size (px)

Citation preview

Page 1: Evaluating the Performance of Integer Sum Reduction in

11

Evaluating the Performance of Integer Sum Reduction in SYCL on GPUs

ZHEMING JIN

ACKNOWLEDGMENT

* Experimental Computing Laboratory at Oak Ridge National Laboratory,

which is supported by the Office of Science of the U.S. Department of

Energy under Contract No. DE-AC05-00OR22725.

* Intel DevCloud

Page 2: Evaluating the Performance of Integer Sum Reduction in

22

Overview

• Motivation

• Background

• Implementation

• Experiment

• Conclusion

Page 3: Evaluating the Performance of Integer Sum Reduction in

33

Motivation

• SYCL is a promising programming model

Page 4: Evaluating the Performance of Integer Sum Reduction in

44

Background - SYCL

Page 5: Evaluating the Performance of Integer Sum Reduction in

55

Background – SYCL with CUDA support

Page 6: Evaluating the Performance of Integer Sum Reduction in

66

Background – Integer sum reduction

Page 7: Evaluating the Performance of Integer Sum Reduction in

77

SYCL Implementation 1 – Integer sum reduction

Page 8: Evaluating the Performance of Integer Sum Reduction in

88

SYCL Implementation 2 – Integer sum reduction

Page 9: Evaluating the Performance of Integer Sum Reduction in

99

SYCL Implementation 3 – Integer sum reduction

Page 10: Evaluating the Performance of Integer Sum Reduction in

1010

Experimental Setup

• Intel P630, Nvidia P100 and V100 GPUs

• Number of integers is 1048576000, approximately 4 GB in memory size

• Timing (milliseconds) measured with the Intel OpenCL intercept layer and Nvidia performance profiler

• The average execution time of 100 invocations of a kernel for the reduction performance

Page 11: Evaluating the Performance of Integer Sum Reduction in

1111

Experimental Results

Minimum time : 110.9 ms(P630)

Minimum time : 9.197 ms(P100)

Minimum time : 4.671 ms(V100)

Page 12: Evaluating the Performance of Integer Sum Reduction in

1212

Comparison with Others

• The fastest SYCL implementations are approximately 3% and 0.3% faster than the templated reduction in Thrust and the device reduction in CUB, respectively

• The fastest SYCL implementations are approximately 1.9% and 0.4% faster than the templated reduction in Thrust and the device reduction in CUB, respectively

[1] https://github.com/intel/llvm/tree/sycl/sycl/doc/extensions/Reduction[2] Bell, N. and Hoberock, J., 2012. Thrust: A productivity-oriented library for CUDA. In GPU computing gems Jade edition

(pp. 359-371). Morgan Kaufmann.[3] Merrill, D., 2015. CUB: A pattern of “collective” software design, abstraction, and reuse for kernel-level programming.

Nvidia Research

Kernel time (ms) P100 V100

Reducer [1] 40.4 11.62

Thrust [2] 9.487 4.76

CUB [3] 9.224 4.69

https://github.com/intel/llvm/tree/sycl/sycl/doc/extensions/Reduction

Page 13: Evaluating the Performance of Integer Sum Reduction in

1313

Conclusion and Future Work

• Tuning workgroup sizes, vector widths, and workload sizes are important for performance improvement

• Thrust and CUB are mature libraries for parallel reduction on an Nvidia GPU

• There is a large optimization space for the SYCL reducer class

• Investigate the performance of SYCL applications that contain reduction kernels

Page 14: Evaluating the Performance of Integer Sum Reduction in

1414

Thanks to SYCL developers

Integer rules - Oxford Academy High School 2 student materials for BOCES.pdfInteger rules Addition Different signsSame signs ... An integer plus its opposite have a sum of zero. The

Integer rules - Oxford Academy High School 2 student materials for BOCES.pdfInteger rules Addition Different signsSame signs ... An integer plus its opposite have a sum of zero. The

Documents

XML Programming Guide - Limasfeed.limas.com/docs/LMFeedGuide.pdftotalvolume integer Total Volume totalfreq integer Total Frequency totallot integer Total Lot memcap integer Member

XML Programming Guide - Limasfeed.limas.com/docs/LMFeedGuide.pdftotalvolume integer Total Volume totalfreq integer Total Frequency totallot integer Total Lot memcap integer Member

Documents

AMPL Reference Manual · 454. AMPL REFERENCE MANUAL APPENDIX A. Current complements integer solve_result_num IN contains less suffix INOUT default logical sum …

AMPL Reference Manual · 454. AMPL REFERENCE MANUAL APPENDIX A. Current complements integer solve_result_num IN contains less suffix INOUT default logical sum …

Documents

BAB 2 PROGRAM INTEGER 2.1 Program Linearainisuri.staff.gunadarma.ac.id/Downloads/files/46375/Pemograman... · Program linear merupakan metode matematika untuk mengalokasikan sum-

BAB 2 PROGRAM INTEGER 2.1 Program Linearainisuri.staff.gunadarma.ac.id/Downloads/files/46375/Pemograman... · Program linear merupakan metode matematika untuk mengalokasikan sum-

Documents

Evaluating the impact of AND/OR search on 0-1 integer

Evaluating the impact of AND/OR search on 0-1 integer

Documents

These computer-administered, modular tests are intended to ... · • greater or less than a negative integer ... difference of two squares, greatest common ... numbers (sum, difference,

These computer-administered, modular tests are intended to ... · • greater or less than a negative integer ... difference of two squares, greatest common ... numbers (sum, difference,

Documents

Evaluating Lump Sum Incentives for Delayed Social Security ......Evaluating Lump Sum Incentives for Delayed Social Security Claiming* Olivia S. Mtichell and Raimond Maurer Abstract

Evaluating Lump Sum Incentives for Delayed Social Security ......Evaluating Lump Sum Incentives for Delayed Social Security Claiming* Olivia S. Mtichell and Raimond Maurer Abstract

Documents

syllabus 20094. Write C program for Pascal triangle. 5. Write C program to find sum and average of n integer using linear array. 6. Write C program to find sum of the digits a given

syllabus 20094. Write C program for Pascal triangle. 5. Write C program to find sum and average of n integer using linear array. 6. Write C program to find sum of the digits a given

Documents

POSITIVE POLYNOMIALSweb.math.ucsb.edu/~mputinar/POSAMS.pdf · 4 1. Minkowski and Hilbert Every positive integer is the sum of four squares LAGRANGE Academie des Sciences de Paris,

POSITIVE POLYNOMIALSweb.math.ucsb.edu/~mputinar/POSAMS.pdf · 4 1. Minkowski and Hilbert Every positive integer is the sum of four squares LAGRANGE Academie des Sciences de Paris,

Documents

The Gap Function: Evaluating Integer Programming Models ... · The Gap Function: Evaluating Integer Programming Models over Multiple Right-hand Sides Temitayo Ajayi 1, Christopher

The Gap Function: Evaluating Integer Programming Models ... · The Gap Function: Evaluating Integer Programming Models over Multiple Right-hand Sides Temitayo Ajayi 1, Christopher

Documents

Evaluating Algebraic Expressions 4-2 Integer Exponents Warm Up Warm Up California Standards California Standards Lesson Presentation Lesson PresentationPreview

Evaluating Algebraic Expressions 4-2 Integer Exponents Warm Up Warm Up California Standards California Standards Lesson Presentation Lesson PresentationPreview

Documents

POLYNOMIALS - WordPress.com integer. •A polynomial is an algebraic expression that consists of a sum of several monomials. 3 Standard form of a polynomial

POLYNOMIALS - WordPress.com integer. •A polynomial is an algebraic expression that consists of a sum of several monomials. 3 Standard form of a polynomial

Documents

Mixed-Integer Nonlinear Programming · Basic notions ofmixed-integer linear programming: mixed-integer linear program, integer variables, continuous relaxation MILPbranch-and-bound:

Mixed-Integer Nonlinear Programming · Basic notions ofmixed-integer linear programming: mixed-integer linear program, integer variables, continuous relaxation MILPbranch-and-bound:

Documents

up a number as a sum of Fibonacci numbers Fact: Every positive integer can be expressed uniquely as a sum of one or more distinct Fibonacci numbers no two of which are consecutive

up a number as a sum of Fibonacci numbers Fact: Every positive integer can be expressed uniquely as a sum of one or more distinct Fibonacci numbers no two of which are consecutive

Documents

adding inTEgErs - Core Focus on Math€¦ · step 8: Write an integer addition expression. Model it with integer chips to find the sum. step 9: Use integer chips to help you determine

adding inTEgErs - Core Focus on Math€¦ · step 8: Write an integer addition expression. Model it with integer chips to find the sum. step 9: Use integer chips to help you determine

Documents

· Solution: Since sum of two integers is zero, one integer is the additive inverse of other integer, like – 3, 3; – 4, 4 etc. But the difference has to be 10.File Size: 299KBPage

· Solution: Since sum of two integers is zero, one integer is the additive inverse of other integer, like – 3, 3; – 4, 4 etc. But the difference has to be 10.File Size: 299KBPage

Documents

BAB III PEMBAHASAN · 1. Kode Nilai Kd_nilai Integer 6 2. Kode Siswa Kd_siswa Integer 20 Foreign key 3. Attandance Atta Integer 5 4. Grammar Gram Integer 5 5. Listening List Integer

BAB III PEMBAHASAN · 1. Kode Nilai Kd_nilai Integer 6 2. Kode Siswa Kd_siswa Integer 20 Foreign key 3. Attandance Atta Integer 5 4. Grammar Gram Integer 5 5. Listening List Integer

Documents

Reversible Integer-to-Integer Wavelet Transforms for …frodo/publications/phdthesis.pdf · Reversible integer-to-integer ... 2.3 Multirate Filter Banks and Wavelet Systems ... 3.5

Reversible Integer-to-Integer Wavelet Transforms for …frodo/publications/phdthesis.pdf · Reversible integer-to-integer ... 2.3 Multirate Filter Banks and Wavelet Systems ... 3.5

Documents

St#Aloysius#College## # Name: Grade#9#Mathematics ...9+-+CAT+-+Indice… · Apply index laws to numerical expressions with integer indices (ACMNA209) • simplifying and evaluating

St#Aloysius#College## # Name: Grade#9#Mathematics ...9+-+CAT+-+Indice… · Apply index laws to numerical expressions with integer indices (ACMNA209) • simplifying and evaluating

Documents

Informatica B -Corso Boracchi · > Enter first integer > 45 > Enter second integer > 72 > Sum is 117 /* Sum of two integers */ #include int main(){int

Informatica B -Corso Boracchi · > Enter first integer > 45 > Enter second integer > 72 > Sum is 117 /* Sum of two integers */ #include int main(){int

Documents

SOLVING MIXED INTEGER BILINEAR PROBLEMS · PDF fileSOLVING MIXED INTEGER BILINEAR PROBLEMS ... a mixed integer linear programming ... continuous and one integer variable. Although

SOLVING MIXED INTEGER BILINEAR PROBLEMS · PDF fileSOLVING MIXED INTEGER BILINEAR PROBLEMS ... a mixed integer linear programming ... continuous and one integer variable. Although

Documents

Chapter 5 Integer Programming · PDF file2 3 Integer Programming Models Types of Models Total Integer Model: All decision variables required to have integer solution values. 0-1 Integer

Chapter 5 Integer Programming · PDF file2 3 Integer Programming Models Types of Models Total Integer Model: All decision variables required to have integer solution values. 0-1 Integer

Documents

1 Chapter 4: Integer and Mixed-Integer Linear Programming Problems 4.1 Introduction to Integer and Mixed-Integer Linear Programming 4.2 Solving Integer

1 Chapter 4: Integer and Mixed-Integer Linear Programming Problems 4.1 Introduction to Integer and Mixed-Integer Linear Programming 4.2 Solving Integer

Documents

7. Integer linear optimization 7.1 Introductionpage.math.tu-berlin.de/~moehring/adm2/Chapter/chap7.pdf7. Integer linear optimization 7.1 Introduction 42-1 Integer linear programs (Integer

7. Integer linear optimization 7.1 Introductionpage.math.tu-berlin.de/~moehring/adm2/Chapter/chap7.pdf7. Integer linear optimization 7.1 Introduction 42-1 Integer linear programs (Integer

Documents

Aim: Write a C Program to find the sum of individual digits … No: Aim: Write a C Program to find the sum of individual digits of positive integer. Program: #include

Aim: Write a C Program to find the sum of individual digits … No: Aim: Write a C Program to find the sum of individual digits of positive integer. Program: #include

Documents

INTEGER - Weebly · INTEGER •Integer termasuk nombor bulat positif, nombor bulat negatif, dan sifar. •Integer = {… -5, -4, -3, - 2, -, , , , , , , …} CABARAN MINDA •Apakah

INTEGER - Weebly · INTEGER •Integer termasuk nombor bulat positif, nombor bulat negatif, dan sifar. •Integer = {… -5, -4, -3, - 2, -, , , , , , , …} CABARAN MINDA •Apakah

Documents

MATHEMATICS · positive integer, Binomial series, approximation, ... sum, difference product, quotient of two functions, constant function, linear function, polynomial

MATHEMATICS · positive integer, Binomial series, approximation, ... sum, difference product, quotient of two functions, constant function, linear function, polynomial

Documents

Practice and Evaluation. Practice Develop a java class called: SumCalculator.java which computes a sum of all integer from 1 to 100 and displays the result

Practice and Evaluation. Practice Develop a java class called: SumCalculator.java which computes a sum of all integer from 1 to 100 and displays the result

Documents

Integer code Integer code name String spelling

Integer code Integer code name String spelling

Documents

Approximating a tensor as a sum of rank-one components · Given a tensor Aand an integer k find krank-one tensors such that their sum is as “close” to A as possible. Negative

Approximating a tensor as a sum of rank-one components · Given a tensor Aand an integer k find krank-one tensors such that their sum is as “close” to A as possible. Negative

Documents