Using CodeAnalyst on Red Hat Enterprise Linux to Understand Performance on AMD Servers

Name: Sanjay Rao, D. John Shakshober

Date: May 10, 2007

AMD CodeAnalyst (CA) profiling of various user applications running on RHEL5 GA.

System Configurations

● Tyan AMD 8cpu, 4socket, dual core, 1 dual QLA2342 FiberChannel, 28 15k RPM disks, on HP Enterprise Virtual Array 4000, dual path MPIO

McCalpin Stream Benchmark

● Copy Bandwidth – 1 GB per stream, 1, 2, 4 and 8 streams
● With and without NUMA
● Measure IPC and L2 cache, bus traffic
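The Copy measurement can be illustrated with a minimal Python sketch. This is not the McCalpin STREAM benchmark itself (which is written in C and Fortran), and interpreter overhead means the figures it reports are not comparable to the results in this deck. The function name `copy_bandwidth_mb_s` and the array size are assumptions chosen for illustration. It follows the STREAM convention of counting both the bytes read and the bytes written.

```python
import time
from array import array


def copy_bandwidth_mb_s(n_elements: int, repeats: int = 5) -> float:
    """Return the best observed bandwidth in MB/s for the copy c[:] = a."""
    a = array("d", [1.0]) * n_elements  # source: 8-byte doubles
    c = array("d", [0.0]) * n_elements  # destination, same size
    # STREAM counts traffic in both directions: bytes read from a plus bytes written to c.
    bytes_moved = 2 * n_elements * a.itemsize
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        c[:] = a  # the Copy kernel
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    best = max(best, 1e-9)  # guard against a zero reading from the timer
    return bytes_moved / best / 1e6


if __name__ == "__main__":
    # 1,000,000 doubles = 8 MB per array; the deck used 1 GB per stream.
    print(f"copy bandwidth: {copy_bandwidth_mb_s(1_000_000):.1f} MB/s")
```

Running several streams at once would mean starting one such process per stream; how each process is placed relative to its memory is what the NUMA versus non-NUMA comparison varies.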

Oracle OLTP workload

● Random 2k IO's (50% Read/50% Write), Sequential Write to logs, EXT3
● Vary user count, tune SGA to saturate 8cpu, using EXT3 Direct and Async I/O
● Number of transactions / minute (tpm)
● Run with and without Large pages (HugeTLBfs)
● Measure IPC, Translation Buffer Misses

Tyan AMD64 NUMA Memory Layout

[Figure: memory layout diagram. Labels: Memory; cores C0 and C1; sockets S1, S2, S3, S4; a process on S1C0 shown with interleaved memory and with non-interleaved (NUMA) memory; 1 hop to any memory bank.]

McCalpin Streams Copy Bandwidth (1,2,4,8)

[Chart: x-axis: No. of Streams (1, 2, 4, 8); series: NonNuma, Numa, % Difference.]

IPC Comparison – McCalpin Streams

Data Access Comparison – McCalpin Streams

Instruction Comparison – McCalpin Streams

L2 Cache Comparison – McCalpin Streams

CA used to monitor CPU and data access stalls w/ a complex database workload, Oracle 10G

Oracle OLTP workload

● Random 2k IO's (50% Read/50% Write), Sequential Write to logs, EXT3
● Vary user count, tune SGA to saturate 8cpu, using EXT3 Direct and Async I/O
● Number of transactions / minute (tpm)
● Run with and without Large pages (HugeTLBfs)
● Measure IPC, Translation Buffer Misses
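Before comparing runs with and without large pages, it helps to confirm how many huge pages the kernel has actually reserved. The sketch below parses the huge page fields that Linux reports in `/proc/meminfo`. The helper name `parse_hugepage_info` is hypothetical, and the sample text holds illustrative values, not measurements from the systems in this deck.

```python
def parse_hugepage_info(meminfo_text: str) -> dict:
    """Extract the huge page fields from the text of /proc/meminfo."""
    fields = {}
    for line in meminfo_text.splitlines():
        key, sep, rest = line.partition(":")
        if sep and (key.startswith("HugePages_") or key == "Hugepagesize"):
            # Values look like "1024" or "2048 kB"; keep the leading integer.
            fields[key] = int(rest.split()[0])
    return fields


# Illustrative sample only; on a live system read open("/proc/meminfo").read().
SAMPLE = """MemTotal:       16436092 kB
HugePages_Total:    1024
HugePages_Free:     1024
Hugepagesize:       2048 kB
"""

info = parse_hugepage_info(SAMPLE)
# Hugepagesize is reported in kB, so divide by 1024 to get MB.
reserved_mb = info["HugePages_Total"] * info["Hugepagesize"] // 1024
print(f"huge pages reserved: {info['HugePages_Total']} ({reserved_mb} MB)")
```

If `HugePages_Total` is 0, the workload is running on standard pages regardless of how the database is configured.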

The Translation Lookaside Buffer (TLB) is a small CPU cache of recently used virtual-to-physical address mappings

TLB misses are extremely expensive on today's very fast, pipelined CPUs

Large memory applications can incur high TLB miss rates

HugeTLBs permit memory to be managed in very large segments

● Standard page: 4KB
● Default huge page: 2MB
● Roughly 500:1 difference (2MB / 4KB = 512)

File system mapping interface

Ideal for databases

● E.g. the TLB can fully map a 2GB Oracle SGA w/ 1024 TLB entries
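The page-count arithmetic behind these figures can be checked with a short sketch. The helper name `pages_needed` is hypothetical; the page sizes and the 2GB SGA size are the ones quoted on this slide.

```python
KB = 1024
MB = 1024 * KB
GB = 1024 * MB


def pages_needed(region_bytes: int, page_bytes: int) -> int:
    """Number of pages (and so TLB entries) needed to map a region, rounded up."""
    return -(-region_bytes // page_bytes)  # ceiling division


sga_bytes = 2 * GB
standard_pages = pages_needed(sga_bytes, 4 * KB)  # 4KB standard pages
huge_pages = pages_needed(sga_bytes, 2 * MB)      # 2MB huge pages
size_ratio = (2 * MB) // (4 * KB)                 # huge page size over standard page size

print(standard_pages)  # 524288
print(huge_pages)      # 1024
print(size_ratio)      # 512
```

With 4KB pages the same SGA needs 524,288 mappings, far more than any TLB holds at once, which is why large memory applications see high miss rates on standard pages.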

[Figure: HugeTLBFS diagram. Labels: Physical Memory; Virtual Address Space; 128 data, 128 instruction.]

Oracle 10G OLTP Performance (tpm k) 4k vs 2MB huge pages

[Chart: categories: Trans / min, DTLB Accesses, IC – Misses, L2 Misses; value axis from 50000 to 400000; series: RHEL5, RHEL5 – Hugepages, % Difference.]

Data Access – DTLB Assessment Comparison – Oracle Workload

Instruction Cycle Comparison – Oracle Workload

L2 Cache Comparison – Oracle Workload

IPC Comparison – Oracle Workload

RHEL and AMD CodeAnalyst w/ Oprofile

Runs w/ standard RHEL oprofile (install sysstat)

Download CA rpm from the AMD developer page

GUI allows for easy data collection of

● Cycles, retired inst profile, IPC calculation
● Data cache access (both I and D)
● Memory subsystem performance
● NUMActl at OS, L2 references
● Translation buffer analysis (TLB)
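The IPC figure is the ratio of retired instructions to CPU cycles over the same sampling interval. The sketch below shows that calculation along with one plausible form of the "% Difference" series used in the charts. Both function names are hypothetical, the percent-difference formula is an assumption (the deck does not state it), and the counts are made-up inputs rather than results from these runs.

```python
def ipc(retired_instructions: int, cpu_cycles: int) -> float:
    """Instructions per cycle: retired instructions divided by CPU cycles."""
    if cpu_cycles <= 0:
        raise ValueError("cpu_cycles must be positive")
    return retired_instructions / cpu_cycles


def percent_difference(baseline: float, candidate: float) -> float:
    """Relative change of candidate versus baseline, in percent (assumed formula)."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (candidate - baseline) / baseline * 100.0


# Hypothetical event counts for two runs of the same workload.
ipc_standard = ipc(retired_instructions=500_000, cpu_cycles=1_000_000)
ipc_hugepages = ipc(retired_instructions=600_000, cpu_cycles=1_000_000)
print(f"IPC change: {percent_difference(ipc_standard, ipc_hugepages):.1f}%")
```

A higher IPC for the same workload means fewer cycles lost to stalls such as cache or TLB misses.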
