DVClub Sept 14 Updated Presentation v2 - T&VS · Performance Verification Nick Heaton, Cadence...

Preview:

Citation preview

Performance Verification

Nick Heaton, Cadence Design SystemsDVClub Sept 8th 2014November 14 2014

• Challenges of performance verification– ARMv8 Mobile CPU Subsystem Example

• UVM Testbench Requirements

Agenda

2 © 2014 Cadence Design Systems, Inc. All rights reserved.

• Performance Characterization

ARM® ARMv8-A Mobile Example CPU Subsystem

Cortex®-A57 Cortex®-A53

MailTM-T760GPU

3 © 2014 Cadence Design Systems, Inc. All rights reserved.

ARM® ARMv8-A Mobile Example CPU Subsystem Performance Challenges

4 © 2014 Cadence Design Systems, Inc. All rights reserved.

What is the latency of the processor clusters

to memory paths including all async

bridges ?

ARM® ARMv8-A Mobile Example CPU Subsystem Performance Challenges

5 © 2014 Cadence Design Systems, Inc. All rights reserved.

What is the latency of the processor clusters to memory

paths including all async bridges ?

What is the bandwidth of the paths from IP with high bandwidth

demands to memory ?

ARM® ARMv8-A Mobile Example CPU Subsystem Performance Challenges

What is the bandwidth and latency of the paths from real-time IP to

memory ?

6 © 2014 Cadence Design Systems, Inc. All rights reserved.

ARM® ARMv8-A Mobile Example CPU Subsystem Performance Challenges

7 © 2014 Cadence Design Systems, Inc. All rights reserved.

How do I ensure adequate performance when ALL critical IPs

are active ?

VIP

VIP

Active AMBA VIP

Passive AMBA VIP

Sy

ste

m S

core

bo

ard

& P

erf

orm

an

ce M

on

ito

r V

IP

VIP VIP VIP VIPVIP VIP VIP

Characterization TestsRouting Model

UVMTestbench

UVM TestbenchPerformance Characterization

8 © 2014 Cadence Design Systems, Inc. All rights reserved.

Sy

ste

m S

core

bo

ard

& P

erf

orm

an

ce M

on

ito

r V

IP

VIPVIPVIPVIP

DUT

• Systematic Approach– Take each path (from Master IP to Memory) in turn and do analysis for

– maxBandwidth– Fully saturating traffic – Read, Write and Read/Write transactions– Different burst lengths 1,2,4,8,16 beats (based on full bus width)

– minLatency

ARM® ARMv8-A Mobile Example SoCPerformance Characterization Requirements

9 © 2014 Cadence Design Systems, Inc. All rights reserved.

– minLatency– Single outstanding transaction– Read, Write transactions– Different burst lengths 1,2,4,8,16 beats (based on full bus width)

– Outstanding Transaction Sweep– Constant traffic– Read and Write transactions– Sweep outstanding transaction levels from 1 to maxOT for each interface

ARM® ARMv8-A Mobile Example SoCPerformance Characterization Analysis - maxBandwidth

10 © 2014 Cadence Design Systems, Inc. All rights reserved.

ARM® ARMv8-A Mobile Example SoCPerformance Characterization Analysis – minLatency

11 © 2014 Cadence Design Systems, Inc. All rights reserved.

Interconnect WorkbenchAssembly

UVM Testbench

IP-specificTraffic Profiles

CoreLink 400 System IP

RTL & IP-XACT

Interconnect WorkbenchAnalysis &

Debug

PerformanceAnalyzer

PerformanceCharacterization

Tests

Automate Simulate Analyze

Cadence VIP Cadence VIP Library for AMBA®

UserMeta-Data

SoC PerformanceTestbench

Cadence Interconnect Workbench

12 © 2014 Cadence Design Systems, Inc. All rights reserved.

PerformanceMeasurements

PerformanceAnalysis

VerificationClosure

For Interconnect IP Integration•Performance of use case traffic loads•Verify configuration functionalityFor SoC Integration•Validate performance in context of IPs

Benefits� Shorten performance tuning and analysis iteration loop from

days to hours� Reduce testbench development time from weeks to hours

Tune Architecture

Generated Testbench Flow

SoC VerificationTestbench

VIP

VIP

Active AMBA VIP

Passive AMBA VIP

Sy

ste

m S

core

bo

ard

& P

erf

orm

an

ce M

on

ito

r V

IP

VIP VIP VIP VIPVIP VIP VIP

Synthesized Traffic WorkloadsRouting Model

UVMTestbench

UVM TestbenchWorkload Analysis – for QoS refinement

13 © 2014 Cadence Design Systems, Inc. All rights reserved.

Sy

ste

m S

core

bo

ard

& P

erf

orm

an

ce M

on

ito

r V

IP

VIPVIPVIPVIP

DUT

• Significant challenges in predicting and optimizing SoC performance– Multiplicity of IP configuration options particularly in Interconnect and DDR space– Need a systematic approach which has the potential to be automated

• Performance verification accomplished in three steps– Characterization – fully automated and can be checked as a standard regressions step

Summary

14 © 2014 Cadence Design Systems, Inc. All rights reserved.

– Characterization – fully automated and can be checked as a standard regressions step– Architectural – Establish QoS functions as expected– Use-case – hunt for corner case issues

• Cadence Interconnect Workbench supports all stages of the process– Automation of testbench, supports ARM Corelink System IP– Automation of the characterization test suite– Comprehensive analysis and checking capabilities– Traffic Synthesizers for architectural and use-case analysis

Recommended