13
Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing evolution The “Griffin” architecture Memory enhancements Power management Thermal management 2 Next-Generation Mobile Computing August 2007

Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Next-Generation Mobile Computing:Balancing Performance and Power Efficiency

HOT CHIPS 19

Jonathan Owen, AMD

Agenda

The mobile computing evolution

The “Griffin” architecture

Memory enhancements

Power management

Thermal management

2 Next-Generation Mobile Computing August 2007

Page 2: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

3

The Evolution of the Notebook PC

Customers increasingly demand more for less

Reduced cost and complexity

Smaller form factors

Increased battery life

Durability and reliability

Enhanced connectivity

Simplified user interface and interaction

3 Next-Generation Mobile Computing August 2007

Architectural Challenges forNorthbridges in Mobile Platforms

UMA configurations present increasing challenges:

– Performance: Bandwidth for DirectX 10/Vista

– Power efficiency and battery life (avoid the local framebuffer)

Power efficiency limitations:

– Frequency and voltage cross dependencies limit thegranularity of power savings

– Frequency agility

To be addressed without changing the CPU core!

4 Next-Generation Mobile Computing August 2007

Page 3: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Addressing these Challenges

Increase system bandwidth for UMA solutions

– Performance: HyperTransport™ 3, dedicated display refreshvirtual channel, maximize DRAM efficiency

– Power: HT3 power management extensions, Memorycontroller on its own power plane

Power efficiency

– Split power planes, independent frequency selection (core0,core1, NB are independent)

– Fast frequency changes without PLL relock using digitalfrequency synthesizers

– Improved efficiency allows lower power for fixed workloads,or higher performance for fixed power

5 Next-Generation Mobile Computing August 2007

6

Enhanced performance andbattery life

Introducing “Griffin”

System on chip (SOC)methodology to accelerate designand integration

– New mobile optimizedNorthbridge

– Power optimized DDR2

– Enhanced HyperTransport™ 3connectivity

– Integrating 65nm cores andlarger L2 caches

New infrastructure featuresoptimized for the mobile segment

6 Next-Generation Mobile Computing August 2007

Page 4: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

7

Dual AMD64 CPU cores

A Few Details

Dual channel DDR2 interface– 64 bits per channel

– All speeds supported up to DDR2-800

– 12.8 GB/s peak memory bandwidth

– 16 GB max memory configuration

Dedicated 1 MB L2 cachesL2

Cache

0

L2Cache

1

CPUCore 0

CPUCore 1

Northbridge

HT3D

DR2

HyperTransport™ 3 I/O link

– Speeds supported up to 2.6 GHz

– 16x16 bit

– 10.4 GB/s simultaneous peakbandwidth in each direction

– Dynamic link power management

160 mm2, 225.6 million transistors

7 Next-Generation Mobile Computing August 2007

Mobile Optimized Memory Controller

Core 0

On-die Northbridge

MemoryController

Hyper-TransportTM

3.0Controller

L2Cache

DDR InterfaceHyperTransport 3.0

Interface

Core 1

L2Cache

Chipsetwith GPU

LCDDisplay

Dual ChannelDDR2

HT3 up to 16x16 link

Crossbar/Host Bridge

Cost and power efficient solution for UMA

Extended battery life withnew power saving features

– Operates on separate powerplane, and at a lower voltagethan the cores

– Enables C4 Deeper Sleep onsystems with UMA graphicswithout the need for local framebuffer

New integrated memorycontroller features:

Improvements in DRAM efficiency

Improved DRAM prefetcher

Dedicated Display Refresh channel

8 Next-Generation Mobile Computing August 2007

Page 5: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

DRAM Efficiency Maximization

Aggressive use of bypass paths to minimize idle latency

Write bursting

– Writes are accumulated and done at once to minimize busturnaround

Bank state tracking

– Chip selects are interleaved to increase number of distinct banks

– Unganging of DRAM channels further increases number of banks

– 16 banks per channel are tracked, using LRU algorithm

– Pages can be closed dynamically, based on bank access pattern

Out of Order (OOO) scheduling of requests, based on:

– Request priority (programmable by type: low/med/high)

– Page status (miss/hit/conflict)

9 Next-Generation Mobile Computing August 2007

Advanced DRAM Prefetcher

Memory Prefetch Table (MPT) hasentries for 8 outstanding threads

Capable of tracking single (+n, +n,+n) or double (+n, +m, +n) stridedpatterns, with strides up to ±4 lines

When request is received:

– stride2 = change in address + stride1

– stride1 = change in address

Prefetches 2 requests ahead

stride2 + last is predicted address;prefetches are issued whenconfidence threshold achieved

stride1 stride2 count0

MPT

addr

stride1 stride2 count1 addr

stride1 stride2 count2 addr

stride1 stride2 count3 addr

stride1 stride2 count4 addr

stride1 stride2 count5 addr

stride1 stride2 count6 addr

stride1 stride2 count7 addr

A A+n A+n+m A+2n+m A+2n+2m

stride1

stride2

+n +m

+n+m

+n

+n+m

+m

+n+m

10 Next-Generation Mobile Computing August 2007

Page 6: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Display Refresh Optimization

High bandwidth, high latency, but latency guaranteerequired to avoid display buffer underrun

Doesn't fit well in existing HyperTransportTM VC sets

– Base channels provide no latency guarantees

– Isoc channels are low bandwidth; can starve other traffic

Requests arrive via HyperTransportTM Isoc channel

– Detected by decoding coherence and ordering requirements

– Has dedicated buffer and routing resources internally

– Chipset must manage interaction with Isoc traffic

Memory controller priority is variable based on age

11 Next-Generation Mobile Computing August 2007

Dynamic Performance Scaling Capabilities

Core 0

Max P

P1

P2

P3

P4

P5

P6

Min P

V0

V1

V2

V3

Deep Sleep

Deeper Sleep (AltVID)

V4

C1 - Halt

Core 1

Max P

P1

P2

P3

P4

P5

P6

Min P

V0

V1

V2

V3

V4

C1 - Halt

Increased battery life with advanced power management features

Increased performance withinstantaneous frequencytransitioning

No PLL relock required

Minimize power consumption by alwaysrunning at optimal P-state

Lower operating minimum p-state

Reduced processor utilization withsimplified p-state transitions

Reduced power consumptionto increase battery life

Separate voltage planes for each core

Each core can operate at independentfrequency and voltage

12 Next-Generation Mobile Computing August 2007

Page 7: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Power-optimized HyperTransport™ 3

HyperTransport™ 3 increases performance while helping toextend battery life with power management features

x16x16

x8x8

x4x4

x2x2

Core 1Core 0

X X

Note: Animated scaling of link widths shown here are for illustration purposes. HT disconnect is required between each link width change.Actual link width scaling may vary based on implementation.

Extended battery life with powerreduction features

Dynamic scaling of link widths

Disconnecting HyperTransportTM whenidle, even while cores are executing

HW-autonomous, no SW support needed

Increased performance withover 3x increase in peak I/Obandwidth

Delivers increased bandwidth forDX10, a requirement for 2008Windows Vista™ Premium Logo

13 Next-Generation Mobile Computing August 2007

14

Serial VID Interface (SVI) Protocol provides pin-efficientmeans to control multiple independent voltage planes

Voltage Planes and Control

CPUCore 0

+ 1MB L2

CPUCore 1

+ 1MB L2

On-Die NorthBridge

VDDNB

VDD1

VDD0

SVI

SVIDC-DC

Independently-VariableVoltage Planes

VDD0 for CPU core 0

VDD1 for CPU core 1

VDDNB for on-die NorthBridge

Fixed Analog and I/OVoltage Planes

VDDIO & VTT for DDR PHY

VDDA for on-die PLL

VLDT for HyperTransport PHY

14 Next-Generation Mobile Computing August 2007

Page 8: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Fine Grain Power Management

Power efficiency via dynamic hardware power management

Current-generation power management schemesremain supported

CPU core power-state transitions

Simple software interface

Cores continue execution while frequency changes are in progress

Autonomous hardware power management

CPU and chipset work together to establish the most power efficient settings

Eliminates reliance on software for maximum power efficiency

The CPU informs the chipset when P-states change or HALT conditionreached

The chipset monitors I/O traffic and CPU state and establishes the optimalpower management profile for a given set of system conditions

15 Next-Generation Mobile Computing August 2007

Autonomous Hardware Power Management

GPUGPU

RenderRender

DisplayDisplay

ActiveActiveDeepDeep

SleepSleep

HT: 2 bits up, 8 bitsHT: 2 bits up, 8 bits

downdown

DRAM ActiveDRAM Active

Cores in deep sleepCores in deep sleep

HT: disconnectedHT: disconnected

DRAM Self RefreshDRAM Self Refresh

& & TristatedTristated

Cores in deep sleepCores in deep sleep

HT: 16 bits up, 16 bits downHT: 16 bits up, 16 bits down

DRAM ActiveDRAM Active

Cores in deep sleepCores in deep sleepGfxGfx

DriverDriver

HT: 4 bits up, 4HT: 4 bits up, 4

bits downbits down

DRAM ActiveDRAM Active

Cores executingCores executing

driver softwaredriver software

16 Next-Generation Mobile Computing August 2007

Page 9: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

17

Power and Performance Tradeoffs

FeatureFeature AttributesAttributes

Independent CPU core voltageplanes and frequency selection

Power consumption matchesCPU performance delivered

Separate voltage plane for on-die NorthBridge

Enables CPU deep sleep withintegrated graphics.

Dynamic HT ™ link powermanagement

Power consumption matchesinterconnect bandwidthdelivered

Autonomous hardware controlof CPU core deep sleep state

Increased residency in CPUdeep sleep state

Autonomous hardware controlof DRAM self-refresh state

Increased residency in DRAMsleep state

CPU core deep sleep wakeup toservice probes at lowestP-State

Increased residency in CPUdeep sleep state

17 Next-Generation Mobile Computing August 2007

Pmax,C0

Griffin Power Management Visualization

Pow

er

P0, C0

Core 1 On-dieNorth-

bridge

SystemMemory

HT Chipset/GPU

Core 0

Sta

te

Vmax,Fmax

Vmax,Fmax

Vmax,Fmax

P0, C1

Vmax,Fmax

P0, C0High

utilizationP0, C0 Active

16x16,2.6 GHz,

connected

High Perfstate

18 Next-Generation Mobile Computing August 2007

Page 10: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Griffin Power Management Visualization

P0, C0

Sta

te

P0, C1P0, C0

Vint,Fmax

/2

Core 1 On-dieNorth-

bridge

SystemMemory

HT Chipset/GPU

Core 0

Vmax,Fmax

Vmax,Fmax

Highutilization

P3, C0 Active16x16,

2.6 GHz,connected

High Perfstate

Pow

er

19 Next-Generation Mobile Computing August 2007

Highutilization

Griffin Power Management Visualization

P0, C0

Sta

te

P3, C0 Active16x16,

2.6 GHz,connected

High Perfstate

Vmax,Fmax

P0, C1

Vmin

Core 1 On-dieNorth-

bridge

SystemMemory

HT Chipset/GPU

Core 0

Pmin, C1

Pow

er

Vint,Fmax

/2

20 Next-Generation Mobile Computing August 2007

Page 11: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Self refresh

Griffin Power Management Visualization

Sta

te Clocksgated

Dis-connected

Core 1 On-dieNorth-

bridge

SystemMemory

HT Chipset/GPU

Core 0

Pmin, C1Low Perf

statePmin, C1

Pow

er

C1E with AltVIDapplied

(Deeper Sleep)

21 Next-Generation Mobile Computing August 2007

Active

Griffin Power Management Visualization

Sta

te

Pmin, C1 Active

2x8,2.6 GHz,

connected

Low Perfstate

(refresh)Pmin, C1

Core 1 On-dieNorth-

bridge

SystemMemory

HT Chipset/GPU

Core 0

Pow

er

C1E with AltVIDapplied

(Deeper Sleep)

22 Next-Generation Mobile Computing August 2007

Page 12: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

AMD Multi-Point Thermal Control

SB-TSI

PROCHOT

MEMHOT

ThermalMonitorCircuit

PROCHOT

“Griffin”

Core 1

Core 0

= Thermal sensor

“Rev. G”

SMBUS

Core 1

Core 0

Embedded Controller

Embedded Controller

Simple and accurate thermal management

Simplified thermalmanagement

Designed to automaticallyreduce p-state when temperatureexceeds pre-defined limit

Multiple on-die thermalsensors

New memory powermanagement

External thermal monitor canforce reduction in memorytemperature with MEMHOTsignal

23 Next-Generation Mobile Computing August 2007

24

Summary

“Griffin” is an important evolutionarystep in AMD’s notebook product portfolio

Greater memory efficiency for improvedprocessor and graphics bandwidth

Enhanced power management

Improved thermal management

Optimized for UMA power/performancewhile maintaining direct CPU-Memoryconnection

Simplified, reliable platform architecture

Low platform cost

Maximal CPU performance

Tighter integration of the chipset andCPU an important milestone to Fusion

24 Next-Generation Mobile Computing August 2007

Page 13: Next-Generation Mobile Computing · 2013-07-28 · Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing

Disclaimer

The information presented in this document is for information purposes only.

The information contained herein is subject to change and may be rendered inaccurate for many reasons, including,but not limited to product and roadmap changes, component and motherboard version changes, new model and/orproduct releases, product differences between differing manufacturers, software changes, BIOS flashes, firmwareupgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information.

However, AMD reserves the right to revise this information and to make changes from time to time to the contenthereof without obligation of AMD to notify any person of such revisions or changes.

AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES

NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION.

AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANYPARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIALOR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN OR

FOR THE PERFORMANCE OR OPERATION OF ANY PERSON, INCLUDING, WITHOUT LIMITATION, ANY LOST PROFITS,BUSINESS INTERRUPTION, DAMAGE TO OR DESTRUCTION OF PROPERTY, OR LOSS OF PROGRAMS OR OTHERDATA, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

Trademark Attribution

AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. PCIe is aregistered trademark of PCI-SIG. HyperTransport is a licensed trademark of the HyperTransport Consortium. Othernames used in this presentation are for identification purposes only and may be trademarks

of their respective owners.

©2007 Advanced Micro Devices, Inc. All rights reserved.

25 Next-Generation Mobile Computing August 2007