37
2006 Intel Distinguished Lecture Multi Multi - - Core Microprocessor Chips: Core Microprocessor Chips: Motivation & Challenges Motivation & Challenges Dileep Bhandarkar, Ph. D. Dileep Bhandarkar, Ph. D. Architect at Large Architect at Large Digital Enterprise Group Digital Enterprise Group Intel Corporation Intel Corporation May 2006 May 2006 Copyright Copyright © © 2006 Intel Corporation. 2006 Intel Corporation.

Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecture

MultiMulti--Core Microprocessor Chips:Core Microprocessor Chips:Motivation & ChallengesMotivation & Challenges

Dileep Bhandarkar, Ph. D.Dileep Bhandarkar, Ph. D.Architect at LargeArchitect at Large

Digital Enterprise GroupDigital Enterprise GroupIntel CorporationIntel Corporation

May 2006May 2006

Copyright Copyright ©© 2006 Intel Corporation.2006 Intel Corporation.

Page 2: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

©2006, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Agenda

Page 3: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

180nm 130nm 90nm 65nm 45nmWafer Size (mm): 200 200/300 300 300 3001st Production: 1999 2001 2003 2005 2007

Intel only:Intel only:OnOn--time time ““22--yearyear--cyclecycle””

SiGeSiGe

100nm LG 70nm LG 50nm LG 35nm LG DetailsCoSi2 CoSi2 NiSi NiSi Coming!

Strain Si Strain Si6 Al 6 Cu 7 Cu 8 CuSiOF SiOF Low-k Low-k

Transistors:

Interconnects:

Page 4: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

Process Name P1262 P1264 P1266 P1268

Lithography 90 nm 65 nm 45 nm 32 nm

1st Production 2003 2005 2007 2009

45 nm Logic Process on Track 45 nm Logic Process on Track for Delivery in 2007for Delivery in 2007

Moore's Law continues!Moore's Law continues!Intel continues to develop a new technology Intel continues to develop a new technology

generation every 2 yearsgeneration every 2 years

Page 5: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

Intel 11th EMEA Academic Forum

Historical Driving ForcesHistorical Driving Forces

0.01

0.1

1

10

1970 1980 1990 2000 2010 20200.01

0.1

1

10

1970 1980 1990 2000 2010 20201

10

100

1000

10000

100000

1970 1980 1990 2000 2010 20201

10

100

1000

10000

100000

1970 1980 1990 2000 2010 2020

Increased PerformanceIncreased Performancevia Increased Frequencyvia Increased Frequency

FeatureFeatureSizeSize(um)(um)

FrequencyFrequency(MHz)(MHz)

20052005MontecitoMontecito

1.7B Transistors1.7B Transistors

197119714004 Processor4004 Processor2300 Transistors2300 Transistors

Shrinking GeometryShrinking Geometry

197819788008 Processor8008 Processor

IBM PCIBM PC

19851985i386 Processori386 Processor

3232--bitbit

19931993Pentium ProcessorPentium Processor3.1M transistors3.1M transistors

Page 6: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

The ChallengesThe Challenges

10

100

1000

1990 1995 2000 2005 2010 2015

CPUCPUPowerPower(W)(W)

30nm

45nm65nm

90nm

0.13um

0.18um0.25um

0.35um

0.5um

0.7um

0.1

1

10

1990 1993 1997 2001 2005 2009

~30%

30nm

45nm65nm

90nm

0.13um

0.18um0.25um

0.35um

0.5um

0.7um

0.1

1

10

1990 1993 1997 2001 2005 2009

~30%SupplySupplyVoltage Voltage

(V)(V)

Power = Capacitance x VoltagePower = Capacitance x Voltage22 x Frequencyx Frequencyalsoalso

Power ~ VoltagePower ~ Voltage33

Power LimitationsPower Limitations Diminishing Voltage ScalingDiminishing Voltage Scaling

Page 7: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 8: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Design ChallengesDesign ChallengesMemory latency not scaling as fast as Memory latency not scaling as fast as processor speedprocessor speedPower growing nonPower growing non--linearly with single thread linearly with single thread performanceperformanceDesigner productivity lagging design Designer productivity lagging design

complexitycomplexityAbility to validate and test complex designAbility to validate and test complex designKeeping up with new process technology Keeping up with new process technology

every two yearsevery two years

Page 9: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Long Latency DRAM Accesses:Long Latency DRAM Accesses:Needs Latency Tolerant TechniquesNeeds Latency Tolerant Techniques

00

200200

400400

600600

800800

10001000

12001200

14001400

PentiumPentium®®ProcessorProcessor

66 MHz66 MHz

PentiumPentium--ProProProcessorProcessor200 MHz200 MHz

Pentium IIIPentium IIIProcessorProcessor1100 MHz1100 MHz

Pentium 4Pentium 4ProcessorProcessor

2 GHz2 GHz

Future CPUsFuture CPUs

Peak

Inst

ruct

ions

dur

ing

DR

AM

Acc

ess

Peak

Inst

ruct

ions

dur

ing

DR

AM

Acc

ess

Page 10: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

DRAM Latency ToleranceDRAM Latency Tolerance

Continue building even larger cachesContinue building even larger caches–– Every semiconductor process generation Every semiconductor process generation

provides opportunity to double cache sizeprovides opportunity to double cache size–– Cache becomes larger part of dieCache becomes larger part of die

Hide multiple threads of execution Hide multiple threads of execution behind memory latencybehind memory latencyIntel implemented simultaneous multiIntel implemented simultaneous multi--threading in 2000threading in 2000Implement multiImplement multi--core products as core products as MooreMoore’’s Law allowss Law allows

Page 11: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 12: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Situational AnalysisSituational AnalysisWith Each Process Generation transistor density With Each Process Generation transistor density doublesdoubles–– Frequency has increased by ~1.5X; ~1.3x in futureFrequency has increased by ~1.5X; ~1.3x in future–– Vcc has scaled by about ~0.8x; ~0.9x in futureVcc has scaled by about ~0.8x; ~0.9x in future–– Capacitance has scaled by 0.7x Capacitance has scaled by 0.7x –– Total power may not scale down due to increased leakageTotal power may not scale down due to increased leakage

Instruction Level Parallelism harder to find Instruction Level Parallelism harder to find Increasing singleIncreasing single--stream performance often stream performance often requires nonrequires non--linear increase in design complexity linear increase in design complexity Many server applications are inherently parallelMany server applications are inherently parallelParallelism exists in multimedia applications Parallelism exists in multimedia applications MultiMulti--tasking usage models becoming popular tasking usage models becoming popular

Page 13: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Processor PowerProcessor Power

Page 14: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Design Complexity and Productivity factorsDesign Complexity and Productivity factors

Huge transistor Huge transistor budgets stress ability budgets stress ability to design and verify to design and verify complex chipscomplex chipsMultiMulti--core fits well with core fits well with increasing transistor increasing transistor budgetsbudgetsMultiMulti--core design core design addresses addresses density/designer gapdensity/designer gap

Page 15: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 16: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Iron Law of PerformanceIron Law of PerformanceExecution Time is the product ofExecution Time is the product of–– Path LengthPath Length–– Cycles Per Instruction (CPI)Cycles Per Instruction (CPI)–– Cycle TimeCycle TimeCPI is the sum ofCPI is the sum of–– infiniteinfinite--cache core cpi cache core cpi –– miss rate * effective memory latencymiss rate * effective memory latencyBad (good) news is that Bad (good) news is that performance does not scale up performance does not scale up (down) linearly with frequency(down) linearly with frequency

Page 17: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

The Magic of Voltage ScalingThe Magic of Voltage ScalingPower = Capacitance * VoltagePower = Capacitance * Voltage2 * FrequencyFrequencyFrequency Frequency αα Voltage in region of interestVoltage in region of interestPower increases as the cube of FrequencyPower increases as the cube of FrequencyGood news is that voltage scaling worksGood news is that voltage scaling works10% reduction in voltage yields10% reduction in voltage yields–– 10% reduction in frequency10% reduction in frequency–– 30% reduction in power30% reduction in power–– less than 10% reduction in performanceless than 10% reduction in performance

Page 18: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Simple Dual Core ExampleSimple Dual Core ExampleAssume Single Core processor at 100WAssume Single Core processor at 100W–– 80W for core, 20W for cache and I/O80W for core, 20W for cache and I/O–– 50% die are is core50% die are is core

Dual core within same power envelopDual core within same power envelop–– 20W for I/O and cache20W for I/O and cache–– 40W per core40W per core–– Die size increases by 50%Die size increases by 50%–– Reduce voltage by 21% to reduce core power Reduce voltage by 21% to reduce core power

to 40Wto 40W–– Frequency reduces by ~20%Frequency reduces by ~20%–– Single thread perf reduces by ~15%Single thread perf reduces by ~15%–– Throughput increases by 70Throughput increases by 70--80%80%

Page 19: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Possible ImprovementsPossible ImprovementsDevelop new power efficient core Develop new power efficient core –– E.g. extensive clock gatingE.g. extensive clock gating–– Big power savings with little or no Big power savings with little or no

performance lossperformance lossDesign a smaller core with lower Design a smaller core with lower

performanceperformance–– Area and power savings much greater than Area and power savings much greater than

performance lossperformance loss–– Use larger number of coresUse larger number of coresAdjust frequency and power of each Adjust frequency and power of each

core with load factorcore with load factor–– Inactive cores can be put in sleep modeInactive cores can be put in sleep mode–– Maintain overall die power constantMaintain overall die power constant

Page 20: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

A New EraA New Era……

PerformanceEquals Frequency

Unconstrained Power

Voltage Scaling

PerformanceEquals IPC

THE OLDTHE OLD

THE NEWTHE NEW

Multi-Core

MicroarchitectureAdvancements

Power Efficiency

Page 21: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

IntelIntel®® WideWideDynamic ExecutionDynamic Execution

IntelIntel®® AdvancedAdvancedDigital Media BoostDigital Media Boost

IntelIntel®® IntelligentIntelligentPower CapabilityPower Capability

IntelIntel®® SmartSmartMemory AccessMemory Access

IntelIntel®® AdvancedAdvancedSmart CacheSmart Cache

Intel Core MicroIntel Core Micro--architecturearchitectureFive Key InnovationsFive Key Innovations

Page 22: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

MultiMulti--Core TrajectoryCore Trajectory

DualDual--CoreCore

QuadQuad--CoreCore

2H 20062H 2006 1H 20071H 2007

Page 23: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

Architecture Transitions

Page 24: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

Shrink/DerivativePRESLER · YONAH · DEMPSEY

New MicroarchitectureMEROM · CONROE · WOODCREST

65nm

2 Y

EA

RS

45nm

2 Y

EA

RS Shrink/Derivative

PENRYN

New MicroarchitectureNEHALEM

32nm

2 Y

EA

RS Shrink/Derivative

NEHALEM-C

New Microarchitecture

PRINCIPLES

1. One micro-architecture for all high volume market segments

2. Optimized for performance/watt

3. Parallel design teams

4. No waiting on new process technology

5. Chipset cadence offset for fast ramp

Microprocessor Design Model

GESHER

OBJECTIVE: Sustained Technology Leadership

Page 25: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 26: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Possible EvolutionPossible EvolutionTransistor density doubles with each process Transistor density doubles with each process generationgenerationNew generation enables complex new coreNew generation enables complex new corePossible alternative design pointPossible alternative design point–– Double the cache capacity in same areaDouble the cache capacity in same area–– Double the number of processor coresDouble the number of processor cores–– Frequency improves with process technologyFrequency improves with process technology

CoreCore

CacheCache

CoreCore CoreCoreCoreCore CoreCore

CoreCore CoreCore

2 x Cache2 x Cache 4 x Cache4 x Cache

90 nm90 nm 65 nm65 nm 45 nm45 nm

Page 27: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

Ramping Multi-core Everywhere

All products and dates are preliminary and subject to change without notice.

Desktop Desktop Mainstream/PerformanceMainstream/Performance

Server

MobileMobileMainstream/PerformanceMainstream/Performance

Server & Server & WorkstationWorkstation

Shipping >85% ~100%

* Data is projected run rate exiting the year. Source: Intel

2005 2006* 2007*

ShippingShipping >70%>70% >90%>90%

ShippingShipping >70%>70% >90%>90%

Desktop Desktop ClientClient

Mobile Mobile ClientClient

Expect to ship >60 million multiExpect to ship >60 million multi--core core processors by end of 2006processors by end of 2006

Page 28: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

CMP ChallengesCMP ChallengesHow much Thread Level Parallelism is How much Thread Level Parallelism is

there in most workloads?there in most workloads?Ability to generate code with lots of Ability to generate code with lots of

threads & performance scalingthreads & performance scalingThread synchronizationThread synchronizationOperating systems for parallel machinesOperating systems for parallel machinesSingle thread performance tradeoffSingle thread performance tradeoffPower limitationsPower limitationsOnOn--chip interconnect/cache infrastructurechip interconnect/cache infrastructureMemory and I/O bandwidth requiredMemory and I/O bandwidth required

Page 29: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

IntelIntel’’s Software Tools and Supports Software Tools and Support

Thread Checker Thread Checker Thread ProfilerThread Profiler

VTuneVTune™™ AnalyzersAnalyzers

Math Kernel LibrariesMath Kernel LibrariesPerformance PrimitivesPerformance Primitives

CompilersCompilers

Solutions, Blueprints,Solutions, Blueprints,Sizing/Scaling GuidesSizing/Scaling Guides

Solution ServicesSolution ServicesDeveloper ServicesDeveloper Services

Software CollegeSoftware CollegeEarly Access ProgramsEarly Access Programs

Driver Optimization Driver Optimization LabsLabs

Drivers

Page 30: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

How Many Cores?How Many Cores?

Where does the doubling stop?Where does the doubling stop?–– Driven by software issuesDriven by software issuesToday Microsoft Windows supports Today Microsoft Windows supports only 64 threads!only 64 threads!How many applications scale to 64 How many applications scale to 64 threads?threads?How well does performance scale How well does performance scale with thread count?with thread count?

Page 31: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMPBeyond CMPSummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 32: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Looking Beyond CMPLooking Beyond CMP

How far do we push the number How far do we push the number of general purpose cores?of general purpose cores?Is there are role for application Is there are role for application specific engines?specific engines?Programming model for Programming model for heterogeneous coresheterogeneous cores

Page 33: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Improving Power EfficiencyImproving Power EfficiencyP

erfo

rman

ce

Power

penalty:

parallelism

Conventional design 1% performancefor 3% in power

One processor

2 CMP

3 CMP

4 CMP

*CMP = Symmetric General Purpose (GP) cores

CMP*

penalty:

Specialized MIPS

~1% performancefor 1% in power

>3% performancefor 1% in power

Page 34: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Application Specific EnginesApplication Specific Engines

Can achieve better power efficiency than Can achieve better power efficiency than general purpose coresgeneral purpose coresSimpler design due to targeted application Simpler design due to targeted application and lack of support for full operating and lack of support for full operating systemsystemChallengeChallenge–– Needs to support high volume applicationNeeds to support high volume application–– Reconfigurable?Reconfigurable?

Graphics and Multimedia engines are good Graphics and Multimedia engines are good candidatescandidates

Page 35: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Semiconductor Technology EvolutionSemiconductor Technology EvolutionDesign ChallengesDesign ChallengesWhy MultiWhy Multi--Core Processor Chips?Core Processor Chips?Power/Performance TradePower/Performance Trade--OffsOffsCMP DirectionsCMP DirectionsBeyond CMP Beyond CMP SummarySummary

Agenda

©2005, Intel CorporationIntel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries

*Other names and brands may be claimed as the property of others

Page 36: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

SummarySummaryOne billion transistors are here already!One billion transistors are here already!Chip Level Multiprocessing and large Chip Level Multiprocessing and large

caches can exploit Moorecaches can exploit Moore’’s Laws LawAmount of parallelism in future Amount of parallelism in future

microprocessor systems will increasemicroprocessor systems will increaseHeterogeneous cores may emerge Heterogeneous cores may emerge eventuallyeventuallyNeed applications and tools that can Need applications and tools that can

exploit parallelismexploit parallelismDesign challenges and software issues Design challenges and software issues remainremain

Collaborate, Innovate, Lead!Collaborate, Innovate, Lead!

Page 37: Multi-Core Microprocessor Chips: Motivation & Challenges€¦ · 03/05/2006  · Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 45 nm Logic Process on Track

2006 Intel Distinguished Lecturewww.intel.com/education

Closing ThoughtClosing Thought““DonDon’’t be encumbered by t be encumbered by past history, go off and do past history, go off and do something wonderful.something wonderful.””

-- Robert Robert NoyceNoyceIntel CoIntel Co--founderfounder