60
1 SEMINAR REPORT On I-7 processor architecture Submitted for partial fulfillment of award for the degree of Bachelor of Technology Computer Science and Engineering Submitted by AVNEESHWAR SINGH(0813310024) ATUL KUMAR MISHRA (0813310023) ANKUR CHAUHAN(0813310014) Submitted to Mr.Mohd Jawed Khan

Final Report on i7

Embed Size (px)

Citation preview

1

SEMINAR REPORT On

I-7 processor architectureSubmitted for partial fulfillment of award for the degree of Bachelor of Technology Computer Science and Engineering Submitted by AVNEESHWAR SINGH(0813310024) ATUL KUMAR MISHRA (0813310023) ANKUR CHAUHAN(0813310014) Submitted to Mr.Mohd Jawed Khan

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING NOIDA INSTITUE OF ENGINEERING AND TECHNOLOGY GREATER NOIDA

2

ACKNOWLEDGMENTI express my sincere gratitude to N.I.E.T for giving me an opportunity to express and say a few worlds about the research that I have done in the seminar. I also thank all the faculty members, batch mates for imparting their skills nad taking pain in giving me precious knowledge about various concept of i7 processor architecture. I owe my regards to Mr. Mohd. Jawed Khan (lecturer, computer science department) for his encouragement nad valueable suggestions. A great appreciation is extended to all our teachers for their encouragement and valueable suggestion. Regards, Avneeshwar Singh

CERTIFICATE

3

This is certified that Avneeshwar Singh has carried out the research work presented in the report entitled i7 processor architecture for the award of Bachelor of Technology (CSE) from Noida Institute of Engineering & Technology as a part of syllabus prescribed by Gautam Buddha Technical Univercity,Lucknow. The report embodies result of work and studies carried out by students themselves and the contents of the report do not form the buries for the award of any other degree to the candidates or to anybody else.

Mr. Mohd. Jawed Khan (Seminar Guide)

INDEX1.Introduction 2.Feature of i7 processor 3.Technical specification of i7 processor 4.Detail study of i7 05-05 06-06 07-07 08-38

4

4.1 Nehalem(microarchitecture) 4.2Bloomfield(microprocessor) 4.3Processor cores 4.4Lynnfield 4.5Overclocking 5.Working in i7 processor 5.1Turbo boost technology 5.2 Hyper thredding 5.3 Smart cache 5.4 Quick path interconnection 6.Advantage of i7 processor 7.Intel core-i7 965XE conclusion 8.References

08 28 31 34 37 39-43 39 41 42 43 44-44 45-46 47-47

I7 processor 1.IntroductionLooking for the best desktop processor on the planet? Then, take a closer look at Intel Core i7 Processor that delivers a great breakthrough in quad-core performance. Intel Core i7 Processor features the following:

Intel Turbo Boost Technology Intel Virtualization technology

5

Enhanced Intel SpeedStep technology Execute Disable Bit Intel 64 architecture

Using Intel Turbo Boost Technology, the processor maximizes the speed for demanding applications. With this technology, you can experience a greater performance when you need it the most. It comprises of processor numbers i7-940 and i7-920. The only difference between the two is the clock speed. The i7-940 has 2.93 GHz clock speed while i7-920 has 2.66 GHz. Both have 8 processing threads with Intel Hyper-Threading technology and 8 MB of Intel Smart Cache. These are quad core processors that have 3 channels of DDR3 1066 MHz memory. With this processor, you can multitask applications faster. You can convert a video for your iPod while you are encoding a video for You Tube. This processor enables you to create spectacular slide-shows, 3D images for animations or games, or even create your own movie without suffering PC performance. People who like gaming, graphics, video editing, video streaming and photo gallery can benefit from the incredible breakthrough performance of the inel i7 processor.

2.Features of I7 Processors:There are many features of I7 processors that are more advanced as compared to the previous or the old versions of the processors invented by the Intel Company. Some of the latest technological features of the I7 features are as follows. 1. With the help of I7 processors users can enjoy the high speed working with the additional feature of the multitasking i.e. using two different documents or the files at the same time.

6

2 .Dual core technology is also a part of I7 processors that provide reliable and high data rate working performance to the customers.

3. A big feature of the I7 processors is the Turbo Boost technology. This technology provides the high performance to the system to the users to overcome the work load of different applications on the system and maintain the speed of the system.

4. Like I5 and I3 processors I7 also have a feature of Hyper threading technology enhances the activity and the speed of the system by managing the multitasking, work load and different types of heavier applications of the users.

5. I7 processors are so advanced to deals with the integrated memory of e system and have ability to increase the memory up to 1066 Mbits and provide the working speed of 25.6 GB/sec.

Contd..

3. Technical Specifications of I7 Processors:Some technicalities specifications of I7 processors that encourages the customers to buy the processing units are as follows

7

1. I7 processors have integrated memory up to 1066 Mbits and can provide the faster working for applications.

2. I7 processors are also able to supports cache that is very important for the high rate performance and can support up to 8 to 12 MB.

3. the higher clock speed of the newt poking of the I7 processors is 3.2GHz i.e. is why these are very famous among the users to meet the requirements of the latest technological world

4. there are many other technologies that really make the processors and the system powerful such as high visualization, and can support up to 64 bits, with the help of these specifications system maintains the working of the BIOS and other chipset devices

5. There are different types of cores involved on the manufacturing of the Intel I7 processors but the common cores that are involved are Bloomfield or Lynnfield etc.

Contd

4. DETAILS STUDY OF I74.1 Nehalem (microarchitecture)Nehalem (pronounced /nhelm/) is the codename for an Intel processor microarchitecture, successor to the Core microarchitecture. The first processor released with the Nehalem

8

architecture was the desktop Core i7, which was released in November 2008. It was followed by several Xeon processors and by the i3 and i5. Mobile Nehalem-based processors were introduced in September 2009. The first computer to use Nehalem-based Xeon processors was the Apple Mac Pro workstation announced on March 3, 2009. Nehalem-based Xeon EX processors for larger servers were originally expected in Q4 2009, However, they were released in April 2010. Initial Nehalem processors use the same 45 nm manufacturing methods as Penryn. A working system with two Nehalem processors was shown at Intel Developer Forum Fall 2007. The microarchitecture is named after the coastal Oregon town of Nehalem. Originally it was supposed to be the latest evolution of the NetBurst microarchitecture. Since the abandonment ofNetBurst, the codename has been recycled and refers to a completely different project, although Nehalem still has some things in common with NetBurst. Nehalem-based microprocessors utilize higher clock speeds and are more energy-efficient than Penryn microprocessors. Hyper-Threading is reintroduced along with an L3 Cache missing from most Core-based microprocessors.

Two, four, and eight cores 45 nm manufacturing process at initial release (with 32 nm variants following)

Integrated memory controller supporting two or three memory channels of DDR3 SDRAM or four FB-DIMM2 channels Integrated graphics processor (IGP) located off-die, but in the same CPU package for Westmere.

A new point-to-point processor interconnect, the Intel QuickPath Interconnect, in highend models, replacing the legacy front side bus

Integration of PCI Express and Direct Media Interface into the processor in mid-range models, replacing the northbridge

Hyper-threading has been reintroduced.

Native (monolithic, i.e. all processor cores on a single die) quad- and octo-core processors 64 KB L1 cache/core (32 KB L1 Data + 32 KB L1 Instruction) and 256 KB L2 cache/core.

412 MB L3 cache shared by all cores 33% more in-flight micro-operations than Conroe Second-level branch predictor and second-level translation lookaside buffer

9

Performance and power improvements It has been reported that Nehalem has a focus on performance, which accounts for the increased core size. Compared to Penryn, Nehalem has:

10

1.1 to 1.25 the single-threaded performance the multithreaded performance at the same power level

or

1.2

to

2

30% lower power usage for the same performance

According to a preview from AnandTech "expect a 2030% overall advantage over Penryn with a 10% increase in power usage." Per Core, clock-for-clock, Nehalem provides a 1520% increase in performance compared to Penryn.

PC Watch found that a Nehalem "Gainestown" processor has 1.6 the integer performance and 2.4 the floating-point performance of a 3.0 GHz Xeon X5365 "Clovertown" quad-core processor.A 2.93 GHz Nehalem "Bloomfield" system has been used to run a 3DMark Vantage benchmark and gave a CPU score of 17,966.[15] The 2.66 GHz variant scores 16,294. A 2.4 GHz Core 2 Duo E6600 scores 4,300.AnandTech tested the Intel QuickPath Interconnect ("QPI", 4.8 GT/s version) and found the copy bandwidth using triple-channel 1066 MHz DDR3 was 12.0 GB/s. A 3.0 GHz Core 2 Quad system using dual-channel 1066 MHz DDR3 achieved 6.9 GB/s. Overclocking is possible with Bloomfield processors and the X58 chipset. The Lynnfield processor uses a PCH removing the need for a northbridge chipset. The Nehalem processors are the first to incorporate the SSE 4.2 SIMD instructions, adding 7 new instructions to the SSE 4.1 set available in the Core 2 series.The Nehalem architecture also significantly reduces atomic operation latency; cutting it to 50% in an attempt to eliminate atomic overhead . Westmere Westmere (formerly Nehalem-C) is the name given to the 32 nm die shrink of Nehalem. The first Westmere-based processors were launched on January 7, 2010 and branded as members of the Core i3, Core i5, and dual-core mobile Core i7 families. Westmere's features and improvements from Nehalem have been reported as follows:

Native six-core processors.

A new set of instructions that gives over 3x the encryption and decryption rate of Advanced Encryption Standard (AES) processes compared to before. Delivers seven new instructions (AES instruction set or AES-NI) that will be used by the AES algorithm. Also an instruction called PCLMULQDQ (see CLMUL instruction set) that will perform carry-less multiplication for use in cryptography. These

11

instructions will allow the processor to perform hardware-accelerated encryption, not only resulting in faster execution but also protecting against software targeted attacks.

AES-NI may be included in the integrated graphics of Westmere.

Integrated graphics, built into the processor. Improved virtualization latency.

New virtualization capability: "VMX Unrestricted mode support," which allows 16-bit guests to run (real mode and big real mode).

Support for "Huge Pages" of 1GB in size.

Code names Each combination of a Nehalem/Westmere processor die and package has both a separate codename and a product code. Typically, the same dies are used for uniprocessor (UP) and dualprocessor (DP) servers, but using an extra QuickPath link for the inter-processor communication in the DP server variant. Like the Core microarchitecture, Nehalem also uses four different processor packages, one for each market segment. The high-end chips with eight cores are only available in the MP server segment, while all others are used in at least two different packages. No details have been released about the Westmere EX processor so far. Mobile Desktop UP Server DP Server MP Server

Eight-Core 45 nm Quad-Channel Quad-Core 45 nm Triple-Channel Quad-Core 45 nm Dual-Channel, PCIe Bloomfield Gainestown 80601 80602 Clarksfield Lynnfield Jasper Forest 80607 80605 80612

Beckton 80604

Dual-Core 45 nm Auburndale Havendale Dual-Channel, PCIe, Graphics Core canceled canceled

12

Ten-Core 32 nm Quad-channe Six-Core 32 nm Triple-Channel Dual-Core 32 nm Arrandale Dual-Channel, PCIe, Graphics Core 80617 45 nm Lynnfield processors feature integrated PCIe 1 x16 or 2 x8.1

Westmere-EX 80615 Gulftown Westmere-EP 80613 80614 Clarkdale 80616

6500 series scalable up to 2 sockets, 7500 series scalable up to 4/8 sockets.

Interfaces Pric Core Processor CPU L3 Rele e Codena s / Soc Tur Market Branding Clock TDP cac ase for me Thre ket bo & Model rate Chipset Memory he Date 1k ads Unit Beckto MP 8 1 n Server / (16) DP Server LG Xeon X75 A 60 156 7 X75 50 2.26 G Yes 130 4 QPI 6.4 DDR3- 24 2010- $36 Hz W GT/s 800 / MB 03-30 92 1066 (Up to 4x 2.0 G 18 $28 with Hz MB 37 SMBReady Motherb X65 $24 oard) 50 61

13

L755 1.86 G 5 Hz E754 0 2.0 G Hz E65 40 6 (12) E75 30 1.86 G Hz L75 45 X75 2.66 G 42 Hz E75 1.86 G 20 Hz 4 (8) E65 1.73 G 10 Hz W55 3.33 G 90 Hz LG Gaines DP A W55 3.2 G 4 (8) Xeon Yes town Server 136 80 Hz 6 X55 2.93 G 70 Hz No

95 W

4 QPI 5.86 GT/s

24 MB 18 MB

$31 57 $19 80 $17 12

4 QPI 6.4 GT/s 105 W 12 MB $13 91 4 QPI 5.86 GT/s 130 W 18 MB $20 87 $19 80 $85 6 105 4 QPI 4.8 W GT/s 12 MB $74 4 2009- $16 08-09 00 130 W 2 QPI 6.4 3 DDR 8 $15 1 GT/s 3-1333 MB 2009 00 -0329 $12 86

6 (6)

95 W

14

X55 2.8 G 60 Hz X55 2.66 G 50 Hz E554 2.53 G 0 Hz E55 2.4 G 30 Hz E55 2.26 G 20 Hz 2 5.86 GT/s L553 2.4 G 0 Hz 60 W 3 DDR310661 80 W

$10 72 $85 8 $74 4 $53 0 $37 3 2009 $74 -084 09 $53 2009 0 -0330 $

L552 2.26 G 0 Hz L55 2.13 G 18 Hz E55 2.26 G 07 Hz 4 (4) E550 2.13 G No 6 Hz L55 2.13 G

2010- $26 03-16 6 80 W 2 4.8 GT/s 3 DDR38001 4 MB 2009 $26 6 -0329 $42

60

15

06

Hz

W 80 W 3 DDR31066

3 $22 4

E55 2.0 G 04 Hz

2 (4)

L55 2.0 G 38 Yes 08 Hz W

2 5.86 GT/s

8 MB

$

E55 2.0 G 03 Hz 2 (2) E550 1.86 G 2 Hz 80 No W 2 4.8 GT/s 3 DDR3800

2010- $22 03-16 4 4 MB 2009 $18 -038 29 8 2009- $99 MB 08-09 9 3 1 QPI 6.4 DDR3GT/s 1333 2009 $99 -039 29 2009 $56 -112 01 2009 $56 -082 09 2009 $56 -03-

Bloomfi UP 4 (8) eld Server

Xeon W35 3.33 G Yes 130 80 Hz W

W35 3.2 G 70 Hz

W35 3.2 G 65 Hz

1 QPI 4.8 3 GT/s DDR31066

W35 3.06 G 50 Hz

W35 2.93 G 40 Hz

16

29

2

W35 2.8 G 30 Hz

2010 $29 -034 16 $28 4 2009 -03- $ 29 $

W35 2.66 G 20 Hz W3 2.53 G 505 Hz 2(2) W3 2.4 G 503 Hz Lynnfiel d 4 (8) LG A 115 6 X34 3.06 G Yes 95 80 Hz W X34 2.93 G 70 Hz X34 2.8 G 60 Hz X34 2.66 G 50 Hz X34 2.53 G 40 Hz DMI 2 DDR31333 No 4 MB

8 2010- $61 MB 05-30 2 2009 $58 -09- 9 08 $31 6 $24 1 $21 5

17

L34 1.86 G 26 Hz X34 2.4 G 30 Hz

45 W 95 W

$28 4 $18 9 8 2009- $99 MB 05-31 9 2008 $99 -119 17 2009 $56 -102 20 2009 $56 -052 31 2008 $56 -112 17 2010 $29 -024 28 2008 $28 -11- 4 17

4 (4)

Bloomf Enthusi 4 (8) LG 3.33 G Yes 130 3 975 ield ast A Core Hz W DDR3Desktop 136 i7 1 QPI 6.4 1066 6 Extre GT/s 3.2 G me 965 Hz

Core 3.2 G i7 960 Hz

1 QPI 4.8 GT/s

3.06 G 950 Hz

2.93 G 940 Hz

2.8 G 930 Hz

920 2.66 G Hz

18

880

3.06 G Hz 201005-30

$58 3 $34 2 2009 $56 -092 08 2010 $35 -071 19 2009 $28 -094 08 2010 $33 -017 07 2010 $20 -079 17 2009 $19 -096 08 2010 $25 -019 07

875 K 2.93 G Hz 870

95 W

4 (8) 2.66 G 870S Hz 82 W

Perform Lynnfiel ance d Desktop

LG A 115 6

2.8 G 860 Hz

Yes

95 W

DMI

2 DDR31333

8 MB

2.53 G 860S Hz

82 W

2.8 G 760 Hz

95 W

4 (4)

Core 2.66 G 750 i5 Hz

95 W

2.4 G 750S Hz

82 W

19

Intel states the Gainestown processors have six memory channels. Gainestown processors have dual QPI links and have a separate set of memory registers for each link in effect, a multiplexed six-channel system

Beckton will move to 32 nm at the end of 2010. The 32 nm CPUs will not have significantly different clock speeds compared to 45 nm CPUs.

4.1.1 32 nm

Thermal design power (TDP) values for CPUs with integrated GPUs include the GPU. Clarkdale processors feature an integrated PCIe 1 x16.

Clock rate Interfaces Pric Core Processo L3 e T Rele Coden s / Soc r Tur ca for Market D ase ame Thre ket Branding bo ch 1k P Chip Mem Date ads & Model Core GPU e Uni set ory t

West LGA MP 10 mere156 Xeon Server (20) EX 7

N/A N/A

2011 -Q2

Gulfto DP 6 LGA Xeon X56 wn / Server (12) 136 90 West 6 mereX56 EP 80

3.46 GHz

N/A Yes

3.33 GHz

2 3 12 QPI DDR3 MB 13 6.4 G -1333 0 T/s W

??

??

X56 2.93 70 GHz

9 5 W

201 $16 063 0316 $14 40

X56 2.8 G 60 Hz

$12 19

20

X56 2.66 50 GHz

$99 6

E56 2.4 G 45 Hz

8 0 W 2 QPI 5.86 6 GT/s 0 W

$95 8

L56 2.26 40 GHz

$99 6

L56 2.0 G 38 Hz

$95 8

4 (8) X56 3.46 77 GHz

X56 3.06 67 GHz

1 3 0 2 W QPI 6.4 GT/s 9 5 W

$16 63

$14 40

E56 2.66 40 GHz

E56 2.53 30 GHz

2 3 QPI DDR3 5.86 -1066 8 GT/s 0 W

$77 4

$55 1

E56 2.4 G 20 Hz

$38 7

L56 2.13 30 GHz

4 0

$55 1

21

L56 1.86 18 GHz W 4 (4) L56 1.86 09 GHz No

$53 0

2 QPI 4.8 GT/s

$44 0

W36 3.33 80 GHz UP Server Xeo n W36 3.20 70 GHz

1 3 D QPI DR36.4 G 1333 T/s

201 0$99 03- 9 16

1 3 QPI DDR3 4.8 -1066 GT/s 13 N/A Yes 0 W

201 0$88 08- 5 29

6 (12)

990 3.46 X GHz Core i7 Extre me 980 3.33 X GHz

12 201 MB 1-Q1 1 QPI 6.4 GT/s $99 201 9 00316

Extrem e/ Perform ance Deskto p

3 DDR3 -1066

Core 3.2 G 970 i7 Hz

1 QPI 4.8 GT/s

201 0$88 07- 5 17

Clarkd UP 2 (4) LGA Xeon L340 2.26 ale Server 115 6 GHz 6

N/A Yes 30 DMI W

2 4 201 $18 DDR3 MB 09 -1066 0316

22

L340 2.0 G 3 Hz

201 0$ 10

Mainstr eam / Value Deskto p

3.6 G 680 Hz 733 MHz 670 Core i5 661 3.33 GHz 660 3.46 GHz 73 W

2 DDR3 -1333

201 0$29 04- 4 18

$28 4

900 MHz

8 7 W

201 0$19 01- 6 07

733 MHz 3.2 G Hz

7 3 W

650

$17 6

Core i3

No 560 3.33 GHz

201 0$13 08- 8 29

3.20 550 GHz

201 0$13 05- 8 30

540

3.06 GHz

530 2.93

201 $13 03 0107 $11

23

GHz

3

Penti G69 2.8 G um 50 Hz 2 (2) Cele G11 2.26 ron 01 GHz

533 MHz

2 DDR3 -1066

3 MB

$87

2 MB

OE M

Clarkdale and Arrandale contain the 32 nm dual core processor Hillel and the 45 nm integrated graphics device Ironlake, and support switchable graphics. Mobile Processors

Released processors are set in bold.

Extreme processors have an unlocked clock multiplier. Thermal design power (TDP) values for CPUs with integrated GPUs include the GPU.

All processors use a 133MHz Base Clock.

4.1.2 45 nm Clarksfield

All are quad-core models(8 Threads). They feature Hyper-threading and Turbo Boost. Interface support :

DMI 2x DDR3-1333 PCIe 1 x16 / 2 x8Price for 1k Unit

Processor Sock Market Branding & et Model

Core L3 Turb Release Clock TDP cach o Date rate e

Extreme / PGA Core i7 2.13 GH 940XM Performan 988 Extrem z ce e Mobile 920XM 2.0 GHz

Yes 55 W

8 MB 2010-06$1096 21

2009-09- $1054

24

23

840Q 1.86 GH M z

2010-06$568 21

820Q M Core i7 740Q M 6 MB 720Q 1.6 GHz M 1.73 GH z 45 W

2009-09$546 23

2010-06$378 21

2009-09$364 23

32 nm Arrandale

Socket used is PGA-988 or BGA-1288. Interface support:

DMI PCIe 1 x16Clock rate Pric L3 Relea e Turb TD Memor cach se for o P y e Date 1k GPU Unit Yes 2 DDR31066 35 W 4 MB 2010$346 09-26 $332 201001-07 OEM 25 2010- $346

Cores Processor / Market Branding & Threa Model ds 2 (4) Core i7

Core 2.8 GH z

Mainstrea m/

640M

Value Mobile

620M

2.66 G 766 MH Hz z 2.53 G Hz

610E

660LM 2.26 G 566 MH

25 Hz 640LM 2.13 G Hz 09-26 $332 z W 201001-07

620L 2.0 GH M/ z 620LE 680UM 1.46 G Hz 1.33 G Hz 2 18 DDR3W 800

$30 0

2010$317 09-26 2010$289 05-25 $305 201001-07

660UM

500 MH 1.2 GH 640UM z z 620U M/ 1.06 G 620U Hz E 580M 2.66 G Hz 2.66 G Hz 2.53 G Hz

$27 8

$266 201009-26 766 MH z 2 35 DDR3W 1066 201001-07 3 MB $26 0 $257 $22 5

560M

540M Core i5

520M 2.4 GH / 520E z 560UM 1.33 G Hz 2 18 DDR3W 800

2010$250 09-26 2010$250 05-25 2010$241 01-07

1.2 GH 500 MH 540UM z z 520UM 1.06 G Hz

26 2.53 G Hz 2.4 GH 766 MH z z 2.26 G Hz 1.33 G Hz 2 35 DDR3W 1066 2010- OEM 09-26 201006-26 201001-07 201010-01 201005-25 201009-26 201006-20

460M

450M

430M

470UM

430U 1.2 GH M z 380M 2.54 G Hz 2.4 GH z 2.26 G Hz

500 MH z

2 18 DDR3W 800

No

370M

667 MH z

350M Core i3

2 35 DDR3W 1066

330M 2.13 G / 330E Hz 380UM 1.33 G Hz

201001-07

330U 1.2 GH M z 2 (2) Pentiu 2.13 G P6200 m Hz P6100

500 MH z

2 18 DDR3W 800

201010-01 201005-25

2.00 G 667 MH Hz z 1.86 G Hz

2 35 DDR3W 1066

201009-26

P6000

201006-20 18 2 W DDR3201005-25

U5400 1.20 G 500 MH Hz z

27 800 P4600 2.00 G Hz 667 MH z 2010$86 09-26 2010- $86 / 03-28 OEM

P4500 1.86 G / Celero Hz P4505 n

2 35 DDR3W 1066 2 MB 2 18 DDR3W 800 / 1066

U3400 1.06 G 500 MH / Hz z U3405

$134 2010/ 05-25 OEM

Clarkdale and Arrandale contain the 32 nm dual core processor Hillel and the 45 nm integrated graphics device Ironlake, and support switchable graphics.

Successor The successor to Nehalem and Westmere is Sandy Bridge.

Intel CPU core roadmaps from NetBurst to Haswell

28

4.2 Bloomfield (microprocessor)Bloomfield Produced Designed by Max. CPU clock rate Min. feature size Instruction set From 2008 to present Intel 2.66 GHz to 3.33 GHz

45 nm x86, x86-64, MMX, SSE,SSE2, SSE3, SSSE 3,SSE4.1, SSE4.2

Microarchitecture Nehalem CPUID code Product code 106Ax 80601

29

Cores L2 cache L3 cache Application Socket(s) Brand name(s)

4 4 256 kB 8 MB Desktop

LGA 1366 Xeon 35xx Core i7-9xx

Bloomfield is the code name for Intel high-end desktop processors sold as Core i7-9xx and single-processor servers sold as Xeon 35xx.,[1][2][3] in almost identical configurations, replacing the earlier Yorkfield processors. The Bloomfield core is closely related to the dualprocessor Gainestown, which has the same CPUID value of 0106Ax (family 6, model 26) and which uses the same socket. Bloomfield uses a different socket than the later Lynnfield andClarksfield processors based on the same 45 nm Nehalem microarchitecture, even though some of these share the same Intel Core i7 brand. Features Bloomfield has many new features that represent significant changes from Yorkfield:

The new LGA 1366 socket is incompatible with earlier processors.

On-die memory controller: the memory is directly connected to the processor. It is called the uncore part and runs at a different clock (uncore clock) than the execution cores. Three channel memory: each channel can support up to two DDR3 DIMMs. Motherboards for Bloomfield generally have three, four (2, 1, 1), or six DIMM slots.

Support for DDR3 memory only.

The front side bus has been replaced by the Intel QuickPath Interconnect interface. Motherboards must use a chipset that supports QuickPath Interconnect.

The following caches:

32 kB L1 instruction and 32 kB L1 data cache per core 256 kB L2 cache (combined instruction and data) per core

30

8 MB L3 (combined instruction and data) "inclusive", shared by all cores

Single-die device: all four cores, the memory controller, and all cache are on a single die, instead of a Multi-chip module of two dual-core dies as in Yorkfield "Turbo Boost" technology allows all active cores to intelligently clock themselves up in steps of 133 MHz over the design clock rate as long as the CPU's predetermined thermal and electrical requirements are still met.

Re-implemented Hyper-threading. Hyperthreading was introduced in the older NetBurst microarchitecture, but omitted from the subsequent Core, which was a descendant of the Pentium III family. With hyperthreading enabled, each of the four physical cores can process up to two threads simultaneously, so the processor appears to the OS as eight logical CPUs.

Only one QuickPath interface: not intended for multi-processor motherboards. 45 nm process technology. 731 million transistors. 263 mm2 die size. Sophisticated power management can place an unused core in a zero-power mode. Support for SSE4.2 & SSE4.1 instruction sets.

Brand names Brand name Clock frequency range

Model (list)

Market

QuickPath Interface

i7-9xx Core i7 i7-9xx Extreme Edition Xeon W35xx

Performance Desktop

2.663.20 GHz

1 4.8 GT/s

Extreme Desktop

3.23.33 GHz

1 6.4 GT/s

Server

2.43.2 GHz

4.86.4 GT/s

31

4.3 Processor coresThe clock rates listed here are as specified by Intel for normal mode. "Turbo boost" can increase the rate on active cores in steps of the base clock (133 MHz if not overclocked) up to a predetermined limit for short periods when required, which can be useful with single threaded applications.

The I7-965 XE and I7-975 XE have separate unlocked multipliers for memory and cores. Core clock above those in the table are not guaranteed by Intel.A CPU Clock Speed of 6.158 GHz has been achieved.

Memory rates above those in the table are not guaranteed by Intel. Memory Speeds of DDR3-2133 have been achieved. Higher Rates have been Reported to have been achieved.

The processor has a Thermal Design Power of 130 W and will slow itself down if this power is exceeded. This feature can be disabled from an option in most of the new motherboards' BIOS.

Prices are Intel's wholesale prices for lots of 1,000 units in US$ at product launch.

Clock Ther multipli mal Core Core Pri Quick er Memory desi Rele i7 n s ce Cach Path Soc controlle gn ase Mod m (thre (US e interf ket r powe date el ads) $) ace Co Unc Bas Co Unc r re ore e re ore Clocks (GHz)

32

i7920

2.6 6

20

284

2008 -1117

i7930

2. 80

21

29 4

2010 -0228

i7940

2. 2.13 93

22

16

1 4.8 GT/s

2008 -1117

i7950

3. 06 0.1 33

23

45 n 4 (8) i7- m 3. 960 20

56 256 2 kB L2/co 3 DDR3re 800/1,066 8 MB MT/s shar ed L3

2009 -0603 LGA 130 136 W 6 2009 -1018

i7965 Extre me Editio n

24 3.2 0 1 6.4 GT /s 2009 -0603 2008 -1117

2.66 i7975 Extre me Editi on

20 999

3. 33

25

Performance

33

The Inquirer managed to get a 965 engineering sample to a core clock rate of up to 4 GHz with fan cooling and Turbo Boost.

IT OC Taiwan overclocked an engineering sample of the 965, to 4.20 GHz with a QPI rate of 200 MHz and a multiplier value of 21.0. A vCore setting of 1.72 V was used, which is far higher than the stock voltage of 1.25V and could have led to damaging the CPU or motherboard.

A Core i7 940 system running at stock clock rates has obtained a 3DMark Vantage benchmark CPU score of 17,966. A Core i7 920 system scored 16,294 running at stock clock rates. An IntelCore 2 Extreme QX9770, a very expensive member of the previous generation of Intel processors (costing over four times the price of the 920 at its launch), scored 13,182 also running at stock clock rates.

AnandTech tested the Intel QuickPath Interconnect (4.8 GT/s version) and found the copy bandwidth using triple-channel 1,066 MHz DDR3 was 12.0 GB/s. A 3.0 GHz Core 2 Quad system using dual-channel 1066 MHz DDR3 achieved 6.9 GB/s.

Maximum PC has discovered that Intel has unlocked the QPI clock and memory multipliers on retail 920s and 940s. This is allegedly due to consumer feedback.

Core i7-975 will have the new D0 Stepping. Tests made by X-bit labs shows that it has better energy efficiency and overclockability than C0 stepping.

The Intel Core i7-975 Extreme Edition was considered the world's fastest desktop processor (until the i7-980x) by a review from Hot Hardware. It runs at a clock rate of 3.33 GHz with Turbo Boost clock rates running the processor up 3.46 GHz with all four cores put at work and 3.6 GHz with a single core at work. The processor was overclocked to 4.1 GHz while keeping a 50 C (122 F) core temperature with the stock cooling unit.

34

contd4.4 Lynnfield (microprocessor)Lynnfield Produced Designed by Max. CPU clock rate Min. feature size Instruction set 2009 Intel 2.40 GHz to 3.06 GHz

45 nm x86, x86-64, MMX, SSE,SSE2, SSE3, SSSE 3,SSE4.1, SSE4.2

Microarchitecture Nehalem

35

CPUID code Product code Cores L2 cache L3 cache Application Socket(s) Brand name(s)

106Ex 80605 4 4x256kb 8 MB Desktop

LGA 1156 Core i5-7xx Core i5-7xxS Core i7-8xx Core i7-8xxS Core i7-8xxK Xeon X34xx Xeon L34xx

Lynnfield is the code name for a quad-core processor from Intel released in September 2009. [1] It is sold in varying configurations as Core i5-7xx, Core i7-8xx or Xeon X34xx. Lynnfield uses the Nehalem microarchitecture and replaces the earlier Penryn based Yorkfield processor, using the same 45 nmprocess technology, but a new memory and bus interface. The product code for Lynnfield is 80605, its CPUID value identifies it as family 6, model 30 (0106Ex). Lynnfield is related to the earlier Bloomfield and Gainestown microprocessors, which are used in server and high-end desktop systems. The main difference between the two is Lynnfield's use of the LGA 1156 processor socket as opposed to the LGA 1366 used in the others. LGA 1156 processors include Direct Media Interface and PCI Express links, which Intel has previously connected to the processor with a dedicated northbridge chip, called the memory controller hub or I/O hub. The mobile version of Lynnfield is Clarksfield.

36

Brand names Brand name Model (list) i5-7xx Core i5 i5-7xxS i7-8xx Core i7 i7-8xxK i7-8xxS Xeon 34xx UP Server Performance desktop 2.40 GHz 2.803.06 GHz 2.93 GHz (unlocked) Yes 2.53 GHz 1.862.93 GHz some Yes/32 GB No/16 GB Clock frequency range 2.662.80 GHz No ECC RAM/Max. RAM

Market

HT

Bloomfield has three memory channels, and the channel bandwidth can be selected by setting the memory multiplier. However, in early benchmarks, when the clock rate is set higher than a threshold (1333 for the 965XE) the processor will only access two memory channels simultaneously. A 965XE has higher memory throughput with 3xDDR3-1333 than with 3xDDR3-1600, and 2xDDR3-1600 has almost identical throughput to 3xDDR3-1333. Drawbacks The Core i7 Bloomfield does not support error-correcting memory. Some motherboards with an LGA 1366 socket support both Core i7 and the Xeon 35xx and 55xx series processors, and advertise support for ECC memory. However ECC functionality is only available if a Xeon is installed, not if a Core i7 is installed. Product evolution The Core i7 950 and the Core i7 975 Extreme Edition were introduced in March 2009 with prices similar to the prices for the 940 and 965 Extreme Edition, respectively, but with better

37

performance in each case. Intel has scheduled the discontinuation of the 940 and 965XE for Q3 2009. Intel announced discontinuations in other older families at the same time.

4.5 OverclockingIn recent years Intel has become much more understanding that the enthusiast community has grown quite large and that raw performance ranks number one with this consumer group. For that reason Intel has softened their view significantly regarding the overclocking of their processors and motherboards. In fact with the materials provided with the Core i7 review kit Intel even included a rather in depth 22 page document entitled Intel Extreme Motherboard DX58SO ...Performance Tuning Process. This document elaborates quite freely on the best way to overclock the Core i7 family of processors. In the document Intel states that there are four multipliers on the motherboard which are used to set the system speed:1. CPU Speed: When multiplied by the system base clock speed (default 133.33 MHz)

gives the CPU frequency. Four multipliers are used to define different speeds based on the number of active CPU cores. 2. Memory Speed: When multiplied by the system base clock speed gives the memory frequency. For example a memory multiplier of 10 times the base clock of 133.33 MHz results in a memory frequency of 1333 MHz. 3. Quick Path Interconnect (QPI) Speed: Selectable transfer rate of data transferred between the CPU and the IOH. 4. Uncore Speed: This multiplier applies to the non-CPU related items in the processor. The limit on this multiplier is set by the memory multiplier. There are essentially 3 ways of overclocking the Core i7-965XE: 1. Increasing the base clock rate using this method also increases both memory clock and the QPI speed 2. Increasing the processor core multipliers with turbo mode enabled

38

3. Using a combination of 1 & 2

Being the curious sort we tried all three of the methodologies describe above and got our best results from utilizing a combination of increasing the processor core multipliers and increasing the base clock rate. Intel has pretty much drawn a line in the sand when it comes to using VDimm settings of greater than 1.65 Volts. They have gone as far as to say that disregarding this warning will likely lead to permanent damage of your new $999.00 CPU. Not wanting to gain fame as the first reviewer to scorch a Core i7-965XE while still in the initial benchmarking phase of the review I kept things well on the South side of that mark. We found that the best overclock for the Intel Core i7-965XE was quite easily obtained by setting the multiplier to 29 on all cores, increasing the base clock rate to 144 MHz and using a 250mv default voltage override. This resulted in a clock speed of 4.035 GHz which proved to be rock stable. We were able to attain significantly higher clock speeds that approached 4.3 GHz but we had to use votages that were a bit to close to the 1.65 Volt limit for our liking. We feel that this is a good starting point and as supporting equipment, motherboard BIOS, and drivers mature this overclock will be significantly higher.

39

5.WORKING IN I7 PROCESSOR5.1TURBO BOOST TECHNOLOGYIntel Turbo Boost is a technology implemented by Intel in certain versions of their Nehalem-based CPUs, including Core i5 and Core i7 that enables the processor to run above its base operating frequency via dynamic control of the CPU's "clock rate".It is activated when the operating system requests the highest performance state of the processor. Processor performance states are defined by the Advanced Configuration and Power Interface (ACPI) specification, an open standard supported by all major operating systems; no additional software or drivers are required to support the technology.The design concept behind Turbo Boost is commonly referred to as "dynamic overclocking". The increased clock rate is limited by the processor's power, current and thermal limits, as well as the number of active cores and the maximum frequency of the active cores. When workload on the processor calls for faster performance, and the processor is below its limits, the processor's clock will increase the operating frequency in regular increments as required to meet demand. Frequency increases occur in increments of 133 MHz for Nehalem microarchitecture processors and 100 MHz for Sandy Bridge microarchitecture processors. When any of the electrical or thermal limits are reached, the operating frequency automatically decreases in decrements of 133 MHz/100 MHz until the processor is again operating within its design limits. While Turbo Boost has the potential to speed up single threaded tasks that are unable to otherwise take advantage of the additional cores, it is very rare to see the full advantage in practice. At issue is the need for two or three cores to be inactive to reach the two or one core active turbo speeds; Windows will take a single thread and run 25% of it on each of four cores instead of putting it all on one core. While, since it is a single thread, there is only one core active at a time, the other cores need time to go to sleep and allow the running core to boost up. As a result, the single-core speed is not seen and the twocore speed is rarely seen unless processor affinity has been set to a single core. Technology like core parking will need to evolve before full benefit is seen.

40

41

5.2HYPER THREDDING

Hyper-threading is an Intel-proprietary technology used to improve parallelization of computations (doing multiple tasks at once) performed on PC microprocessors. For each processor core that is physically present, the operating system addresses two virtual processors, and shares the workload between them when possible. Hyper-threading requires not only that the operating system support multiple processors, but also that it be specifically optimized for HTT, and Intel recommends disabling HTT when using operating systems that have not been optimized for this chip feature.

Hyper-threading works by duplicating certain sections of the processorthose that store the architectural statebut not duplicating the main execution resources. This allows a hyper-threading processor to appear as two "logical" processors to the host operating system, allowing the operating system to schedule two threads or processes simultaneously. When execution resources would not be used by the current task in a processor without hyper-threading, and especially when the processor is stalled, a hyperthreading equipped processor can use those execution resources to execute another scheduled task. (The processor may stall due to a cache miss,branch misprediction, or data dependency.)

This technology is transparent to operating systems and programs. The minimum that is required to take advantage of hyper-threading is symmetric multiprocessing(SMP) support in the operating system, as the logical processors appear as standard separate processors.

5.3 SMART CACHE

42

Cache is an important component that impact overall system performance.Cache is a fast memory area where the processor place frequently acess data.System memory (RAM) is a larger storage area that holds more data but is slower for accessing data.Processor with greater cache can benefits more applications-from running rich media titles and games,to everybody productivity application. Intel smart cache increases the portability that each processor core can acess data from faster,more efficient cache subsystem.

5.4

QUICK PATH INTERCONNECTION

43

the Intel QuickPath Interconnect (QuickPath, QPI) is a point-topoint processor interconnect developed by Intel which replaces the Front Side Bus (FSB) in Xeon, Itanium, and certain desktop platforms. It was designed to compete with HyperTransport. Intel first delivered it in November 2008 on the Intel Core i79xx desktop processors and X58 chipset. Intel developed QPI at itsMassachusetts Microprocessor Design Center (MMDC) by members of what had been DEC's Alpha Development Group, which Intel acquired from Compaq and HP. Prior to the name's announcement, Intel referred to it as Common System Interface (CSI). Earlier incarnations were known as YAP (Yet Another Protocol) and YAP+.s QPI operates at a clock rate of either 2.4 GHz, 2.93 GHz, or 3.2 GHz. The clock rate for a particular link depends on the capabilities of the components at each end of the link and the signal characteristics of the signal path on the printed circuit board. The non-extreme Core i7 9xx processors are restricted to a 2.4 GHz frequency at stock reference clocks. Bit transfers occur on both the rising and the falling edges of the clock, so the transfer rate is double the clock rate.

6.Advantages of I7 Processors:

44

As we know that I7 is latest processors for the systems that re able to support dual processors technology and provide the faster and the convenient working performance to the users for the sake of better working. Due to its advancement I7 processors has many advantages, some of them are listed below

1. they are faster from the old versions of the processors given by the Intel and can support dual processor technology

2. Provide high data visualization technology to the users to view the high quality images and video graphics.

Instead of advantages the main and the major disadvantage of the I7 processors is that they are very much expensive to install.

Contd.

7. Intel Core i7-965XE Conclusion

45

Well the day has finally arrived where the Intel Core series of processors aren't just hype, rumor, and innuendo anymore. Much of the marketing hype that was circulating before these processors were sent out for review has proven to be true and their new processor line is truly the fastest on the planet. We'll have to admit we were taken back by these processors raw power in CPU intensive applications as they cut through processes in record time that would have been much more challenging to their predecessors. Yes we're extremely impressed! The Intel Core i7-965 Extreme Edition is the kingpin of the lot and literally smokes all its competition. The Core i7965XE processor in itself is enough to stir the very lifeblood of all that appreciate raw unadulterated computing power. Ok, so much for the enthusiast side of me, now it's time to put on my reviewer hat and come back down to earth and complete this review.

Presentation although being one of the most subjective categories we judge, is no less equally important in a product's consumer appeal. I must however admit that judging presentation on a processor is a very difficult task as in most cases it is heard in the form of results rather than seen. For that reason alone there must be some spillover in this category from the performance category. We feel the presentation of the Core i7-965XE is virtually awe inspiring to anyone that truly needs the capabilities this processor has to offer. This category of consumer would consist of those that perform very processor dependant tasks on a daily basis; those that utilize some heavily graphical intensive programs such as Photoshop, Auto CAD, or video editing software; and finally the extreme computer enthusiast who wants the best there is regardless of the price.

46

With regard to appearance, again rating a processors appearance offers a bit of the conundrum as it does with presentation. The Core i7-965XE's presentation can be equated somewhat with an extremely fast and agile automobile that is responsive to your slightest command. Unfortunately processors again are judged by their end result and not by the way they look. The end result of the Core i7-965XE is extremely ostentascious so we have to rate this category quite high. The construction of the Intel Core i7-965 Extreme Edition is like all of its predecessors, rock solid. The benefit of the Core i7-965 over the previous Core 2 line of processors is that there are virtually no protruding pins to bend if something goes wrong with installation. The protruding pins that are present are in the center of the chip and much less likely to be damaged if your placement of the processor in its 1366-pin socket isn't perfect. In the case of the Core i7-965XE we chose to borrow Intel's statement, it is the fastest performing production processor currently on the planet. We feel that just about sums it up! At the time of this writing, the Core i7-965XE is sold at NewEgg for $1069.99 while other online retailers are selling the BX80601965at around the same price. Normally we would rate the value of a processor with a $999.00 MSRP even lower than we rated the Core i7-965XE. The Intel model BX80601965 Core i7-965XE brings so much to the table in the form of features such as a completely unlocked multiplier and added scalability that we rated it slightly higher. Again value is in the eyes of the beholder and if you are one of those that can truly use this processors capabilities then the price is a pittance. To the average consumer the price of th Core i7-965XE will probably put it out of reach, especially in toady's economy.

47

8.REFERENCES:1.www.wikipedia.com 2.www.intel.com 3.www.bit-tech.com 4.www.tomshardware.com 5.www.google.com