Quote:
Originally Posted by
KTE
K10h is a 12 stage pipeline, 65nm, 283mm², 463M transistor, 23.x FO4 delays design. Not made for high clocks in any way, AMD intended, as presented at one of the global IEEE 2006 conferences to reach 2-2.8GHz with Barcelona with it's rated supply Vdd. Intel Core 2 is a 21 FO4 depth design AFAIK and Penryn at FO4 ~18, it is supposed to have been reduced substantially since HKMG integration.
The IBM Power6 is not the least nor the only architecture with 13 FO4 inversion delay, it just happens to be very well tuned for absolute speed and performance. P3 had FO4 15 depth, Willamette P4 FO4 8-10, Alpha 21264 has 15 FO4, and so on. Neither of those could achieve what IBM did.
Power 6 is in-order and has a far higher power budget.
Quote:
However, I don't think the IBM Power6 bears any relevance to desktop computers. It is a major success for it's HPC market and trumped anything any competitor had to offer in 2007 including Harpertown and Itanium 2 Montecito. It's the only CPU to hold all 4 major industry records in one go, transactions, Java, throughput and floating point. Beat Harpertown 3.16GHz 8 core vs 8 core in Int too. Best in SAP, TPC-C OLTP, OASO, Spec Jbb2005, Linpack HPC and so on last I checked late 2007. For instance in TPC-C:
Bull Escala PL1660R 16-cores IBM Power6 4.7 GHz 1,616,162tpmC
NEC Express5800/1320Xf 32-cores Intel Dual-Core Itanium 2 9050 1.6GHZ 1,245,516tpmC
Bull Escala PL1660R 4-core IBM Power6 4.7 GHz 404,462tpmC
HP ProLiant ML370G5 X5460 QC 8-core Intel X5460 3.16GHz 275,149tpmC
As you can see, it trumps anything for what it was designed to do.
It's not the core , it's the I/O that rocks => with 300GBs of I/O per CPU and 5000+ pins .
In single threaded tasks it get obliterated by Core and sometimes even Itanium Montecito altough the later has a 3x frequency and 30x BW gap.What does that say about the core ?
Quote:
What it does show is those who usually guess SOI is the only clocking restriction are wrong, as if you look at IBM technical documentations, IBM Power6 scales to 6.1GHz with low LpolySi tuning on air using SOI at 1.3V Vdd supply. Far more than anything else out there including HKMG 45nm CPUs. The official 3.2GHz IBM Power6 is rated for less than 100W TDP at 65nm SOI, big achivement. The 4.7GHz is rated for a maximum of 160W TDP with massive 790M transistors inside a big 341mm² transitor package, wowzer achievement, especially at the same pipeline, instructions per cycle and latch cycle overhead from 90nm. No other chip from AMD/Intel at 65nm or 45nm can do sub-200W TDP at those specs or temperatures (sub 60C air, with 105C limit).
Hold on a little.
With unlimited TDP , yeah , you could get to 6GHz.Too bad Power 6 has a fairly substantial problem once you go above 4.7GHz , leakage sky-rocket.
At 5.2Ghz you already burn 200w.
http://www.research.ibm.com/journal/rd/516/berri12.gif
I suggest you read this : http://www.realworldtech.com/forums/...85903&roomid=2
Quote:
To compare, the Power5+ 1.9GHz is a 200W TDP CPU at 389mm² 276M transistors. Intel "Montecito" Itanium 9000 running at 1.6GHz is less than half as powerful as IBM Power6 with a hefty 104W TDP, just shows how brilliant the engineering on Power6 really is and yet, you forget the +25W minimum TDP of the memory controller with Power6 that Itanium 2 doesn't have. Comparing Kentsfield 2.67GHz MCM 65nm was at 130W TDP, 286mm² 582M transistor package to IBM Power6 4.7GHz 160W gives us:
2.67GHz vs 4.7GHz
130W (+35W NB) vs 160W
582M vs 790M
20.5MHz/W vs 29.38MHz/W
0.455W/mm² vs 0.469W/mm²
0.22W/MilT vs 0.20W/MilT
In all respects it is far better a CPU at 65nm, but it isn't a desktop market intended chip, hence comparisons with our market shouldn't be made to judge absolute numerical performance, although electrically, you can do.
Somehow you've lost a 128MB L3 for an Power MCM , but what's 128MB anyway ?:rolleyes:
Quote:
Anyway, as for L3 cache, many have had it but not more than one Intel SKU before the K10h architectural lineup in the desktop market AFAIR (?). Alpha EV5 was the first that I can recall with others such as >Power4, UltraSPARC IV+, Madison/9M, etc. It helps only mainly when your L2 and L1 is saturated for high memory access or large matrix array applications, such as databases. It was always mainly a server design bonus, hence not featured much on the desktop but now that seems to be changing and it's led by AMD quite obviously with their MPU+IMC design. Not that Intel didn't know this before AMD, they just couldn't produce a chip below 45nm with it.
Complete and utter BS.Why would Intel put an L3 on a chip just for the sake of it ? Core 2 could have had 2x 512KB L2s and a 2MB L3.But why do it since you already have the best answer - fast and large L2s ?
L3s are needed more for QC because of coherency issues and saturation of a shared L2.
Quote:
Well, the additions for Nehalem are good on paper, but fingers crossed as Native+IMC is too difficult to have running as a design without problems, esp. at your first go. 45nm HKMG helps a lot but not as much as 32nm would. I'm fearing the prices on these, as clocks are far harder to get, yeilds much lower, defect rates very high, and hence, price is where it'll bite us in the hind, unless AMD has something Intel fears by 13th October '08.
You know that how ? Magic crystal ball ? If AMD couldn't get it right that means Intel won't either ? :rolleyes:
Quote:
*IMC also increases power/TDP much, especially with triple channel memory support. You can add 30-60W of minimum to maximum power here at just 2.0-2.8GHz clocks, maybe even more so with SMT and QPI support being internal. Maximum theoretical AC and DC power consumption becomes much higher through individual latch testing.
More BS.A MC burns less than 10w.
Quote:
*Triple channel memory is essentially needed, I reckon its a clever move, because Native+IMC design suffers from low real bandwidth, and worse so for write/copy bandwidth than read. Individual DRAM access by each core is the best way to go, should improve or at least keep level write/copy bandwidth but improve read bandwidth over current Penryn. Just having IMC+3 controllers, doesn't gurantee this at all though.
The stupidest claim of the month.
If native+ IMC suffers from low BW , how does a non-native non-IMC design do ? Crawls ?
Got bored with the rest of your post.Sorry , but it's obviously you had too much free time lately to write such rubbish.