Too bad AMD didn't update Thuban to a real 8 core 32nm processor. Give it a triple channel memory controller, more L3 cache (12MB+), faster L3 cache speed (3Ghz+) and a target speed of 4+GHz at launch.
Too bad AMD didn't update Thuban to a real 8 core 32nm processor. Give it a triple channel memory controller, more L3 cache (12MB+), faster L3 cache speed (3Ghz+) and a target speed of 4+GHz at launch.
As quoted by LowRun......"So, we are one week past AMD's worst case scenario for BD's availability but they don't feel like communicating about the delay, I suppose AMD must be removed from the reliable sources list for AMD's products launch dates"
Maybe I'm the stupid one here...
Llano's IPC difference vs Propus is neglegible. It's worth about 100 Mhz...or roughly 3%. Sure, up to 5% in some apps...too bad we are 30% behind Sandy with "STARS". As far as AMD's efficiency falling with more cores? Yeah. Thuban is a slower CPU in single thread than Deneb (partially due to less cache)...but look it up.
I leave you with a long string of quotes from chew in this thread that were posted in reaction to BD's leaked performance, ending with a response to freeloader:
By the way, VR-Zone Cinebench leak has shown ~5x scaling over all cores from single thread (not one whole module-core like the patent design states)
Regardless of what is behind NDA and what is not, between what we have for current leak and what chew* has said, I expect scaling from 5.0 to 5.5x on retail chips.
*insert flank3r post about how X8 would be a multithreading beast*
...I've been claiming far before bulldozer talk how well "STARS" does multithreaded, if multithread was AMD's focus they should have stayed with it in my opinion. On 45nm, it was very competetive with Lynnfield and Bloomfield multithreaded.
*insert more arguements about howty BD is and its not 4 cores blah blah, cinebench sucks it sucks blah blah*
...some example talk, because PR said it was true and the website said it was true, and AMD has a habit of either
A. Not talking to public
B. Talking to public (not us enthusiasts directly...like Simon) in horrible ways, misleading info, cheesy presentations, how Denebs with disabled cores are a "True Dual Core Design?"
I'd give that CPU a launch speed of 3 Ghz. New processes, GF lagging behind and seemingly sucking at yields looks like trouble with new process.
As far as 12MB of cache...triple channel memory controller, two more cores...about 45-50% more die space (rough estimate, larger DDR3 controller + reorder cores for 3+3+2 and AMD's horrible cache density equalling 15% of Thubans die (6MB)) on a 40% shrink so you are looking at 5% larger than thuban around 350-375 mm^2? Leaves no room at all to grow, even if they could pull that off.
Last edited by BeepBeep2; 09-23-2011 at 09:01 PM.
Smile
BeepBeep, I admit that I have no clue about what was your intention with the last post. Summary of quotes plus some vague estimate how much die area would X8 K10 have? Who cares about 32nm X8 K10? It's not going to happen so why discuss it? And as for Bulldozer ,there are so many unknowns that it's not even funny. We have no real/official benchmark on final platform, no official launch date, no shipping date for Zambezi and no official price. We do have some bits and pieces here and there,some painting good some bad picture. That's all.
Rig 1:
ASUS P8Z77-V
Intel i5 3570K @ 4.75GHz
16GB of Team Xtreme DDR-2666 RAM (11-13-13-35-2T)
Nvidia GTX 670 4GB SLI
Rig 2:
Asus Sabertooth 990FX
AMD FX-8350 @ 5.6GHz
16GB of Mushkin DDR-1866 RAM (8-9-8-26-1T)
AMD 6950 with 6970 bios flash
Yamakasi Catleap 2B overclocked to 120Hz refresh rate
Audio-GD FUN DAC unit w/ AD797BRZ opamps
Sennheiser PC350 headset w/ hero mod
It really should have been three or four posts...and I seem to add on to my thoughts so a lot of information does get jumbled up.
Ignore my jumbled response to manicdan + freeloader on "stars" arch to 8 cores on 32nm and the effects it would have on performance + die space. (in freeloaders case, 8 core with 12MB L3 + tri channel memory) would be a larger die on 32nm than thuban is on 45nm.
As far as Bulldozer:
Yes you are right, no "real" official benchmarks.
But there are two peices of information available:
chew*'s words about how the architecture is more like a 4 core with HT + leaked unofficial benchmarks that show multi-threading performance in several cases lower than Thuban, making some people assume that IPC is lower than Deneb. People are looking at these results and dividing by 7 to 8 for "theoretical leaked single thread performance" and talking about how IPC is lower. Maybe it is(!)
...chew* states that running a program as simple as pi on two cores "one module" brings better SINGLE thread performance, goes back to what I predicted a month ago as soon as he said the architecture works more like a 4 core with HT the first time.
Several weeks ago, my prediction was this:
Higher IPC in 4 threads or less
Lower IPC in more than four threads, but better scaling than HyperThreading.
Hopefully this post is a little easier to understand
EDIT:
Cleaned up the first post a bit if you'd like to take another stab at it.
Last edited by BeepBeep2; 09-23-2011 at 09:02 PM.
Smile
I think his point is: Read between the lines of the post's from one who has had vast experience with AMD chips and hands on with BD. Read the clues!! Has come to the same conclusion as me..... BD wont be the monster everyone is hoping for.
I also believe the events this summer, rumors and delays point to the same thing.
Disclaimer: ^ My personal opinion, hope I am wrong and the 5 years of BD planning nets a real winner for the enthusiast .
Windows 7 X64/ Phenom X6 1090T@4.1/ Crosshair IV Formula 890FX/ Saphire Toxic 5850/ Patriot Torqx SSD 2x 64GB RAID 0, Seagate 1TB storage/ Corsair TX750W/ Haf 932/ DDR3- Muskin
People in the AMD crowd seem to be way too hung up on multi-threaded performance and the fact that AMD calls this an 8 core CPU(even though they've been crying about single thread but pretending like more cores makes up for it, hence "why didn't they make X8 32nm").
Because leaked benchmarks show the 8 core CPU performing near 2500K level multithreaded, people are disappointed. However, chew* has stated multiple times the CPU internally is a 4 core with 8 threads. People have simply given him hell for making such statements. The funny thing here to me though, is that he has Bulldozer and they do not.
The goal here in my opinion was not to add more cores, but to get single thread performance up. If they can find a way to make CMT work well (1 thread per module) and that does in fact speed up single thread performance, then I see absolutely nothing wrong with this architecture. If you guys haven't noticed, Sandy cores are huge...Llano cores are not. Bulldozer modules are huge...Llano cores are not. 1 Sandy Core = 2 threads...1 BD module = 2 threads.
Last edited by BeepBeep2; 09-23-2011 at 09:12 PM.
Smile
Don't call it HT, in relation to AMD, please. HyperThreading is a brandname of Intel for its implementations of SMT.
So, CMT is a kind of Multi Threading. A quite different one to SMT. Otherways both are "physical".
I agree that it's a 4-core + CMT, from engineering POV. Altough, if it performs comparably to a true 8-core of earlier uarch's, then it can be valid to call it a 8-core.
Last edited by dess; 09-23-2011 at 10:18 PM.
But we have known the processor would act like a 4 core + advanced hyper threading for what, 2 years now? Maybe 3? Everyone got caught up in the whole 8 core thing but we have known for a LONG time now that this architecture that uses cluster multi threading was just a more advanced and better form of HT. It was talked about extensively years ago by AMD. Slides were shown explaining the 4 types of multi-threading in a core and why AMD hadn't done the same thing as Intel and what AMD would do and was trying to implement in Bulldozer for its multi-threaded cores.
Rig 1:
ASUS P8Z77-V
Intel i5 3570K @ 4.75GHz
16GB of Team Xtreme DDR-2666 RAM (11-13-13-35-2T)
Nvidia GTX 670 4GB SLI
Rig 2:
Asus Sabertooth 990FX
AMD FX-8350 @ 5.6GHz
16GB of Mushkin DDR-1866 RAM (8-9-8-26-1T)
AMD 6950 with 6970 bios flash
Yamakasi Catleap 2B overclocked to 120Hz refresh rate
Audio-GD FUN DAC unit w/ AD797BRZ opamps
Sennheiser PC350 headset w/ hero mod
Just 5,95 pts in Cinebench: http://www.donanimhaber.com/islemci/...onuclari_5.htm
To calculate single thread performance one should look at JF's posts. One thread = 100% two threads =180%, this is probably top scaling so probably lower most often, maybe 40-60%. If BD IPC is less than 11% higher than Thuban it will not increase IPC per two threads (111% of Thuban + 88% of thuban) best case scenario. In non optimal scenarios it would need from 25-50% higher IPC than Thuban to equal it per two threads (run within one module of course).
Edit: The purpose of CMT is probably to get a more consistent scaling with the second thread per core than SMT. I also believe that the figure 80% is more likely to happen with a server workload than a desktop workload. That would also explain why the desktop test are lacking in performance compared to Thuban in MT.
Last edited by marten_larsson; 09-24-2011 at 01:07 AM.
shopblt was only 1 source, there were 4-5 other shops, which back-uped the lineup of 3 models.
Now concerning your Interlagos example, that does not fit, because it is another segment. The retail segment of server CPUs is very, very small. If there is demand for an Opteron, than people buy a complete machine+service and don't do a "DIY-build" of the server in their basement.
Therefore, most shops do not have any Opterons, some only have a few. Furthermore, there is no other shop which could back up that fact.
Anyhow, we have confirmation now:
amdfxpressdeck10adhfx57.jpg
People who says that AMD wouldn't be able to add two cores to Phenom II and rise frequencies at the same time. Consider this:
Agena 4C 2.6GHz 2Mb L3 65nm
Thuban 6C 3.3GHz 6Mb L3 45nm
Thats 50% more cores, 27% higher frequency and 200% more cache, and still cooler than Agena. Of course you can't just extrapolate this gains into 32nm, but I think it's reasonable to say that we would be able to see 33% more cores, higher frequencies (even more with turbo) and an extra 2Mb cache and still have a smaller chip than Thuban. Throw in some core enhancements and you're set.
Would it be as competitive as BD? Maybe, maybe not, but I think it's a bad sign when you first really new architecture in 12 years isn't clearly a better choice than the old one. Intel did this once, they crippled their old P6 in favor for Pentium 4 a century ago, it turned out that then the improved P6 came back 5 years later it was twice as fast as their current Pentium 4, and that's dual vs. dual, their P6 derived architecture had more thermal room for extra cores and came as C2Q delivering even more performance.
Well I think even a beefed up high frequency 4C/6C Phenom III with doubled FMAC capable FPU and better prefetchers.would actually perform better against Intels quads than BD will considering the leaks and price. But AMD made the choice to focus on cores to make it an Apples to Oranges comparison they could benefit from. I think it's their server focus that makes them do bad desktop processors, in server world multithreading is everything.
has anyone taken a look at this?
http://www.donanimhaber.com/islemci/...onuclari_1.htm
It seems "official" Amd slides...
some people post ONLY the cinebench slide, because fx seems to perform worse than i7 2600k and similar to i5 2500k, but take a look at the other slides.
![]()
Last edited by liberato87; 09-24-2011 at 02:24 AM.
Fair comparision, but with a brand new architecture the important point is the 2nd or even 3rd generation. The first one is very often buggy and not well optimized. Intels very first S423 Pentium4 was crap, the Socket 478's Northwoods were quite good, but then Preshott was the end, because they could not control the power consumption, and the pipeline got ridiculously long.
I do not see these problems here, the only problem is low-IPC, and a still not well running process at GF. Both could be fixed in the next revision. Then you still have superiour clocks. One should not forget, that Llano's CPU part @32nm is neither running well. Only ~3GHz clock, with OC you can reach maybe 3.6, Bulldozer seems to reach easily 4.8Ghz, with 8cores... and it has AES, AVX, SSE4.1/4.2... IMO really the better option to have build Zambezi, especially if the next revisions will gain some IPC.
Looks fake in design, it's nothing professional about it. But the numbers look real, game benches are clearly GPU limited and the rest is hand picked FMA4 with cut Y-axis. Looks like official charts since they try to hide performance disadvantages.
These leaks are useless, most are surely crippled and they mean nothing. However, the pricing does seem official and points to BD performing very close to the level of a 2600K in MT apps. If it overclocks well, its all good. I also believe the release date being very near, most likely two to three weeks at max from now.
i7 920@4.34 | Rampage II GENE | 6GB OCZ Reaper 1866 | 8800GT (zzz) | Corsair AX750 | Xonar Essence ST w/ 3x LME49720 | HiFiMAN EF2 Amplifier | Shure SRH840 | EK Supreme HF | Thermochill PA 120.3 | MCP355 | XSPC Reservoir | 3/8" ID Tubing
Phenom 9950BE @ 3400/2000 (CPU/NB) | Gigabyte MA790GP-DS4H | HD4850 | 4GB Corsair DHX @850 | Corsair TX650W | T.R.U.E Push-Pull
E2160 @3.06 | ASUS P5K-Pro | BFG 8800GT | 4GB G.Skill @ 1040 | 600W Tt PP
A64 3000+ @2.87 | DFI-NF4 | 7800 GTX | Patriot 1GB DDR @610 | 550W FSP
I dont know if these are true or fake.. I assume that they re fake, unitl we have a confirmation.
Ok game benches are gpu limited but full hd is the resolution that gamers use most.. so I dont care about benches in 1280x1024 only to see the best cpu, useless IMHO.
what about multithread slide?
Would recommend taking a look at this one, as well...
amdfxpressdeck_12a_dh_fx57.jpg
Isn't affected at all!? Then why it is in short supply? While they're concentrating on these as being mainstream parts they face the biggest demand here.
Then, most Orochi dies that produced are going to server OEM's and Cray, as Interlagoses and perhaps Valencias, at much higher profit margins than desktop CPU's.
No cut in that diagram, base is at 0. And why hand-picked is 15 different kind of tests?hand picked FMA4 with cut Y-axis.
Last edited by dess; 09-24-2011 at 03:02 AM.
Meh, I was hoping for an advantage in multithread over the 2600K at least, this is mediocre performance.
But brand new architectures isn't usually worse or equal to the old, compare K7 to K6, K8 to K7, Core i7 to Core 2, Core 2 to Core, the only example of this is the failed P4 architecture. See the pattern? And your comparison with Llano is unfair. It has low stock clocks because it has a full 400 shaders eating power, and at the same time it isn't meant to be high performance. And you can't compare overclocking either, Llanos bad overclocking abilities is due to lack of dividers and frequency locks. An unlocked Llano without graphics part would overclock much much better.
Last edited by liberato87; 09-24-2011 at 03:37 AM.
Based on these slides it is a good step in the right direction! Not a leap forward but a solid step up.
With that performance I won't be disappointed. Plenty of knobs to turn and keep me entertained with new BD CPU coming to my system ;].
Can't wait really because 3 days ago I've sold my 1090T and temporary putted X2 555 in it's place. I was shocked how much slower this CPU is in games even @4GHz compared to 1090T. In Crysis2 FPS cut in HALF! Civilization 5 visibly slower, even EVE Online is jerky now! Game engines are definitely going in the right direction with MT support![]()
RiG1: Ryzen 7 1700 @4.0GHz 1.39V, Asus X370 Prime, G.Skill RipJaws 2x8GB 3200MHz CL14 Samsung B-die, TuL Vega 56 Stock, Samsung SS805 100GB SLC SDD (OS Drive) + 512GB Evo 850 SSD (2nd OS Drive) + 3TB Seagate + 1TB Seagate, BeQuiet PowerZone 1000W
RiG2: HTPC AMD A10-7850K APU, 2x8GB Kingstone HyperX 2400C12, AsRock FM2A88M Extreme4+, 128GB SSD + 640GB Samsung 7200, LG Blu-ray Recorder, Thermaltake BACH, Hiper 4M880 880W PSU
SmartPhone Samsung Galaxy S7 EDGE
XBONE paired with 55''Samsung LED 3D TV
Game benches which are not GPU limited are relevant, because you can see which processor which will be bottlenecking first in future games. That both a P4B 2.4GHz and Athlon X2 at 2.4GHz is able to get the same result in a GPU limited Q3 bench doesn't mean they perform the same in Crysis.
GF isn't in full 32nm production yet, a large part of their capacity is still 45nm. And Llano is mostly laptop, and I don't have a hard time finding those. Nothing unusual for an AMD node transition. AMD can't have a large 32nm production so short time after the transition. At least the last 4 node transitions has been low volume the first months. Well 45nm went unusually fast. But 65nm, 90nm and 130nm was low volume for several month before they had transitioned to full capacity on the new node.
You're right, my mistake. And it's 15 different tests, but not different kinds of test. It all measures FMA4 performance, which is unfair against SB. So still handpicked.
But I feel AMD with it's better multi core performance inflate the total score across the board compared to i7 and i5. So even if it perform worse than i5 in say games they're still more expensive since benches show them faster in other areas.
Last edited by -Boris-; 09-24-2011 at 04:07 AM.
Bookmarks