Just for kicks: Bobcat's performance in superpi and wprime. This is a thought experiment, since Bobcat can't reach 4.2GHz, but Bulldozer can be clocked down to 1.6GHz.
A 1.6GHz Bobcat scores 49s in super pi => at 4.2GHz it should get ~18.7s
A 1.6GHz Bobcat scores 72.1s in wprime (2 threads/cores) => at 4.2GHz it should get ~55s in a single-core test
So a 4.2GHz Bulldozer core, which in a single-threaded workload such as super pi has a 128-bit FMAC at its disposal, is 6% slower than Bobcat @ 4.2GHz in super pi and 6% faster in wprime. Bobcat has a 64-bit FPU. It does look weird. So even if someone has hardware that posts those scores (real hardware, not faked benchmarks), it may not be the final platform.
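The extrapolation above can be reproduced with a few lines of Python. This is only a sketch under two assumptions that the post makes implicitly: run time scales inversely with clock speed (no memory or cache bottleneck), and wprime scales perfectly across 2 threads. The function and variable names are made up for illustration.

```python
def scale_time(seconds, from_ghz, to_ghz):
    """Scale a benchmark time, assuming run time is inversely proportional to clock."""
    return seconds * from_ghz / to_ghz

# super pi: 49 s measured on Bobcat at 1.6 GHz
superpi_42 = scale_time(49.0, 1.6, 4.2)

# wprime: 72.1 s measured with 2 threads at 1.6 GHz.
# Assumption: perfect 2-thread scaling, so one core takes twice as long.
wprime_42_two_threads = scale_time(72.1, 1.6, 4.2)
wprime_42_single = 2 * wprime_42_two_threads

print(f"super pi @ 4.2 GHz: {superpi_42:.1f} s")
print(f"wprime   @ 4.2 GHz, single core: {wprime_42_single:.1f} s")
```

Real chips rarely scale this cleanly, since memory latency does not shrink with core clock, so treat the numbers as an upper bound on what a hypothetical 4.2GHz Bobcat could do.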
Last edited by informal; 09-16-2011 at 06:04 PM.
It shows nothing as it's based on ancient coding.
For "pi" wprime is more up to today's standard.
Single thread performance? good joke.
We know how much the A64 beat the P4 on IPC, yet it loses in superpi.
Athlon II X4 620 2.6Ghz @1.1125v | Foxconn A7DA-S (790GX) | 2x2GB OCZ Platinum DDR2 1066
| Gigabyte HD4770 | Seagate 7200.12 3x1TB | Samsung F4 HD204UI 2x2TB | LG H10N | OCZ StealthXStream 500w| Coolermaster Hyper 212+ | Compaq MV740 17"
Stock HSF: 18°C idle / 37°C load (15°C ambient)
Hyper 212+: 16°C idle / 29°C load (15°C ambient)
Why AMD Radeon rumors/leaks "are not always accurate"
Reality check
_________________________________________________
............................ImAcOmPuTeRsPoNgE............................
MY HEATWARE 76-0-0
This thread is getting nowhere.
Peeps fighting and arguing over benchmarks from "half baked BO whatever ES" and "probably final rev. ES before retail" chips... This is not cool.
Getting bored with SB and excited to find a new chip to overclock, but this is such a turn-off. Lol
Last edited by Dumo; 09-16-2011 at 09:17 PM.
The Cardboard Master Crunch with us, the XS WCG team
Intel Core i7 2600k @ 4.5GHz, 16GB DDR3-1600, Radeon 7950 @ 1000/1250, Win 10 Pro x64
Of course not, but if multiple benchmarks are on average 25-35% slower, that tells us that something is wrong, or the marchitecture is a flop.
wprime is more integer than float and uses a numerical method to calculate square roots. For "pi" wprime is more up to today's standard. So, it measures integer multithread performance, not FP.
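For readers unfamiliar with what "numerical method to calculate square root" means, here is a minimal sketch using Newton's method on f(x) = x*x - k. That wprime uses exactly this scheme, this starting guess, or this iteration count is an assumption here, and the sketch uses Python floats purely for convenience, so it says nothing about wprime's real instruction mix.

```python
def newton_sqrt(k, iterations=30):
    """Approximate sqrt(k) for k > 0 with Newton's method on f(x) = x*x - k."""
    # Starting guess: k/2 for large k, 1.0 otherwise (an illustrative choice).
    x = k / 2.0 if k > 1 else 1.0
    for _ in range(iterations):
        # Newton update: x - f(x)/f'(x) = (x + k/x) / 2
        x = 0.5 * (x + k / x)
    return x

# A wprime-style workload would repeat this for every number up to some limit.
roots = [newton_sqrt(float(n)) for n in range(1, 11)]
print(roots[:3])
```

Each number is independent of the others, which is why this kind of loop spreads across threads so easily.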
Interesting comparison. There is no chance for BD to be slower per clock than Bobcat.
Last edited by drfedja; 09-16-2011 at 10:56 PM.
"That which does not kill you only makes you stronger." ---Friedrich Nietzsche
PCAXE
Nobody knows why Intel's chips since Yonah's time are better than a K8 or K10; Yonah was generally not faster than a K8 either. So we don't know anything, hence why shouldn't Bobcat be faster? Anyhow, it is another design, and the important thing is that Bobcat won't hit 4.2 GHz, whereas BD will.
Comparisons at the same clock are interesting from a theoretical point of view - but not practical.
Of course, but the main difference between Bobcat and Bulldozer is that Bobcat is a low-power design with a high-latency, low-performance single-channel memory controller, a much smaller core, less cache and a much simpler FPU. I simply can't believe that Bobcat is faster per clock than Bulldozer. Maybe Bulldozer per clock is equal to Atom?
Why do you think so?
ROG Power PCs - Intel and AMD
CPUs: i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350, 1x AMD FX-8320, 1x AMD FX-8300, 2x AMD FX-6300, 2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K, AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K, 2x i7-4770K, Intel i7-3930K
AMD Cinebench R10 challenge | AMD Cinebench R15 thread | Intel Cinebench R15 thread
Who are you asking? Me?
If yes, then I am basing this on my own experience. I remember with 1st Phenom IIs we had some RAM compatibility problems, and with every updated AGESA code in new BIOS release we would get better and larger compatibility list and higher NB clocks would be possible and CPU would be less resistant to various performance tweaks. I am not saying that this is how it works, just extrapolating from my own observations.
In a pipelined architecture, who cares if an instruction has 4 or 6 cycles of latency? A pipeline mispredict has a 15-16 cycle penalty, so if instruction latency is lower than that, it is not a problem as long as the code isn't too branchy. The only problem is FDIV latency, but how often is FDIV used... very rarely. Overall, high-latency FDIV can't affect performance much.
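A toy cycle count makes the point above concrete. This is a rough sketch, not a model of any real core: it assumes a single fully pipelined unit that issues one op per cycle, ops that are all independent of each other, and a flat 15-cycle mispredict penalty taken from the post. The function names are invented for the example.

```python
def cycles_independent(n_ops, latency):
    """Cycles for n independent ops on a fully pipelined unit, one issue per cycle."""
    # The first result appears after `latency` cycles, then one result per cycle.
    return latency + n_ops - 1

def with_mispredicts(base_cycles, n_mispredicts, penalty=15):
    """Add a flat pipeline-flush penalty for each branch mispredict."""
    return base_cycles + n_mispredicts * penalty

n_ops = 1000
lat4 = cycles_independent(n_ops, 4)
lat6 = cycles_independent(n_ops, 6)
lat4_branchy = with_mispredicts(lat4, n_mispredicts=10)

print(f"latency 4: {lat4} cycles, latency 6: {lat6} cycles")
print(f"latency 4 plus 10 mispredicts: {lat4_branchy} cycles")
```

Under these assumptions two extra cycles of latency cost 2 cycles in 1000 ops, while ten mispredicts cost 150. The picture changes if each op depends on the previous result, since then the latency is paid on every op.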
For most of those benchmarks, regardless of whether they are real or fake, the key thing is SIMD performance. Integer SIMD is also handled by the new FPU inside Bulldozer. So this will be the key to good or bad reviews. If it doesn't beat K10 in single-threaded or poorly threaded integer SIMD workloads, it may not get such favorable reviews. If it manages to do this, it will be both faster and more efficient than K10, in both single-thread and multi-thread benchmarks. Now, how all these "leaks" correlate to real performance is anybody's guess. At least in single-threaded workloads, when both halves of the FlexFP are working on one thread, Bulldozer should be faster than K10 at a similar clock. We don't see this in any of the leaks. It is actually slower by 15-20%, which goes against what we know about its FPU.
Well, Bobcat's L1 is bigger, and it is write-back, not just write-through. Thus, Bobcat has a clear advantage there. AFAIK, sPi is quite cache dependent, so maybe that alone is enough for that bench to scale a bit better on Bobcat.
Nobody knows, but higher version numbers are always better. Someone with a BD CPU has to grab an ASRock board now and make some comparisons, or wait for (beta) BIOS updates for his own board.
As usual, the fun thing with BD is its FlexFPU. In the FDIV case, both FMAC pipes can calculate an FDIV µOp, thus you have double the throughput. Even latency might be better. If there are a lot of FDIVs, two FDIV µOps can be scheduled simultaneously ;-)
That is probably the reason why somebody wrote some pages before, that FDIV is "faster" on BD ;-)
Last edited by dess; 09-17-2011 at 03:49 PM.