It's Lemon Meringue.
Printable View
some options are: one is fake, both are fake. either option is probable
Then why do you keep reading it and posting it on multiple sites. Based on his comment that amd doesn't know when it is launching, I tend to believe it is not true.
Just for kicks : Bobcat's performance in superpi and wprime. This is a mind experiment since bobcat can't reach this clock,but bulldozer can be clocked down to 1.6Ghz.
1.6Ghz Bobcat in super pi gets a score of 49s => 4.2Ghz should get ~18.7s
1.6Ghz Bobcat in wprime (2 thread/cores) gets a score of 72.1s => 4.2Ghz should get ~55s in single core test
So 4.2Ghz Bulldozer core which in single thread workload such as super pi has 128bit fmac on its disposal is slower than Bobcat @ 4.2Ghz by 6% and in wprime is faster by 6%. Bobcat has 64bit FPU. It does look weird. So even if someone has a hardware that posts those scores (real hardware,not faking benchmarks),it may not be final platform.
It shows nothing as it's based on ancient coding.
For "pi" wprime is more up to today's standard.
Single thread performance? good joke.
We know how much A64 beat P4 on IPC but it lose in superpi ;)
This thread is getting nowhere:(
Peeps fighting and arguing over benchmarks from "half baked BO whatever ES" and "probly final rev.ES before retail" chips...This is not cool
Getting bored with SB and exited to find new chip to overclock, but this is such a turn off. Lol
Of course not, but if multiple benchmarks are slower on average 25-35%, that's tells us that something is wrong, or the marchitecture is flop.
wprime is more integer than float and uses numerical method to calculate square root. ;) So, it measures integer multithread performance, not FP. ;)Quote:
For "pi" wprime is more up to today's standard.
Interesting comparison. There isn't chance for BD to be slower per clock than Bobcat.
Nobody knows why intel's since Yonah's times are better than a K8 or K10, Yonah was generally not faster than a K8 either. So we don't know anything, hence why shouldn't be Bobcat faster? Anyhow, it is another design, and the important thing is that Bobcat won't hit 4.2 GHz, however, BD will.
Comparisons at the same clock are interesting from a theoretical point of view - but not practical.
Of course, but main difference between Bobcat and Bulldozer is that the Bobcat is low power design with high latency, low performance single channel memory controller, much smaller core, less cache and much simpler FPU. I simply can't believe in that the Bobcat per clock is faster than Bulldozer. Maybe, Bulldozer per clock is equal to Atom ? :D
why do u think it :)?
Who are you asking? Me?
If yes, then I am basing this on my own experience. I remember with 1st Phenom IIs we had some RAM compatibility problems, and with every updated AGESA code in new BIOS release we would get better and larger compatibility list and higher NB clocks would be possible and CPU would be less resistant to various performance tweaks. I am not saying that this is how it works, just extrapolating from my own observations.
In pipelined archiceture who cares if instruction has 4 or 6 cycles latency. Pipeline miss-predict has 15-16 cycles of penalty, so if instruction latency is lower than that, that is not problem if code isn't too brancy. Problem is only FDIV latency, but how much often is usage of FDIV.... very low. Overall high latency FDIV can't affect on performance.
For most of those benchmarks,regardless if they are real or fake, the key thing is SIMD performance. Integer SIMD is handled also by the new FPU inside bulldozer. So this will be the key for good or bad reviews. If it doesn't beat K10 in single thread or poorly thread integer SIMD workloads,it may not get so favorable reviews. If it manages to do this it will be both faster and more efficient than K10,in both single thread and multi thread benchmarks. Now,how all these "leaks" correlate to real performance is anybody's guess. At least in single thread workloads,when both halves of the flexfp are working on one thread,Bulldozer should be faster than K10 at similar clock. We don't see this in any of the leaks. It is actually slower by 15-20%,which goes against what we know about its FPU.
Well L1 is bigger, and Write Back, not only Write Through. Thus, Bobcat has a clear advantage there. AFAIK, sPi is quite cache depended, maybe that alone is enough for that bench to scale a bit better @bobcat.
Nobody knows, but higher version numbers are always better. Someone with a BD CPU has to grab an AsRock board now and make some comparisons, or wait for his board for (beta) BIOS updates.
As usual, the fun thing with BD is its FlexFPU. In the FDIV case, both FMAC pipes can calculate a FDIV µOp, thus you have double throughput. Even latency might be better. If there are lot of FDIVs, two FDIV µOps can be scheduled simultaneously ;-)
That is probably the reason why somebody wrote some pages before, that FDIV is "faster" on BD ;-)
It depends. Each FMAC consists of an FADD and an FMUL, so the amount of FADDs and FMULs per core is the same as in K10. But, I wonder if the FADD and the FMUL in an FMAC were capable of working parallelly. AFAIK those in the K10 cores are capable of this. In SB cores they certainly do.
I just updated my icafe870R2.0 bios was released Aug. 30, saying update CPU code
http://www.asrock.com/mb/download.as...%20R2.0&o=BIOS
i guess most mobo manu. has the latest Agesa by now
I hope its just a BIOS problem. Either way i am tied into getting a BD whether it stinks or not.
Interesting post by Phantom over at OCN: http://www.overclock.net/14972413-post5517.html
I don't remember who mentioned it in the last few pages (lost the multi-quote I had going), but it was about making a 4M/4T chip and if AMD would do that... I don't know, it's unlikely, but about 1/2 way through the thread I was thinking how it'd be interesting for them to make one since it would give each core total access of all module resources. I think they'd make for pretty good gaming chips. Clock them at 4GHz with maybe a 4.5GHz 4C-Turbo, weee! :D
Nordic Hardware got time with the Zambezi system:
As you can see, in his case FX 8150 has gotten some odd performance numbers in super pi and wprime. And this was system directly from AMD. This practically validates what JF-AMD said,don't trust anyone (with X files music in the background) :).Quote:
CPU / Chipset | 17/09/2011 20:00 | Jacob Hugosson
AMD's processor architecture Bulldozers have been a long roller-coaster with delays and silence from AMD. AMD has not been able to offer other than silence, but in conversation with one of the employees of AMD confirmed that the FX series begun mass production and that the launch should not be too far away.
AMD was first to have launched its FX-series processors in the second quarter of this year, something that had to be postponed to a later date. Instead, they promised one within 60 - 90 days June 1, therefore, that by August something we all know did not occur. The next rumor was September, but we do not know will happen.
NordicHardware have now got hold of more concrete information on Bulldozers from one of its employees. We asked about what has actually been going in recent months with the FX series and why AMD has given us nothing but total silence. The answer was somewhat alarming, AMD has not even themselves know where they could launch the FX series, so they simply have not had anything to say to the media about it.
The situation remains very uncertain in AMD and you do not really know when the launch will take place, but mass production of AMD "Bulldozer" FX series has begun. As it stands today, hope AMD internally at a launch in 4-5 weeks. The latest rumors talk about a launch October 13 and in view of the 4 - 5 weeks we have heard directly from AMD, it may very well be true.
FX2 The test system with the FX 8150 AMD showed off
We also got the chance to briefly sit by a system equipped with 8-core processor, the FX-8150 with a clock speed of 3.6 GHz with a turbo mode at 4.2 GHz. Because of the very limited time period so we had time only to run the obsolete SuperPi 1.5 and wPrime 2:05. Unfortunately, the results far out to be something that represents the final product so take this information with a large pinch of salt.
http://img839.imageshack.us/img839/8305/img1184h.jpg
In the obsolete SuperPi took 20.992 seconds to figure out a million decimal places. Given that Bulldozer is a completely new architecture, it is very possible that AMD has chosen not to do further optimization for x87 code. Then we should not forget that SuperPi is no longer a reliable indicator of performance, but it is much worse compared to previous AMD processors.
http://img18.imageshack.us/img18/2549/img1183cn.jpg
In the newer and more interesting wPrime 2:05 to get the CPU a final score of 15.815 seconds. We have reason to believe that this is not true, as Bulldozers in such cases would perform roughly equivalent to AMD A6-3650, according to our own tests .
http://img193.imageshack.us/img193/8194/wprime32.png
The odd, or should we say really bad performance figures may be due to several different things. But something that is likely is that the processor has either an older stepping, or the BIOS, the computer had not arrived with the latest AGES code from AMD.
Although performance figures are not the most credible, we can confirm one thing: If all goes as planned with AMD is launching the new FX series within 4-5 weeks. We should also then get the green light in on the new flagship from AMD has been worth the long wait. AMD still say they expect the FX series is the world's fastest consumer processor.
Thanks informal... I think people are getting too hung up on benchmarks from the 90's.
SPi lost any relevance back when the A64 came out. Slowly but surely info is leaking and personally, I don't think they look as bad as some folks would have us believe... ;)
It may take a little time for OS's/compilers to take advantage of what BD has to offer, but I don't doubt the performance will be there for modern software.
Now we just need them to be available for purchase! :p:
I have the mobo and $$$ waiting! It's been a long time, but I'm getting excited... :yepp:
If nothing else, it should end all this crazy speculation!!
SPi has never lost relevance Dave ;)
SuperPi AMD vs AMD has always been comparable, Intel vs intel has been comparable.
That might change if it gets slower than Phenom II, people might not want to bench it anymore...but SuperPi is still benched for comparing single thread performance within a single architecture. I still find it fun to bench ;)
Ok Beep, I'll give you some amount of credit with this statement...
I know your a pup, but us old farts remember that P4's crushed A64's in Spi, but in everyday usage the P4 lost baddly!
I get the feeling BD is going to be the point where Spi totally looses relevance (x87 commands are so out dated it's ridiculous)... :rolleyes:
No modern software uses x87 anymore!
Persoanally, I'll be a happy camper when folks don't even bother to run Spi anymore...
Hopefully this will be the chip that finally pushes it into obselesence (where it belongs)!
SPi 1M will be benched till it goes to 1 second completion. Lol
Imo, 32m which required os tweaking for efficiency is why overclockers can't get enough of it
Attachment 120199
:rotf:
But what about wprime? 15.8s is not possible with 8 threads. How mayn threads do u used? Why this :banana::banana::banana::banana: version of wprime (2500k cant be better than 1100T never in wprime! 1.55 version seems is better optimalized than newest versions...)
What is this ages code all about? I've seen it referenced but what is it.
Some kind of firmware,i think it's quite similar to Intel'S Management Engine.Somebody correct me if i am wrong :)
Pestilence chew* already answered the same question page110
Quote:
It's microcode injected into a bios.
For every cpu generation and or revision the microcode can be tuned/optimized and even without a silicon revision.
From there board partners can play around and "tweak" it even more via bios.
Pestilence No problem. Sometimes, you are not present for a day and no new comment is added but next time you must read at least 2 pages.
So BLT has pulled the listings for the 8150/8120/6100 from there site.. Hmmmm
There are many bios options that can effect the outcome of benches.
HPET is 1 for example, it stops the cpu from throttling back in mulithreaded apps.
Running pi on a cluster, versus a core ( 2 threads ) versus being able to disable a single cluster in a core ( which 99% boards/bios's do not have implemented so resources are not shared ) can all influence the results in single threaded.
Knowing all this tells you one thing for sure, you can make it look worse or make it look better all depending on your knowledge of the chip and or your intentions.
As far as PI it's an antiquated bench and has not been AMD's strong point for quite some time.
Granted some results shown tend to lead to the fact that 1m times are bad but looking at the bigger picture we also know that in many cases you can validate 1000mhz higher in many cases with BD, which would point to the fact that you can run 1m at alot faster speeds than current AMD tech.
Things that make you go hmm like what kind of times will we see at 8 gig or even comparing BD to deneb/thuban when same cooling is used.
Chew, I want to thank you for all the insights you have given on BD. You have made me go from :shakes: to :D waiting for the CPU to come out again. I want to get one, but right now I think it is a big if with MSI updating my AM3 880G board with support. I do not want to drop money on a new board, so I may just end up waiting for the revision (piledriver) to come out.
I think it will be more interesting to see "performance per Watt" instead of "clock vs clock".
Say you have a 1100T system @ 4GHz drawing 600W from the wall in full load, what frequency will a identical FX8150 system (with same components & cpu-cooler) end up with? 5GHz? :shrug:
On top of this it might even run cooler, ending up with another win: a more quiet system...
You can disable the throttling.
http://www.xtremesystems.org/forums/...3&d=1315750652
In ASRock Fatal1ty at least.
Chew,
I know you might not be able to answer this in a timely manner but I have to ask it.
I have a GA-MA790FXT(A) UD5 motherboard $200 USD you sayed that the old CH would support the K15 but not optimal. If it is only a power saving issue could the 790FX also work in this situation?
Thank you for taking the time to answer
I think motherboard sellers want to sell motherboards lol
I am gonna take the under on this one. FP benchmarks are less interesting except in the HPC world. ~90% of what your CPU is doing all day long is integer. FP has long latencies (deeper instructions) and is really only ~10% of the work. So speeding up FP might look awesome on SPEC FP benchmarks, but for desktop apps and most server apps, it gives you little or no noticeable impact.
No, there is a 4-core 4 memory channel interlagos, but that is a server part. It has 2 modules. There is not that much of a benefit for having a single thread running on the module. The overhead from sharing is pretty low, so you get pretty close to the same performance. Running one thread on each module vs. filling the modules means that, to run the same # of threads, you will be doubling the amount of power required because you have fired up 2 modules. You'd be better off in most cases to run it on one module, power gate the other, and get the uplift from turbo because you have more headroom.
Your actual mileage will vary, but the idea that there is a huge overhead for sharing just isn't there. In most cases it will probalby be lightly impacting, if impacting at all.
AGESA is the base code that is given to board developers and OEMs to build their BIOS. The version of AGESA that is used will determine what features are exposed and any performance optimizations that might be available. But just because the feature is in AGESA does not mean that your vendor will take advantage of it, you get the features that they expose to you.
I said before and I will say it again. Public sales prior to launch are not allowed per the embargo guidelines. I can't speak to how the desktop guys would handle it, but back when I used to run business development, if someone was advertising prior to launch, they were not the first guys in line to receive product. Production parts went to the folks that honored the rules.
Considering that ~5% of the market at most is buying top bin parts, you can assume that ~95% are interested in price/performance or performance/watt. Base on that, you are absolutely correct, this would be the most interesting thing to consider in comparisons.
Sabertooth has the same bios options, just not as good onboard sound and doesnt have the special software stuff. From what I remember they both have the same VRM stuff so you should be able to get the exact same overclocks. Sabertooth also has a 5 year warranty instead of 3, however I dont think it doesnt support quad-SLI/crossfire?
...so it's th 19th of Sept, anything New?
was just curious as to what was/is the factor in the delays? (more focused on cpu arch. not production or outside factors) if there was any...
i understand the first couple steppings were sub par, but what part of the architecture was to blame? or was it bad yields? or just couldn't reach desired clock speeds?
dont wanna start another couple of pages of arch. wars...it is just innocent curiousness.... if anybody knew some solid evidence.
hopefully BD will be what everybody hopes and we will look back on this thread and:shakes:
I disagree, FP is used extensively for gaming, and I'm not talking about 3D rendering (GPU has taken care of that for the last decade), rather I'm referring to physics. Physics accounts for a huge amount of FPU utilization in modern games. Physx would be the out here, however it's rarely used, and Radion has no support for it.
So if all you do is basic computing, i.e. internet, email, word processing. Then by all means just dismiss FPU performance.
This may help your argument: http://www.realworldtech.com/page.cf...0510142143&p=3
However, there was some news some while back that Nvidia was recompiling the Physx API to move to SSE code, and take better advantage of multicore architectures, not sure if that ever panned out though....
I don't see any "metal box" thingy in Local store
So No Bulldozer yet
@imamage
always keep us updated as Hong Kong mostly gets the new stuff first
Anyone have any thoughts on the Gigabyte 970a-UD3 with BD? Seems like a very good board for the money, 8+1 power too :)
given the ipc of those applications and the #fpu ratio. i don't see a big problem. Low ipc shows alot of dependant ops or lots of bubbles. So while it is fpu intensive code it isn't really intensive for the fpu. So longer latency would affect the thread throughput, another thread would not interfere with the performance. (which is also and advantage). Biggest problem is that the code uses an obsolete marked instruction set for x86-64. So in SSE this would or should run pretty decent on BD single threaded and very good if it is multithreaded... but given the x87 dependancy i'm not sure how BD will perform in that. Might be pretty bad if they really disregarded those legacy instructions.
this code would run great on K8 architectures. (if they can predict and feed enough to their execution resources)
Question for JF-AMD
I have got a supermicro H8QGL-6F ( quad socket motherboard ) with only 2 opterons 6128 HE. Now i want to buy two others 6128 but not HE. So my question is : is it possible to mix opterons 6128 with 6128 HE on the same motherboard ?
Thanking you in advance for the answer .
He's incorrect. From the Sabertooth Product Page
Quote:
Quad-GPU SLI and Quad-GPU CrossFireX Support!
Flexible Multi-GPU solutions, Your Weapon of Choice!
SABERTOOTH 990FX brings multi-GPU configurations through both SLI™ and CrossFireX. This motherboard features the powerful AMD® 990FX/SB950 platform, optimizing PCIe allocation in multiple GPU configurations. Expect a brand new gaming style like you’ve never experienced before!
that's cause sabertooths are made to last even in harsh conditions
Quote:
Certified for Tough Duty
Get rugged performance even in the most challenging conditions with robust TUF chokes, solid capacitors, and MOSFETs--certified through third-party, military-grade testing. TUF Chokes, also known as the “Alloy Choke”, is a made of a compound of various types of metal instead standard iron, enables the support of up to a massive 40A of rated current, 25% higher than conventional component. Furthermore, the single piece packaing also elimates the emission of virbation noise, delivering superb charactoristics as well as durability under extreme conditions.
sourceQuote:
5 Year Warranty
The ultimate reliability of TUF Series is not only based on advanced thermal design, military-standard components and strict reliability tests, but also on the 5 Years Warranty* which is proudly provided.
The Sabertooth board is tuned more for reliability than performance like the ROG boards...
Hey what I wonder is when they are going to release an actual SB950 driver. Im sure the SB850 driver doesnt fully support the 950. Any thoughts?
Last time i checked all CPUs had to have the same vcore. That would not be the case with HE and non-HE CPUs.
I would prefer the older 870A-UD3 Rev. 3.1. It has better VRMs (low-rds) and more connectors (IDE, eSATA). Furthermore, where I live, it is a bit cheaper, too.
The only thing which is missig is the VRM heat-sink, but well, the low-rds parts are getting less hot:
http://www.gigabyte.com/products/com...pids=3786,3907
Best source i know of is www.amdcompare.com, here are your 2 models:
http://products.amd.com/en-us/Optero...?id=648&id=650
Hi,
According to this page http://www.overclock.net/amd-motherb...tion-list.html
GA-970A-UD3 has low-rds VRMs too... plus heatsink
Think I'll get this cheap baby soon !
Until 8PM local time
No sight of Bulldozer at retail yet
No news