amd has their definition of core, some do not agree with that definition
but the product is not going to perform any differently based on the name
Printable View
amd has their definition of core, some do not agree with that definition
but the product is not going to perform any differently based on the name
I'm not disagreeing with you but some people you just can't please. It has shared resources, big deal. I read somewhere that the module is 80-90% of the efficiency of a true dual core. Close enough for me to call a module two cores. :)
BD integer scheduler receives the 4 macro ops but the very execution units work on micro ops sent by that same scheduler. Execution units can only work with micro ops. Back end's job is to retire the macro ops that schedulers received. Per clock and per AMD (per AMD"s Mike Bulter) each core can retire 4 macro ops.
freeloader yet you are still missing 10-20% of performance, thats a lot not to mention what they added was just a second cluster which means only 20% bigger core size without L2 cache, true 2 core without sharing would mean 100% bigger core size on a die.
something like this(without L2 cache)
module without one integer(no sharing, everything can be used by one integer cluster) 15.58mm2
module design 19.4mm2
2 core design 2*15.58mm2=31.16mm2
The performance gain per area increase is worth the performance penalty but comparing it to a regular 8 core is not right in my opinion thanks to the penalty hit.
informal thanks for the info.
This is from dresdenboy @ AT:
Quote:
Originally Posted by dresdenboy
It will be a few weeks before Newegg put it on sale, lets just stop arguing:).......unless peep really work at AMD's BD design team. Lol
i think nda is up the 9/9 read that somewhere.
It's in official slides from FAD 2010. This is an average number,there are probably some cases where sharing doesn't cost any performance and somewhere where it does cost more. BTW Bulldozer has all (integer) cores turbo so this alone will negate some of the penalties that *may* occur.
Also note that there is no multicore chip design out there that scales perfectly with more threads and at the same time has none of the MT technologies(fine grained,coarse grained,SMT). Usually conventional CMP designs (a la Opteron,Conroe) scale ~90% with second thread,on well multithreaded workloads.
Too me it's no different than Intel classifying it's parts as having 4 cores/8 threads. Bulldozer's turbo function will also make up for some of the 10 to 20% deficiencies in a multi threaded work load, however no processor (that I'm aware of) with a shared cache system can obtain a 100% unified work load over two cores on the same die. (cache latency, cache miss, operating system overhead, etc, etc...)
The problem with comparing BD and SB with SMT is that in SB's case you have 3 ALU units(and 2 AGU units) in each integer core that are shared by 2 execution threads. The main idea behind this concept is to use pipeline as much as possible and not let those 3 ALU units sit idle (note that those ALUs are both for int and fp!). This happens usually if MT code is not written well enough or the nature of the problem that code is written for is such that it is not parallel-friendly((not able to be broken into smaller amounts of threads and efficiently executed due to many dependencies among the threads).
Now in BD's case,in each module you have a big shared front end and behind it you have 2 execution cores each of which has 2 integer ALUs and 2(+2) floating point execution units. As you can see there is a BIG difference between AMD's "cores" and intel's "threads/cores". Former are real hardware units that are partly dedicated(int) and partly shared(FP) while in later case we have fixed amount of units that are shared equally between 2 threads.
i think at this point we all understand how it works..
a 4 module bulldozer has 8 cores. yes some shared resources but regardless there is still 8 cores in the cpu. intel cpu has 4 cores, 8 threads. bd has 8 cores 8 threads. if you still see it as a 4 core cpu that is perfectly fine with me, no matter how we see/classify it we will all have the same thing powering our pc :D
anyways this is a 100% pointless "argument" so i will return to how i have been the last n months, quietly waiting patiently :)
Yes and no. A module can potentially retire up to 8 macro ops per cycle when executing two threads and tracking two contexts. If only executing one thread it can potentially retire only 4.
This is the strongest argument that it is an 8 core CPU because when running 8 threads the total potential retirement will be 4 issue x 8
no need for the smart ass reply
considering amd calls it an 8 core on amd.com but you call it a 4 core, i dont see how you can call anyone out who says its 8core as "knowing more about bd than amd"
but anyways like i said, i honestly could not care any less what people call it because regardless its performance wont change depending on what the owner calls it.
edit: i even said in my last post i am perfectly fine if you see it as a 4 core so idk why you try to put me/anyone who thinks it 8core on blast like that
I think a lot of the guys here should work for AMD,
maybe they could come up with a better CPU.?
crazydiamond I'm pretty sure chew has Bulldozer in possession or has seen real internal numbers and you haven't so why argue back? Just because marketing team thinks its cool to make sure everyone knows it's an 8 core doesnt mean :banana::banana::banana::banana:.
Very strong 4 core + 4 half cores = great 4 threads and moderately crippled 8 threads. I expect IPC to be up well, but not when all 8 threads are used. Multi-core scaling will be nothing like people will want it to be, but in four threads I believe AMD will be very close to Sandy Bridge.
His smartass reply is in reply to your smartass reply about his facts...engineer behind Bulldozer that Brian talked to is not advertising/marketing person. Each module is literally a core and a half. Dumbed down it's Hyper-Threading in a physical form, with extra parts in the core not only to help some single thread scenarios but help execute 8 threads...imo I guess AMD should have called each module a core and made it execute 8 threads but then again each module is not a normal core.
I bet the CPU is around Thuban performance in multi-thread, if AMD wanted a good high core CPU the stars core is still a great arch. It's very competetive with Sandy Bridge from a performance per mm2 standpoint core only. Unfortunately IPC must increase which means larger cores and AMD doesn't have the cache density intel has so something has to give in the multi-threading dept for that to happen...
+1
/end sarcasm
thx for the lulz...
I'm so glad you've tested the CPU yourself and know exactly how wrong chew* is. Why the hell do you think chew* is leaving? He gives real information and advice and people like you act like he has no idea what he's talking about. Why do you think Zambezi scores so low in cinebench? I have a hunch that it makes a great quad core...
Are you chew*'s bodyguard or something? If he is not here giving advice, the world will go on :rolleyes: Also it's worth noting crazydiamond never actually accuses chew* of being "wrong", and he's actually stating quite calmly a laughably obvious observation - that it is pointless arguing over whether you define this is a quad core or an octo core. The nameless engineers at AMD can call it a purple-headed people eater for whatever difference it makes (ie, none). AMD is officially calling it an octo core, end of story.
I can't believe I got sucked into posting in this thread, I wish they'd hurry up and bloody release it :yepp:
It's not a smart ass remark, it's fact, merely stating what was said factually in a discussion.
Thank you for reinforcing one of the reasons why I am just about done with this.
Try to help people and get called an ASS, especially after helping that person only days ago in a PM.