It really should have been three or four posts...and I seem to add on to my thoughts so a lot of information does get jumbled up.
Ignore my jumbled response to manicdan + freeloader on "stars" arch to 8 cores on 32nm and the effects it would have on performance + die space. (in freeloaders case, 8 core with 12MB L3 + tri channel memory) would be a larger die on 32nm than thuban is on 45nm.
As far as Bulldozer:
Yes you are right, no "real" official benchmarks.
But there are two peices of information available:
chew*'s words about how the architecture is more like a 4 core with HT + leaked unofficial benchmarks that show multi-threading performance in several cases lower than Thuban, making some people assume that IPC is lower than Deneb. People are looking at these results and dividing by 7 to 8 for "theoretical leaked single thread performance" and talking about how IPC is lower. Maybe it is(!)
...chew* states that running a program as simple as pi on two cores "one module" brings better SINGLE thread performance, goes back to what I predicted a month ago as soon as he said the architecture works more like a 4 core with HT the first time.
Several weeks ago, my prediction was this:
Higher IPC in 4 threads or less
Lower IPC in more than four threads, but better scaling than HyperThreading.
Hopefully this post is a little easier to understand
EDIT:
Cleaned up the first post a bit if you'd like to take another stab at it.
Bookmarks