Page 135 of 181 FirstFirst ... 3585125132133134135136137138145 ... LastLast
Results 3,351 to 3,375 of 4519

Thread: AMD Zambezi news, info, fans !

  1. #3351
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by mattkosem View Post
    It doesn't really look that far off TBQH. If we check the difference between the 1100T and the 8150, it shows the 8150 as performing 33% faster. With the 1100T lagging slightly behind the reference i7 950, it's hard to draw a solid conclusion from that first pic anyways. We also don't know how accurate that second slide is either. The subtext indicates that the numbers are AMD's own official results (and intel's own on the 3960x), but from where and with what system configuration?

    --Matt
    Actually what we have now in the latest slides from AMD and what we saw from that first Scorpius slide is as different as night and day. Originally , the slide was portraying Zambezi as Thuban crushing machine,it was a massive 86% faster in legacy SSE code (C11.5) that supports 8 threads, a massive 50% faster in 3dmark 06 that is not so well threaded and a solid 15 to 20% faster in serial code such as PCmark TV and Movies benchmark. What we have now in all these previews and even "official" AMD slides that leaked is quite a bit slower Zambezi,a chip that barely can beat Thuban 6C in many workloads and that is on average just a bit faster than 1100T. It still has massive OC headroom though,but none of the advantages of new floating point unit is there any more. New FlexFP is now lagging behind one thuban core in many workloads,instead of outperforming it significantly. Something is not right. We can wait until launch for real benchmarks but probably we won't see much of a change .

  2. #3352
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Old slides is probably based on CPU simulation, but in realworld scenario BD marchitecture behaves very different. Maybe some of advanced features are disabled because of serious bug, and there is performance penalty. I can't wait to see BD errata list. I can't believe in that the cache architecture is simple reason for low performance because they can simulate how much performance penalty came from smaller 16K L1D, and WT based cache policy. Hit rate of 4-way 16KB is only 1 or 2 % less than 64K 2-way. 2MB L2 has also much higher hit rate than 0.5MB L2. Maybe BD is better optimized for large data work sets.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

  3. #3353
    Xtreme Addict
    Join Date
    Dec 2007
    Location
    Hungary (EU)
    Posts
    1,376
    Quote Originally Posted by xdan View Post
    AMD lied saying that BD is an 8 core. In fact even now is lying. It is not.
    Can you tell me that a non 8 core processor how can do 6.72x speedup in a multithreaded task?

    cb11.png
    -

  4. #3354
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by rog View Post
    There is no performance difference between 0.9x and 1.x Agesa code at all.
    There are indeed only small diffs in the disassembled AGESA code. Did you perform tests?
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  5. #3355
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by drfedja View Post
    Old slides is probably based on CPU simulation, but in realworld scenario BD marchitecture behaves very different. Maybe some of advanced features are disabled because of serious bug, and there is performance penalty. I can't wait to see BD errata list. I can't believe in that the cache architecture is simple reason for low performance because they can simulate how much performance penalty came from smaller 16K L1D, and WT based cache policy. Hit rate of 4-way 16KB is only 1 or 2 % less than 64K 2-way. 2MB L2 has also much higher hit rate than 0.5MB L2. Maybe BD is better optimized for large data work sets.
    The slide is from December 2010. Bulldozer taped out in Q2 2010. The data is most likely NOT based on simulations but on real hardware. The clocks on the initial steppings could be low but they can extrapolate performance of higher clocked models easily. The point is that by Dec 2010,the design was well pass the finish point and they were doing the sampling to partners. The only problem they may hit by that time is low clock,low yield or leaky parts with power draw issues(or any combination of those). If anything was wrong(design bug)they would have known it and figured it in all the performance expectations and hence that scorpius slide would have it too. That is why I think there is no performance problem but something else is going on.

    @ Oliverda

    While the scaling is that of an 8 core chip, the single core performance is abysmal. Leaks point out to a single thread score of around 24% lower than one Thuban core at same clock. Either both fmacs are not utilized in single thread test or something else is the problem. 2x fmac should not be slower in legacy code than one Thuban core,especially since we know that fmacs are more flexible and can do any of the fp commands.

  6. #3356
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by Oliverda View Post
    Can you tell me that a non 8 core processor how can do 6.72x speedup in a multithreaded task?

    cb11.png
    BD modules scales very well, but there is problem with very very low per module IPC. It is ~30% lower than K10.5.

    Quote Originally Posted by informal View Post
    The slide is from December 2010. Bulldozer taped out in Q2 2010. The data is most likely NOT based on simulations but on real hardware. The clocks on the initial steppings could be low but they can extrapolate performance of higher clocked models easily. The point is that by Dec 2010,the design was well pass the finish point and they were doing the sampling to partners. The only problem they may hit by that time is low clock,low yield or leaky parts with power draw issues(or any combination of those). If anything was wrong(design bug)they would have known it and figured it in all the performance expectations and hence that scorpius slide would have it too. That is why I think there is no performance problem but something else is going on.
    Maybe data from slides was came from earlier time than Q2 2010. Maybe R&D team was thinking that some mysterious bug can be solved before launch. Maybe they don't know what's wrong with BD....
    Last edited by drfedja; 09-30-2011 at 04:49 AM.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

  7. #3357
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by BeepBeep2 View Post
    chew* stated something about SuperPi, a single threaded program gaining a significant amount when set to more than one thread...we will see where this progresses upon release/NDA release in my opinion.
    He wrote about running SPi on a module while allowing a 2nd thread to run on it and running SPi w/ only one core enabled on that module.
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  8. #3358
    Registered User
    Join Date
    Mar 2011
    Location
    South Italy
    Posts
    89
    Quote Originally Posted by informal View Post
    Actually what we have now in the latest slides from AMD and what we saw from that first Scorpius slide is as different as night and day. Originally , the slide was portraying Zambezi as Thuban crushing machine,it was a massive 86% faster in legacy SSE code (C11.5) that supports 8 threads, a massive 50% faster in 3dmark 06 that is not so well threaded and a solid 15 to 20% faster in serial code such as PCmark TV and Movies benchmark. What we have now in all these previews and even "official" AMD slides that leaked is quite a bit slower Zambezi,a chip that barely can beat Thuban 6C in many workloads and that is on average just a bit faster than 1100T. It still has massive OC headroom though,but none of the advantages of new floating point unit is there any more. New FlexFP is now lagging behind one thuban core in many workloads,instead of outperforming it significantly. Something is not right. We can wait until launch for real benchmarks but probably we won't see much of a change .
    that's exaclty what I'm saying.

    In the 1st slide, below the graph we see

    "estimates and projections subject to change "

    but "change" cannot be 100% to 0%

  9. #3359
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by drfedja View Post
    BD modules scales very well, but there is problem with very very low per module IPC. It is ~30% lower than K10.5.
    In CB it's - besides other factors - a 4 vs 6 FPU fight.
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  10. #3360
    Registered User
    Join Date
    Mar 2011
    Location
    South Italy
    Posts
    89
    Quote Originally Posted by Dresdenboy View Post
    In CB it's - besides other factors - a 4 vs 6 FPU fight.
    I think amd engineers know that fpu are less, and the related consequences.
    So why they made a slide showing zambezi with DOUBLE performance of thuban in cinebench 11.5 if it can be even worse than thuban (due to fpu and other factors)?

    How did they made that slide? they were drunk? we are not talking about an error of 5-10%, but we're talking about performance that had to be doubled and instead are the same. I cannot think they make such a big fail in prediction.
    or we have to think that the leaked test are fake?

    the two things are like night and day, as informal said.

  11. #3361
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by Dresdenboy View Post
    In CB it's - besides other factors - a 4 vs 6 FPU fight.
    I think that in CB code FPU is underutilised. Because of that, two integer blocks can push enough data to process with FPU and thread scaling is good. But I think, there is the problem with weak or narrow integer cores, maybe some serious L1D cache design flow.
    4 BD FPU's doing job pretty well comparing to integer blocks. There is more scenarios where BD shows weak integer performance.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

  12. #3362
    Xtreme Member
    Join Date
    Sep 2010
    Posts
    139
    Quote Originally Posted by Dresdenboy View Post
    In CB it's - besides other factors - a 4 vs 6 FPU fight.
    Well the other thing, and the most likely thing is that R11.5 isn't limited by fpu power at all for BD. Given the scaling on the BD leaks, adding another thread on the same module doesn't drop performance of either thread. If you look at intel HT, it doesn't reach the scaling AMD gets while the fpu is also SMT like. (running 2threads on them is hurting performance of each thread dramatically)

    So performance of R11.5 isn't limited by the fpu resources but by something else on BD. Would also expleain why the leaked sisoft and other benchmarks the BD is ahead of the K8like architecture in fpu on a core and clock basis.


    edit: i'm with drfedja on this.

  13. #3363
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by liberato87 View Post
    you mean fx 8150p?
    It would be a fail if the 8150p < 1100t.

    anyone remember this (december 2010) : zambezi performs 50% better than 1100t.

    and we can read above "render performance is based on Cinebench R11.5"

    Attachment 120656

    somoene lied..

    Please, these slides were obvious fakes already back then! There where several things that was clearly wrong. I told people not to trust them because I knew people would start accusing AMD of lying. I know you just staded that someone lied, but know people are at it again. Look at the post below.

    Quote Originally Posted by xdan View Post
    AMD lied saying that BD is an 8 core. In fact even now is lying. It is not.
    Second, if BD is further delayed for an actual launch, on 12 octomber we should have at least a "paper launch".
    It's an eightcore, it has 8 integer clusters and 8 FPUs, the only thing is that a pair of cores can borrow FPUs from each other and share some L1 cache and prefetch and scheduler.

  14. #3364
    Xtreme Enthusiast
    Join Date
    Sep 2008
    Location
    ROMANIA
    Posts
    687
    It's an eightcore, it has 8 integer clusters and 8 FPUs, the only thing is that a pair of cores can borrow FPUs from each other and share some L1 cache and prefetch and scheduler.
    It's an 4 core, 8 integer clusters simple and plain.
    More like a HT thing.
    Intel Core quad aproach( to put 2 dual cores together) was more "true" core design than BD is now.
    8 cores, o yeah...
    Last edited by xdan; 09-30-2011 at 05:50 AM.
    i5 2500K@ 4.5Ghz
    Asrock P67 PRO3


    P55 PRO & i5 750
    http://valid.canardpc.com/show_oc.php?id=966385
    239 BCKL validation on cold air
    http://valid.canardpc.com/show_oc.php?id=966536
    Almost 5hgz , air.

  15. #3365
    I am Xtreme FlanK3r's Avatar
    Join Date
    May 2008
    Location
    Czech republic
    Posts
    6,823
    Quote Originally Posted by liberato87 View Post
    you mean fx 8150p?
    It would be a fail if the 8150p < 1100t.

    anyone remember this (december 2010) : zambezi performs 50% better than 1100t.

    and we can read above "render performance is based on Cinebench R11.5"

    Attachment 120656

    and now?
    watching the slides of donaninhaber we see in cinebench that

    Attachment 120657

    somoene lied..
    My meaning, there is some problem. Multiefectivity between cores/moudles is good, score is whorse than in original slide, so problem could be in module or in core unit-more lower performance. I dont know which exactly problem, but...hope, will be fixed in Piledriver.
    ROG Power PCs - Intel and AMD
    CPUs:i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350,1x AMD FX-8320,1x AMD FX-8300, 2x AMD FX-6300,2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K,AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K,2x i7-4770K, Intel i7-3930KAMD Cinebench R10 challenge AMD Cinebench R15 thread Intel Cinebench R15 thread

  16. #3366
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by flyck View Post
    Well the other thing, and the most likely thing is that R11.5 isn't limited by fpu power at all for BD. Given the scaling on the BD leaks, adding another thread on the same module doesn't drop performance of either thread. If you look at intel HT, it doesn't reach the scaling AMD gets while the fpu is also SMT like. (running 2threads on them is hurting performance of each thread dramatically)

    So performance of R11.5 isn't limited by the fpu resources but by something else on BD. Would also expleain why the leaked sisoft and other benchmarks the BD is ahead of the K8like architecture in fpu on a core and clock basis.


    edit: i'm with drfedja on this.
    Or it is FPU limited but the processor fails to share FPU power between cores. So that all cores just got their own FPU and can't successfully borrow from the others.

  17. #3367
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by xdan View Post
    It's an 4 core, 8 integer clusters simple and plain.
    More like a HT thing.
    Intel Core quad aproach( to put 2 dual cores together) was more "true" core design than BD is now.
    8 cores, o yeah...
    Now your make your own definitions, I can make the definition that CPUs are turds, and accuse Intel of lying then they're saying that their CPUs isn't turds. But that doesn't mean I'm right or that Intel actually is lying. You can only accuse AMD of lying in this matter if you make up your own definitions of what the truth is.
    According to the generally accepted definition BD is eight core. You can keep your definitions.
    But out of curiosity. What defines a core to you?

  18. #3368
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    One thing we know for sure now. 32nm production is not where AMD wanted it to be. Llano as a high volume product is eating a lot of capacity and it is yielding much under desired level. On top of that the production is shared between current 45nm and 32nm product lines . This is further more complicating the situation. Bulldozer is large die product and with all factors combined it's no wonder why AMD pushed it back to Q4. As for results maybe none of the results are true since AMD doesn't want to generate unnecessary hype around a product that it can't deliver in large quantities. They would devalue their current product lines and would be unable to supply enough of new products (both llano and server/desktop Bulldozer).

  19. #3369
    Xtreme Enthusiast
    Join Date
    Sep 2008
    Location
    ROMANIA
    Posts
    687
    It was said before by, Chew, who is a better expert.


    Read also that PDF http://www.freepatentsonline.com/20080209173.pdf See that pdf figure 3 what is a true core...A true core doesn't share L1 cache, FPU, prefetch with other core...
    A core could have 2 clusters which share between them all this and work in 2 threads....
    Anyway AMD 4C/8treads cpu beats INTEL 2500K quad would have sound better than...
    AMD 8 core beats Intel 2500K quad...
    Last edited by xdan; 09-30-2011 at 06:29 AM.
    i5 2500K@ 4.5Ghz
    Asrock P67 PRO3


    P55 PRO & i5 750
    http://valid.canardpc.com/show_oc.php?id=966385
    239 BCKL validation on cold air
    http://valid.canardpc.com/show_oc.php?id=966536
    Almost 5hgz , air.

  20. #3370
    Xtreme Addict
    Join Date
    Dec 2007
    Location
    Hungary (EU)
    Posts
    1,376
    Quote Originally Posted by xdan View Post
    It was said before by, Chew, who is a better expert.

    Read also that PDF http://www.freepatentsonline.com/20080209173.pdf See that pdf figure 3 what is a true core...
    You still haven't answered my question.
    -

  21. #3371
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by xdan View Post
    AMD lied saying that BD is an 8 core. In fact even now is lying. It is not.
    the whole core argument started up again right here when you said this

    fact is most of us dont care at all what AMD calls it, or what you call it, or what even i call it. most posts i say module and thread now just to keep people happy and not let them derail a thread with this marketing nonsense every day. since thread scaling gets better by having L3, does that mean the latest Athlons are no real quads? or does that mean Phenom is just somehow a better quad. and if so then maybe its possible to consider BD a octo core, but just not a very good one. we can skin this cat with more posts than this thread is, but in the end our answer will not make a single difference on the performance.
    2500k @ 4900mhz - Asus Maxiums IV Gene Z - Swiftech Apogee LP
    GTX 680 @ +170 (1267mhz) / +300 (3305mhz) - EK 680 FC EN/Acteal
    Swiftech MCR320 Drive @ 1300rpms - 3x GT 1850s @ 1150rpms
    XS Build Log for: My Latest Custom Case

  22. #3372
    Xtreme Member
    Join Date
    Apr 2005
    Location
    London, UK
    Posts
    261
    Did I just step into time machine and it send me few months back? How many times did we have that discussion about cores :/ getting really boring

  23. #3373
    Xtreme Member
    Join Date
    Jan 2008
    Posts
    299
    This thread will be cyclical until AMD releases it.

  24. #3374
    Xtreme Addict
    Join Date
    Dec 2008
    Location
    Sweden, Linköping
    Posts
    2,034
    4 cores with 8 threads, or 8 cores with 8 threads. Who cares, it's not like it will change the price, performance or power consumption depending on what it's called.
    SweClockers.com

    CPU: Phenom II X4 955BE
    Clock: 4200MHz 1.4375v
    Memory: Dominator GT 2x2GB 1600MHz 6-6-6-20 1.65v
    Motherboard: ASUS Crosshair IV Formula
    GPU: HD 5770

  25. #3375
    Registered User
    Join Date
    Mar 2011
    Location
    South Italy
    Posts
    89
    Quote Originally Posted by FlanK3r View Post
    My meaning, there is some problem. Multiefectivity between cores/moudles is good, score is whorse than in original slide, so problem could be in module or in core unit-more lower performance. I dont know which exactly problem, but...hope, will be fixed in Piledriver.
    that's what I am thinking too.
    there is some problem or someone lied (amd or the slides are fakes).

    but.. BD was for q2, then step to "60-90 days" from 1st june.. now from q3 to q4.
    I think amd knows the problems, if there are problems, and they have not found these problems yesterday..
    So I hope that the problems, that are big if we have less performance than thuban, had been solved in this 6 months of delay.

    If the performance are these, even if only cinebench (ok, this is only one bench but... it is deep under the amd estimated performance) is worse than thuban, IMHO amd should not release bd, even if this is price competitive, because every review will kick it because, the delays, the silence (last official statement "60-90 days from 1st june", then from q3 to q4 without ONE official word) the hype created by the "return of the fx comics" and the world record, will affect against amd.

    And we are talking about the "top" 8 core cpu..
    I cannot think the fx 4xxx should perform less than a phenom II 955..

    But I remember that movieman talked about bulldozer like a winner.. I hope he's right.

Page 135 of 181 FirstFirst ... 3585125132133134135136137138145 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •