MMM
Results 1 to 25 of 4519

Thread: AMD Zambezi news, info, fans !

Hybrid View

  1. #1
    Xtreme Member
    Join Date
    Sep 2008
    Location
    Eastern Tennessee (from Minnesota)
    Posts
    241
    Quote Originally Posted by undone View Post
    I wonder where did you read the '80-90% of the efficiency of a true dual core' statement, if it's true the Zambezi 8 core will only act like 8 * 80% = 6.4 cores in worst scenario?
    AMD's Bulldozer Blog (centralized around Server, though). The exact entry I had linked to about 1/3 or 1/2 way through the thread, but for a different reason. I'd have to read it again to make sure this is accurate (which I've not the time to do atm), but I believe it said something on the order that if each module were working on one thread, it would yield quite a bit more performance than a single Magny-Cours core (heh); however, a single module would only be about 90% of the performance of two of Magny's cores.

    Since I'm 98% sure what I read was by JF (John) anyways, this is just as valid as whatever blog post I'm referring to (though I think it might be one of the 20 Questions posts found here):
    http://www.xtremesystems.org/forums/...1&postcount=67

  2. #2
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by Formula350 View Post
    however, a single module would only be about 90% of the performance of two of Magny's cores.
    Where did you read that exactly?

  3. #3
    Xtreme Member
    Join Date
    Sep 2008
    Location
    Eastern Tennessee (from Minnesota)
    Posts
    241
    Quote Originally Posted by informal View Post
    Where did you read that exactly?
    What I wrote might not be verbatim, hence my disclaimer that it might not be accurate and that I am going off memory lol Regardless of if I was close, or wayyyy off, I read pretty much all of the Bulldozer related blog entries at AMD. All three of the 20-Questions entries are a good read and where I think John may have said what I was going off of, but I was on Vacation for all of July (without internet) so my brain has been making room for new info while playing catch-up haha If I get time after editing reviews I'll try to dig for the specific blog link.

  4. #4
    Xtreme Enthusiast
    Join Date
    Oct 2005
    Location
    Melbourne, Australia
    Posts
    529
    Quote Originally Posted by Formula350 View Post
    however, a single module would only be about 90% of the performance of two of Magny's cores.
    No.

    If both cores in each module are being used, AMD says it would get 80% higher performance (i.e 180% of single core performance).
    That percentage had nothing to do with K10.5 - it's looking solely at one BD module, with single thread performance vs two thread performance.

    If you ran each thread in separate modules, you'd get better scaling than that though.

  5. #5
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by Formula350 View Post
    however, a single module would only be about 90% of the performance of two of Magny's cores.
    JF doesn't said that. He said that the one module has 1.8x of performance scaling with two threads on it. Single module with two threads would be 90% of the performance of two BD modules with two threads. In other words CMT - clustered multithreaded module has 90% of chip multiprocessed two cores with same type of microarchitecture. There isn't comparison with Magny Cours or 10h.

    Since I'm 98% sure what I read was by JF (John) anyways, this is just as valid as whatever blog post I'm referring to (though I think it might be one of the 20 Questions posts found here):
    http://www.xtremesystems.org/forums/...1&postcount=67
    He talks about mythical 6-core bulldozer:

    Mythical 6-core bulldozer:
    100% + 95% + 95% + 95% + 95% + 95% = 575%

    Orochi die with 4 modules:
    180% + 180% + 180% + 180% = 720%

    What if we had just done a 4 core and added HT (keeping in the same die space):
    100% + 95% +95% +95% + 18% + 18% + 18% + 18% = 457%

    What about a 6 core with HT (has to assume more die space):
    100% + 95% +95% +95% +95% +95% + 18% + 18% + 18% + 18% + 18% + 18% = 683%
    And someone concluded that the IPC decreases. :d
    And, from that moment, there is so much people who think that BD has decreased IPC from 10h.

    Quote Originally Posted by FlanK3r View Post
    beep: of course, but what is exactly IPC?
    IPC = Instructions Per Cycle. This is a number who's tells us how many instructions CPU can retire per one cycle. Today modern Out Of Order processors have many execution units, they can execute more than one instruction per cycle, parallel, and out of order. Because program code is in order, there is quite difficult to make such machine to execute out of order, because of data dependencies, too many branches in code, etc.
    IPC is software performance measuring unit. If code is properly optimised, it can run on CPU, or even GPU with more instructions per cycle. However, also if CPU is faster, than it can run same software with higher IPC number.

    Maybe here is not talking about core to core, but about cores to cores (6 vs 8 etc)
    If BD has 50% more throughput than 6-core 10h, at probably same clock, that means the server workload uses all cores. That means the core per core BD vs 10h, BD has 12.5% more throughput than 10h, but single module can do 25% more serialized, single thread jobs than one single core.
    Last edited by drfedja; 08-27-2011 at 05:09 PM.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

  6. #6
    Xtreme Member
    Join Date
    Jan 2011
    Location
    145.21.4.???
    Posts
    319
    Quote Originally Posted by drfedja View Post
    Mythical 6-core bulldozer:
    100% + 95% + 95% + 95% + 95% + 95% = 575%

    Orochi die with 4 modules:
    180% + 180% + 180% + 180% = 720%

    What if we had just done a 4 core and added HT (keeping in the same die space):
    100% + 95% +95% +95% + 18% + 18% + 18% + 18% = 457%

    What about a 6 core with HT (has to assume more die space):
    100% + 95% +95% +95% +95% +95% + 18% + 18% + 18% + 18% + 18% + 18% = 683%
    Interesting, this calculation take HyperThreading for 18% increase into account, which I often think HT may bring 25-30% additional throughput.

  7. #7
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by undone View Post
    Interesting, this calculation take HyperThreading for 18% increase into account, which I often think HT may bring 25-30% additional throughput.
    I don't think so that HT brings 25-30% on throughput on average.... 18% is more realistic number.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •