Results 1 to 25 of 4519

Thread: AMD Zambezi news, info, fans !

Hybrid View

  1. #1
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by xsecret View Post
    If Sandra uses AVX in the MM Benchmark, that's not strange. Thuban doesn't have AVX and all benchmarks using it are 10 times slower.
    Why do you think that AVX is so much powerful than SSE? Thuban Core and BD module can execute same number of raw FLOPS. AVX and SSE are vectorised packed FP instructions. BD module can execute one 256-bit AVX which contain 4DP FP operations, same as two 128-bit AVX or SSE. In some cases 256 AVX can be faster, but how much? Two times...

    Quote Originally Posted by radaja View Post
    So going from 3.1Ghz to 3.6Ghz do these 2 CB scores seem to be inline with reality?
    CB scales perfectly with frequency. 3.6/3.1*5.24 = 6.08. Something is wrong here with this results or frequency of CPU's isn't accurate. Actually I think that is much lower than CPUz's readings.
    Last edited by drfedja; 09-11-2011 at 03:22 PM.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

  2. #2
    Xtreme Member
    Join Date
    Aug 2004
    Posts
    210
    Quote Originally Posted by drfedja View Post
    Why do you think that AVX is so much powerful than SSE? Thuban Core and BD module can execute same number of raw FLOPS. AVX and SSE are vectorised packed FP instructions. BD module can execute one 256-bit AVX which contain 4DP FP operations, same as two 128-bit AVX or SSE. In some cases 256 AVX can be faster, but how much? Two times...
    Yes, AVX would do nothing, but FMA could be the big difference. SiSoft normally always programs special code for each CPU, thus on Bd, it should use XOP&FMA.

  3. #3
    Xtreme Member
    Join Date
    Apr 2007
    Location
    Serbia
    Posts
    102
    Quote Originally Posted by Opteron146 View Post
    Yes, AVX would do nothing, but FMA could be the big difference. SiSoft normally always programs special code for each CPU, thus on Bd, it should use XOP&FMA.
    I agree, but we don't know how SiSoft works with FMA and XOP turned on and off. We will know when we get BD on the bench table.

    Quote Originally Posted by informal View Post
    As you can see,same peak flops in all 3 cases. I rest my case .
    Yes, but what is the module count ? For 64 DP FLOPS you must have 8 SB cores and 16 FlexFP's. That slide is BS, because there is no CPU with 32 BD cores, or 16 BD modules. Interlagos has 8 BD modules or 8 FlexFP's which can execute up to 32 DP FLOPS, or 64 SP FLOPS.
    If you compare 8 core Xeon and 16 core Interlagos that slide make sense.

    Quote Originally Posted by BeepBeep2 View Post
    @ rajada
    yes.

    13% increase in performance over 12.5% increase in base clock speed. Factor complex turbo in and it seems logical to me.
    No, there is 16% increase in clock speed and 13% increase in performance. Gap is too big between increase of frequency and performance or scaling is too bad.
    Last edited by drfedja; 09-11-2011 at 03:40 PM.
    "That which does not kill you only makes you stronger." ---Friedrich Nietzsche
    PCAXE

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •