MMM
Results 1 to 25 of 4519

Thread: AMD Zambezi news, info, fans !

Hybrid View

  1. #1
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by Opteron146 View Post
    That is totally ok, BD's L1 is write through, i.e. writes to the L1 go directly to the L2, thus the L1 and L2 write performance should be more or less the same.
    However, I wonder what is happening with the L2 read performance, for some strange reason it seems to depend on uncore clock:

    2.0GHz: 11.9 GB/s
    2.2GHz: 35.8 GB/s
    2.4GHz: 12.5 GB/s
    2.6GHz: 36.8 GB/s

    That's a big difference ...
    Looks like a bug. And I guess the will be some interesting results if you starts playing with the core frequency as well. Could it be problems with the sync?

    And the whole write through idea looks like crap to me. What is the advantage? I think I can see some drawbacks.

  2. #2
    Xtreme Member
    Join Date
    Aug 2004
    Posts
    210
    Quote Originally Posted by -Boris- View Post
    And the whole write through idea looks like crap to me. What is the advantage? I think I can see some drawbacks.
    Mainly clock speed due to a less complex design. Maybe also some power constraints, they only use parity check, no ECC, for the L1 that saves some transistors, too.

  3. #3
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by Opteron146 View Post
    Mainly clock speed due to a less complex design. Maybe also some power constraints, they only use parity check, no ECC, for the L1 that saves some transistors, too.
    Is that worth the loss of IPC? IPC usually is a better route than frequency from a power perspective. And frequency gains seems to be small no matter what nowadays so the possibly higher frequencies doesn't seem to make a big impact on the end performance.

  4. #4
    Xtreme Member
    Join Date
    Sep 2010
    Posts
    139
    Quote Originally Posted by -Boris- View Post
    Is that worth the loss of IPC? IPC usually is a better route than frequency from a power perspective. And frequency gains seems to be small no matter what nowadays so the possibly higher frequencies doesn't seem to make a big impact on the end performance.
    IPC is just as bad as frequency from a power perspective. ipc cannot be scaled up without huge power drawbacks. Both ipc/frequency should have a good relation to have proper performance. If the difference is to big you can expect bad performance. None of those extremes are at work for SB, BD, K8, nehalem, CoreDuo.

    Also the write through concept isn't really an issue because the stores should be stored in a buffer and are written out more bandwidth driven (bundled if possible) to have less latency impact. (Whether or not it turns out that way needs to be seen).

    it will have an impact if the l2 cache is extremely slow as results above indicate. However Aida64 also reported that Brazos has 0MB/s write on Brazos... so if te tested version doesn't support it properly strange result are common.

  5. #5
    Xtreme Member
    Join Date
    Aug 2004
    Posts
    210
    Quote Originally Posted by -Boris- View Post
    Is that worth the loss of IPC? IPC usually is a better route than frequency from a power perspective. And frequency gains seems to be small no matter what nowadays so the possibly higher frequencies doesn't seem to make a big impact on the end performance.
    That's traditional, old thinking, it was true until the leakage went through the roof. For IPC you need lots of transistors, hence the fat cores of e.g. intels current architecture, but leakage will be worse and worse with every shrink. Thus the new idea is to cut as much transistors when it does not hurt that much. It's like with a car engine, you can easily construct an 8 cylinder engine with ~500 hp, but if you around 800 hp and up it gets tricky and you have to use lots of time & money to find and build some really complicated tricks. In short: the return of investments gets less and less.
    So in future it is easier to build smaller cores. Bulldozer is just the first generation. I expect some IPC improvments in later versions again, but they won't cost too much transistors ;-)

  6. #6
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by Opteron146 View Post
    That's traditional, old thinking, it was true until the leakage went through the roof. For IPC you need lots of transistors, hence the fat cores of e.g. intels current architecture, but leakage will be worse and worse with every shrink. Thus the new idea is to cut as much transistors when it does not hurt that much. It's like with a car engine, you can easily construct an 8 cylinder engine with ~500 hp, but if you around 800 hp and up it gets tricky and you have to use lots of time & money to find and build some really complicated tricks. In short: the return of investments gets less and less.
    So in future it is easier to build smaller cores. Bulldozer is just the first generation. I expect some IPC improvments in later versions again, but they won't cost too much transistors ;-)
    That's exactly how I look at high frequencies. Leakage grown lineary with transistors but exponentially with frequency, and exponentially with voltage.

  7. #7
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by -Boris- View Post
    That's exactly how I look at high frequencies. Leakage grown lineary with transistors but exponentially with frequency, and exponentially with voltage.
    Sure, you could use more power consuming logic or drive the single transistors faster to reach higher clockspeeds. But you could just use less logic, thus less transistors, which need to switch during a clock phase. So you get higher clocks w/o increasing power consumption, maybe even lowering it. This is what's AMD's way of going to lower FO4 delays (per pipelinestage).
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  8. #8
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by -Boris- View Post
    That's exactly how I look at high frequencies. Leakage grown lineary with transistors but exponentially with frequency, and exponentially with voltage.
    i dont think leakage is exponential with clock speed, ive done the tests with thuban and it seemed power draw was near linear with clock speed, but exponential with voltage. but you need volts for higher clocks so they practically go hand in hand.
    2500k @ 4900mhz - Asus Maxiums IV Gene Z - Swiftech Apogee LP
    GTX 680 @ +170 (1267mhz) / +300 (3305mhz) - EK 680 FC EN/Acteal
    Swiftech MCR320 Drive @ 1300rpms - 3x GT 1850s @ 1150rpms
    XS Build Log for: My Latest Custom Case

  9. #9
    Administrator
    Join Date
    Nov 2007
    Location
    Stockton, CA
    Posts
    3,569
    As some of you know OBR was banned here sometime ago. We do not allow reposting of material from banned members.

    Take this as a warning, anymore posting of OBR screens or links pointing to his materal will/may result in a vacation.

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •