Page 7 of 39 FirstFirst ... 4567891017 ... LastLast
Results 151 to 175 of 954

Thread: AMD's Bobcat and Bulldozer

  1. #151
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by god_43 View Post
    honestly wtf cares about single thread performance? geez if ppl cared about it, they would not be buying dual-cores even. the whole nature of multiple cores is for multi-threading, anything else is fail!
    Single-thread performance --> low-thread performance, due to the nature of 2-cored modules and Intel's 2-way HT.

    So you are essentially saying "wtf cares about 4-threaded performance?!?!"

    Well, that encompasses the vast majority of what you'll be doing on the desktop most of the time, so, the answer is: most people.

  2. #152
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by LesGrossman View Post
    That's like saying that with K8 AMD concluded that couldn't match the higher cpu clocks of netburst and that would always have to throw more instructions per clock at Intel MHZ's.

    Crysis 2 will support 8 cores, good luck with the single thread performance thing.
    In that case, you'll have the 8-cored SB 2011 monster able to run one thread per core, while the 8-core Zambezi runs in full module-based-resource sharing mode. The result will not be pretty.

    Again, single thread performance really translates as N-threaded performance, where N is either the number of cores in an Intel device, or the number of modules in a BD device. Because up until those thresholds, shared resources aren't really having much of an impact, and you are not really constrained much by power, etc.

  3. #153
    Xtreme Addict
    Join Date
    Jan 2005
    Posts
    1,366
    Quote Originally Posted by LesGrossman View Post
    Crysis 2 will support 8 cores, good luck with the single thread performance thing.
    Quote Originally Posted by god_43 View Post
    honestly wtf cares about single thread performance? geez if ppl cared about it, they would not be buying dual-cores even. the whole nature of multiple cores is for multi-threading, anything else is fail!
    It's a big mistake to think that a single thread performance doesn't matter. And I don't mean a single threaded programs. Just think about it - what is better -8 fast cores or 8 slow cores in a multithreaded program? Now if we speak about an upcoming high-end desktops then that would be 8x 4-wide 2 thread/core capable SB int cores vs 8x 2-wide Bulld int cores and 8x265 SB fpu vs 4x256 Bulld FPU. Yes, I'm avare about 16-core interlagos, but this probably is going to be a multimodule server oriented chip.

  4. #154
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by btarunr View Post
    There's "L3 Cache and NB" and "Integrated Northbridge Controller".

    The "NB" in "L3 Cache and NB" is the northbridge component found on AMD processors since K8. The "Northbridge Controller" is the IOMMU (what we're used to referring to as northbridge, such as AMD 790FX) integrated into the processor die.
    No, the PCIe controler (like 790FX) isn't integrated. So we still don't know what that part is.
    There are three NBs on that picture, and only one that truly is a northbridge, and that isn't even labeled as a northbridge, it's labeled as an IMC.
    Last edited by -Boris-; 08-24-2010 at 11:40 AM.

  5. #155
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    Quote Originally Posted by terrace215 View Post
    In that case, you'll have the 8-cored SB 2011 monster able to run one thread per core, while the 8-core Zambezi runs in full module-based-resource sharing mode. The result will not be pretty.

    Again, single thread performance really translates as N-threaded performance, where N is either the number of cores in an Intel device, or the number of modules in a BD device. Because up until those thresholds, shared resources aren't really having much of an impact, and you are not really constrained much by power, etc.
    in realistic cases it means that any 8core cpu can probably run crysis at 2ghz cause it would be built around 3ghz quads for high, and 2.5ghz duels as the normal.

    doubt either 8core chip will "lag" in crysis

  6. #156
    Registered User
    Join Date
    Sep 2009
    Posts
    77

  7. #157
    Xtreme Addict
    Join Date
    Dec 2007
    Location
    Hungary (EU)
    Posts
    1,376
    delete pls
    -

  8. #158
    I am Xtreme FlanK3r's Avatar
    Join Date
    May 2008
    Location
    Czech republic
    Posts
    6,823
    read last post from yuri.cs Now, i dont know reality about AM3+ vs AM3
    http://translate.google.cz/translate...409%23p8041409
    ROG Power PCs - Intel and AMD
    CPUs:i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350,1x AMD FX-8320,1x AMD FX-8300, 2x AMD FX-6300,2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K,AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K,2x i7-4770K, Intel i7-3930KAMD Cinebench R10 challenge AMD Cinebench R15 thread Intel Cinebench R15 thread

  9. #159
    Xtreme Mentor
    Join Date
    Jul 2008
    Location
    Shimla , India
    Posts
    2,631
    Quote Originally Posted by kl0012 View Post
    It's a big mistake to think that a single thread performance doesn't matter. And I don't mean a single threaded programs. Just think about it - what is better -8 fast cores or 8 slow cores in a multithreaded program? Now if we speak about an upcoming high-end desktops then that would be 8x 4-wide 2 thread/core capable SB int cores vs 8x 2-wide Bulld int cores and 8x265 SB fpu vs 4x256 Bulld FPU. Yes, I'm avare about 16-core interlagos, but this probably is going to be a multimodule server oriented chip.
    ammm sandy bridge uses a double pumped fpu makeup that means legacy "non-avx" will run it at 8*128 bit, bulldozer will also run the same with 4 modules or 8 cores.

    Quote Originally Posted by FlanK3r View Post
    read last post from yuri.cs Now, i dont know reality about AM3+ vs AM3
    http://translate.google.cz/translate...409%23p8041409
    Cant understand the translation can you give a summery of the post....
    Last edited by ajaidev; 08-24-2010 at 12:11 PM.
    Coming Soon

  10. #160
    Xtreme Mentor
    Join Date
    Jul 2008
    Location
    Shimla , India
    Posts
    2,631


    Bobcat article - http://www.brightsideofnews.com/news...-movement.aspx

    AMD Bobcat Core plan: Add an 80-core Cedar GPU and you have - Ontario


    80 shaders means it has double the cores of 4200/3200 and since i have overclocked a 4290 to 900mhz i know how well it performs. If this 80 core GPU is also clocked around 750-1000Mhz we may find quite some graphical horses under the bobcats bonnet...
    Coming Soon

  11. #161
    Xtreme Addict
    Join Date
    Mar 2010
    Posts
    1,079
    Quote Originally Posted by LesGrossman View Post
    Crysis 2 will support 8 cores, good luck with the single thread performance thing.
    Link?

  12. #162
    Xtreme Enthusiast
    Join Date
    Mar 2009
    Posts
    605
    lost planet 2 uses 12 threads, at 4.5Ghz it keeps all 12 above 70%
    Email/web browsing PC
    MM Ascension case, 24/7 settings
    Classified, 980x @ 4,525Mhz, HT on 1.39V
    (3) ATI 5870's@ 1062Mhz
    G skill perfect storm RAM s 6gb, 2,176 Mhz CAS 8
    (4) intel x25m RAID 0
    CPU loop, EK supreme HF, EK mobo block, 500MM bitspower res, 2 DDC 3.25's on XSPC top, 2 XSPC RX 360's, 1 XSPC RX 480, GPU loop, EK water blocks, with nickel backs, 2 swiftech mcr 320's, 500MM bitspower res, ddc 3.25, XSPC top all with 1850 RPM gentle typhoons
    plus many other rigs.

  13. #163
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by ajaidev View Post
    is that a joke? that's the most ridiculous floorplan i have ever seen. it looks like a map or a cloud or something.

  14. #164
    Registered User
    Join Date
    Dec 2008
    Location
    Chicago
    Posts
    49
    Quote Originally Posted by Chumbucket843 View Post
    is that a joke? that's the most ridiculous floorplan i have ever seen. it looks like a map or a cloud or something.
    Computer synthesized, only a few things are laid out by hand.

  15. #165
    Xtreme Addict
    Join Date
    Nov 2006
    Posts
    1,402
    i was thinking the same the 10 first second. Looks like a fake map on a ship

  16. #166
    Xtreme Addict
    Join Date
    Mar 2010
    Posts
    1,079
    Quote Originally Posted by justin.kerr View Post
    lost planet 2 uses 12 threads, at 4.5Ghz it keeps all 12 above 70%
    I just wanted to read more about it. You can assign CPU time to many tasks, some are useful and some are not.

  17. #167
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by deeperblue View Post
    Computer synthesized, only a few things are laid out by hand.
    synthesizers do not floorplan. generally modern logic blocks are 50-100K gates, probably around 300-600K transistors. this is a rather large chunk when the core itself, including L1 & L2 it is probably <20M transistors.

  18. #168
    Xtreme Addict
    Join Date
    Apr 2007
    Location
    canada
    Posts
    1,886
    LOL is descriptive of this thread
    WILL CUDDLE FOR FOOD

    Quote Originally Posted by JF-AMD View Post
    Dual proc client systems are like sex in high school. Everyone talks about it but nobody is really doing it.

  19. #169
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by god_43 View Post
    honestly wtf cares about single thread performance? geez if ppl cared about it, they would not be buying dual-cores even. the whole nature of multiple cores is for multi-threading, anything else is fail!
    http://en.wikipedia.org/wiki/Amdahl's_law

    a simple example:

    a task is 50% parallel, 50% serial.
    if i speed up the parallel part by 2x i increase performance by a factor of 1.33
    if i speed up the parallel part by 10x i increase performance by a factor of 1.81
    if i speed up the parallel part by 100x i increase performance by a factor of 1.98
    if i speed up the parallel part infinitely i increase performance by a factor of 2

  20. #170
    Xtreme Cruncher
    Join Date
    Jun 2008
    Location
    pacific NW usa
    Posts
    2,764
    Will BD DROP into current AM3 boards or not?
    _________________________________________________
    ............................ImAcOmPuTeRsPoNgE............................
    [SIGPIC][/SIGPIC]

    MY HEATWARE 76-0-0

  21. #171
    Xtreme Enthusiast
    Join Date
    Oct 2008
    Posts
    678
    Quote Originally Posted by justin.kerr View Post
    lost planet 2 uses 12 threads, at 4.5Ghz it keeps all 12 above 70%
    How do you know? In a true 12 core design do you think it would be over 70%? Maybe it uses four cores, and the HT-threads easily gets maxed out. Four cores is around 66% of a quad.

    My point is, having a hexa core being utilized 70% can only mean at least 4 cores. It could be 12, but there is no way for you to see it, and since it isn't 100%, I seems like it isn't using all cores.

    Quote Originally Posted by deeperblue View Post
    Computer synthesized, only a few things are laid out by hand.
    Finally, chips from AMD has always been nicely ordered, pointing at a mostly hand made layout. I can imagine that it leads to an uneven power usage and unnecessary long circuits and timings. And wastes die space.

  22. #172
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by -Boris- View Post
    Finally, chips from AMD has always been nicely ordered, pointing at a mostly hand made layout. I can imagine that it leads to an uneven power usage and unnecessary long circuits and timings. And wastes die space.
    lol, it is the exact opposite. hand layout is much better. humans are better at finding eulerian paths and coming up with clever layouts. computers cant really do that with all of the design rules and other parameters as effectively. the difference in performance is 2.6-7x faster with custom designed circuits.

    really what happens is a coder will simulate his module and make sure it reaches the targeted timing, which is usually much higher than actual delay to assure robust operation. if the logic cant reach the speed it is either rewritten or circuit designers optimize it. in certain logic families it must be entirely custom designed.

    circuits that are custom designed are usually things like power gating, clock distribution, and analog circuits such as pll's, dll's, and memory controllers/ io pads.

  23. #173
    Xtreme Addict
    Join Date
    Apr 2007
    Posts
    2,128
    L1D drop to one fourth could possibly indicate two things, inclusive cache hierarchy and in rare cases slightly higher clocks.

    Deeper pipelines then could further increase the clocks and simplify the design. I believe inclusive cache hierarchy would more than compensate for the loss in IPC, and if the BP is better, it could possibly even reduce the time spent in stalls vs. K10 with weaker BP but shorter pipelines.

    ...or then... The cache hierarchy remains exclusive and slow. Any benefit of exclusive cache is being lost due to smaller L1 compared to L2, and L2 compared to L3. Deeper pipeline reduces the IPC but doesn't allow much higher clocks to compensate for it. Aggressive BP isn't good enough to compensate deeper pipeline. More L1D misses. Speculative execution would lead to wrong decisions and L1I misses too often, greatly worsening the IPC in some cases while improving it only marginally in most cases. ...and GF messes with 32nm HighK SOI, effectively ruining the benefit of more cores in hope for better yields.

    Inclusive cache hierarchy would mean that AMD would be able to squeeze the gap between K10/Nehalem cache performance, and bring BD to SB levels of cache performance.

  24. #174
    Registered User
    Join Date
    Dec 2008
    Location
    Chicago
    Posts
    49
    Quote Originally Posted by Chumbucket843 View Post
    synthesizers do not floorplan. generally modern logic blocks are 50-100K gates, probably around 300-600K transistors. this is a rather large chunk when the core itself, including L1 & L2 it is probably <20M transistors.
    AMD says (http://www.youtube.com/watch?v=VIs1CxuUrpc)
    "Synthesizable with small number of custom arrays"
    Together with what was said before I think one of the main goals that AMD wants to achieve is to have easily customizable processors. Add a gpu core here, some cache there and another core here. From the slide it looks like lots of their process is already capable of being laid out by a computer.
    We have the caches, the integer units and the floating point units being the fixed hand optimized blocks with stuff like the x86 decode organically filling up the space in between. AMD also says it makes it easier to put the whole thing on a different process.

    I've only limited knowledge about modern synthesizing and floor planning from working with some FPGAs.
    Maybe Hans or somebody in the industry can say something about Bobcat?

  25. #175
    I am Xtreme
    Join Date
    Jul 2007
    Location
    Austria
    Posts
    5,485
    Only 1 1/2 hour till the other NDA on the presentation slides drop.

Page 7 of 39 FirstFirst ... 4567891017 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •