Page 4 of 17 FirstFirst 123456714 ... LastLast
Results 76 to 100 of 403

Thread: AMD to Disclose Details About Bulldozer Micro-Architecture in August

  1. #76
    Xtreme Enthusiast
    Join Date
    Aug 2008
    Location
    East Troy Wisconsin
    Posts
    707
    Quote Originally Posted by Mad Pistol View Post
    I think AMD is on the right track. They've posted Quarterly profits I think for the first time in years this past quarter (correct me if I'm wrong on that. they have made a profit though) so that shows that they're doing something right. Obviously, it's chump change compared to Intel's earnings, but they are doing fine. AMD/ATI's R&D budget is much smaller than Intel's or Nvidia's, so that means they're being smart about what they're doing. The Radeon cards this generation have been fantastic. Who's to say that Bulldozer won't be the same way?
    Agreed, I hope Bulldozer is the same. One thing we can all agree on: the more competition the better it is for the consumer.
    HAF922 (3 fan upgraded) 700w ModXtreme
    MSI GD-65 // HIS 5850 // Saitek Esclipse
    Core i5 750 @ 3.32Ghz (OC Genie) Megahalem w/ 1200rpm Scythe Slipstream
    G. Skill RipJaws DDR3 2000 4x2 gig
    Samsung SpinPoint F3 1TB // 5:1 Dell Surround
    24inch Samsung ToC // MX518

  2. #77
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by -Sweeper_ View Post
    looks like they wont talk about clock speeds

    150+% of the performance with 133% the number of cores

    thats 12,7+% more performance per core, something easily achievable with higher clocks (magny cours works at a modest 2.3ghz)
    That figure appears to come from an AMD chart focusing on one projection, "Floating Point performance".

    It shows MC at "28", and Interlagos at "43" (at max, the line fades starting around 40). 43% to 53% improvement.

    But given that Interlagos/BD has AVX (including FMA), this really isn't all that impressive, is it?

    "Integer performance" on the same chart goes from "29" to "36-38.5" (from start fade to end fade)

    That's a 24% to 33% improvement for 33% more cores.

    Hmmm, doesn't really look like much single-threaded improvement there.

    In sum:

    From -7% to 0% performance loss per core on "Integer"
    From 7.5% to 15% performance gain per core on "Floating point" -- and that's with AVX!

    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"

    Other than AVX helping FP perf, somewhat less than expected, I don't see any single-threaded gains from MC --> BD, not based on these performance projections, anyhow.

  3. #78
    Banned
    Join Date
    May 2006
    Location
    Brazil
    Posts
    580
    Quote Originally Posted by terrace215 View Post
    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"
    thats true
    Last edited by -Sweeper_; 06-23-2010 at 12:50 PM.

  4. #79
    Banned
    Join Date
    Jan 2010
    Posts
    263
    Quote Originally Posted by terrace215 View Post
    That figure appears to come from an AMD chart focusing on one projection, "Floating Point performance".

    It shows MC at "28", and Interlagos at "43" (at max, the line fades starting around 40). 43% to 53% improvement.

    But given that Interlagos/BD has AVX (including FMA), this really isn't all that impressive, is it?

    "Integer performance" on the same chart goes from "29" to "36-38.5" (from start fade to end fade)

    That's a 24% to 33% improvement for 33% more cores.

    Hmmm, doesn't really look like much single-threaded improvement there.

    In sum:

    From -7% to 0% performance loss per core on "Integer"
    From 7.5% to 15% performance gain per core on "Floating point" -- and that's with AVX!

    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"

    Other than AVX helping FP perf, somewhat less than expected, I don't see any single-threaded gains from MC --> BD, not based on these performance projections, anyhow.
    About Bulldozer and single thread performance though, it seems AMD is going to take a page from Intel by employing an even more aggressive core boosting strategy (ala SB) in single thread scenarios. Which means the focus may not be so much on ipc tweaks, but rather ultra high core frequency boosting, a scenario that would be greatly helped by power-gating to ensure the chip stays within its tdp limits.

  5. #80
    Xtreme Addict
    Join Date
    Apr 2006
    Location
    City of Lights, The Netherlands
    Posts
    2,381
    Quote Originally Posted by terrace215 View Post
    That figure appears to come from an AMD chart focusing on one projection, "Floating Point performance".

    It shows MC at "28", and Interlagos at "43" (at max, the line fades starting around 40). 43% to 53% improvement.

    But given that Interlagos/BD has AVX (including FMA), this really isn't all that impressive, is it?

    "Integer performance" on the same chart goes from "29" to "36-38.5" (from start fade to end fade)

    That's a 24% to 33% improvement for 33% more cores.

    Hmmm, doesn't really look like much single-threaded improvement there.

    In sum:

    From -7% to 0% performance loss per core on "Integer"
    From 7.5% to 15% performance gain per core on "Floating point" -- and that's with AVX!

    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"

    Other than AVX helping FP perf, somewhat less than expected, I don't see any single-threaded gains from MC --> BD, not based on these performance projections, anyhow.
    http://www.anandtech.com/show/2879/3

    A 16 core (8 modules) Bulldozer based CPU is 60 to 80% faster than a 12 core Magny Cours in integer performance when using SPECInt_rate as a benchmark according to Anandtech.
    "When in doubt, C-4!" -- Jamie Hyneman

    Silverstone TJ-09 Case | Seasonic X-750 PSU | Intel Core i5 750 CPU | ASUS P7P55D PRO Mobo | OCZ 4GB DDR3 RAM | ATI Radeon 5850 GPU | Intel X-25M 80GB SSD | WD 2TB HDD | Windows 7 x64 | NEC EA23WMi 23" Monitor |Auzentech X-Fi Forte Soundcard | Creative T3 2.1 Speakers | AudioTechnica AD900 Headphone |

  6. #81
    Xtreme Addict
    Join Date
    Jan 2005
    Posts
    1,730
    Quote Originally Posted by cegras View Post
    What are you talking about? The fastest P4 released was 3.4 ghz (from wiki?), while core i7's under turbo (e.g. single core to single core) can approach that speed. You seem to have handily ignored things like transistor shrinkage as well as better design ...
    I'm talking about design philosophy. You're comparing a product that reached 3.4GHz in 130nm with Nehalem which reaches 3.5 with 45nm.
    It's not relevant the actual speed P4 reached ( not to mention it was held back in 65nm ), it's a product designed in the late '90s. Nehalem was done by the same time that did Netburst and it took 6 years; obviously, all they've learned with Netburst was put to good use.

    And to sum it up : my point is that you can either aim for frequency or for IPC. The middle path are designs like Core/I7/K10 and possibly Bulldozer. I don't expect any wonders from either be it in IPC or frequency.
    Quote Originally Posted by Heinz Guderian View Post
    There are no desperate situations, there are only desperate people.

  7. #82
    Xtreme Addict
    Join Date
    Jan 2005
    Posts
    1,730
    Quote Originally Posted by Helmore View Post
    That's not what I said. I said, lower IPC than Phenom II while clocking higher and Phenom II has a lower IPC than Core 2 AFAIK.
    Even so, it has a lower frequency, thus it has other bottlenecks in the design.

    You could also go for a more hybrid approach, like double clocking those parts in a core that make sense. Clock domains within a single core in other words. You could for example run the schedulers and execution units at double the clockspeed of the fetch and decode stage. I'm not saying they will, but it's another approach.
    Nothing new here; Pentium 4 did this back in 2000. The integer core was clocked 2x the core clock. Ultimately it ended badly, altough they tried every technique, low swing circuits, domino logic, running something at 6-8GHz means a lot of power used and performance per watt is poor.
    Quote Originally Posted by Heinz Guderian View Post
    There are no desperate situations, there are only desperate people.

  8. #83
    I am Xtreme
    Join Date
    Jul 2007
    Location
    Austria
    Posts
    5,485
    Quote Originally Posted by terrace215 View Post

    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"
    Well my nickname for him is mister negative, and that for a good reason.

  9. #84
    Banned
    Join Date
    Jun 2009
    Posts
    348
    Quote Originally Posted by OhNoes! View Post
    it seems AMD is going to take a page from Intel by employing an even more aggressive core boosting strategy (ala SB) in single thread scenarios.
    amd copying intel? really.
    Attached Thumbnails Attached Thumbnails Click image for larger version. 

Name:	160288d1276731686-amd-caught-off-guard-high-demand-2lstd.png 
Views:	681 
Size:	188.0 KB 
ID:	105734  

  10. #85
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by Helmore View Post
    http://www.anandtech.com/show/2879/3

    A 16 core (8 modules) Bulldozer based CPU is 60 to 80% faster than a 12 core Magny Cours in integer performance when using SPECInt_rate as a benchmark according to Anandtech.
    Also keep in mind that terrace based all of his math on an old,vague chart that has fading bars....

  11. #86
    Xtreme Enthusiast
    Join Date
    Jun 2006
    Location
    Space
    Posts
    769
    Quote Originally Posted by JF-AMD View Post
    All I have said is that bulldozer will be faster than current products. I have not made any clock speed statements.

    The statement that I made was that Interlagos would have 33% more cores and will be 50%+ faster than Magny Cours. If you are more than 50% faster with 33% more cores, then your "per core" performance is faster. That is the only statement that we will make on performance.
    Sounds to me like the IPC is better.

  12. #87
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by wuttz View Post
    amd copying intel? really.
    lol. intel copied amd by having LSU's!

    integrated FPU, TLB, IMC, OoO, register renaming, superscalar pipeline, branch prediction, integrated L2 cache, L3 cache, among others are all things that intel did first (for x86) that amd has used.

    on a more serious note intel doesnt even need to beat amd's uarch. they have the best process and circuit design teams in the world.

  13. #88
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by informal View Post
    Also keep in mind that terrace based all of his math on an old,vague chart that has fading bars....
    Well, I mean, what can I say? It is AMD's own chart from 6 months back.

    I'll stick with AMD's official figures over Johan's tidbits, especially when the latter date (and source) from the SAME time of the "5% extra die size for 80% performance gain" nonsense that was later clarified.

    Unless JF-AMD wants to reiterate any performance claim that is different from what that AMD slide shows?

    JF, is Interlagos really 60-80% better than MC (12-core) in specInt_rate, despite the AMD slide showing "integer performance" is only 24-33% better?

    Or was your comment to Johan in error?
    Last edited by terrace215; 06-23-2010 at 05:14 PM.

  14. #89
    I am Xtreme
    Join Date
    Jul 2007
    Location
    Austria
    Posts
    5,485
    Quote Originally Posted by Chumbucket843 View Post
    on a more serious note intel doesnt even need to beat amd's uarch. they have the best process and circuit design teams in the world.
    Well it only can fix that much... we have seen how it turned out for the P4, the process was good (as the pentium-m has proven) but if the cpu design itself isn't up to the task, the best process can't help you.
    Last edited by Hornet331; 06-23-2010 at 05:17 PM.

  15. #90
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by terrace215 View Post
    Well, I mean, what can I say? It is AMD's own chart from 6 months back.

    I'll stick with AMD's official figures over Johan's tidbits, especially when they date (and source) from the time of the "5% extra die size for 80% performance gain" nonsense that was later "clarified".
    You can stick with whatever you like,but if it ends up like Johan said(or even better ) ,it won't matter. JF already stated that the "jump" will be very similar if not better to what we have got with Istanbul->MC transition.

  16. #91
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Singapore
    Posts
    970
    Quote Originally Posted by informal View Post
    You can stick with whatever you like,but if it ends up like Johan said(or even better ) ,it won't matter. JF already stated that the "jump" will be very similar if not better to what we have got with Istanbul->MC transition.

    core to core performance, how much does MC gains over Istanbul?
    I thought both were K10.5 micro-architecture?
    Main Rig:
    Processor & Motherboard:AMD Ryzen5 1400 ' Gigabyte B450M-DS3H
    Random Access Memory Module:Adata XPG DDR4 3000 MHz 2x8GB
    Graphic Card:XFX RX 580 4GB
    Power Supply Unit:FSP AURUM 92+ Series PT-650M
    Storage Unit:Crucial MX 500 240GB SATA III SSD
    Processor Heatsink Fan:AMD Wraith Spire RGB
    Chasis:Thermaltake Level 10GTS Black

  17. #92
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by informal View Post
    You can stick with whatever you like,but if it ends up like Johan said(or even better ) ,it won't matter. JF already stated that the "jump" will be very similar if not better to what we have got with Istanbul->MC transition.
    You know, I've been looking more at that chart from AMD.

    I've got a very close fit for the Y-axis:

    **** The base results for specInt_rate and specFP_rate, for 2-socket systems with the noted processors, divided by 10. ****

    Look them up. They are dead on for the 2009 "2435 Istanbul 2-socket", and very close for 2008 and 2007 as well.

    Now, you say, but 2010, MC is better than this: the chart would give:

    290 for int_rate (base), 280 for fp_rate for a 2-socket 2.3 MC system.

    When we look we find: 309 int, 290 fp.

    But recall that JF likes to say that they over-delivered with MC vs what was promised... so I think this is ok.

    If I got it right, this chart from AMD calls for

    =================================
    Interlagos top-bin 2-socket system:

    SpecInt_rate(base): 360-390
    SpecFP_rate(base): 400-430

    (lower numbers are where the fade starts, upper is end of bar)

    =================================

    The upper end would amount to an FP improvement of 48% (thank you, AVX), and integer is 390/309 = 26%.

    Note that per-core, this is 148/133 = 11% better SpecFP_rate, but about 5% worse on SpecInt_rate.

    It would make sense that these charts would be some form of SpecInt/FP rates, and base is easier to project than peak, and it must be 2-socket (or 1, but that doesn't make much sense) systems from the 2xx initial parts chosen.


    --------------------

    Anyhow, given that I now think this chart is giving spectInt/FP_rate projections, the Johan specInt_rate tidbit from JF is completely at odds with this chart, and as they were both put out at the same time... gotta think the chart stands unless JF wants to (re-)claim otherwise.
    Last edited by terrace215; 06-23-2010 at 06:13 PM.

  18. #93
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Johan got his information directly from AMD. The chart has fading bars and AMD(JF) already stated that only they know how high the bars actually go(that's the purpose of the fading btw,to not actually disclose the true perf. projection). You are reading waaaay to much into that chart,especially knowing that AMD couldn't possibly predict the clock speeds they would milk from the BD silicon at the time they made the chart. 60-80% uplift from MC is a good bet,but seeing how AMD delivered and over-delivered with Shanghai,Istanbul and especially MC,you can bet they will do all they can to over-deliver with BD when it launches.

    edit:
    a question : why are you so obsessed with AMD,BD perfromance/tapeout and 2011? Any chance you're an intel shareholder ?
    Last edited by informal; 06-23-2010 at 06:16 PM.

  19. #94
    Banned
    Join Date
    Jul 2004
    Posts
    1,125
    Quote Originally Posted by informal View Post
    Johan got his information directly from AMD. The chart has fading bars and AMD(JF) already stated that only they know how high the bars actually go(that's the purpose of the fading btw,to not actually disclose the true perf. projection). You are reading waaaay to much into that chart,especially knowing that AMD couldn't possibly predict the clock speeds they would milk from the BD silicon at the time they made the chart. 60-80% uplift from MC is a good bet,but seeing how AMD delivered and over-delivered with Shanghai,Istanbul and especially MC,you can bet they will do all they can to over-deliver with BD when it launches.
    Johan got his info from JF, at the same time of the 5%-die-size thing, the same day (or so) that AMD released this chart. Hence the question to JF.

    ***EDIT: Could it be that it was just a misread? What *is* 80% better on SpecInt_rate (per that chart) is MC over Istanbul. (rather than Interlagos over MC, which the chart shows at 35%)

    AMD can make a reasonable stab at Interlagos clocks... remember that power is what is really gating things here. (more so than with a Zambezi 1-die part) But I agree there's a bin or so of "not sure", which is why the bars fade out.

    I'm sure AMD will try to over-deliver, my point is merely that the chart shows BD relative to a slightly-worse-than-reality version of MC.

    The numbers are interesting:

    With int_rate, both Nehalem-EX and Westmere are already at the low-end of BD's projected range, so I think Westmere-EX (25% core increase, higher clocks), and also SB (33% core increase, new arch, higher mem bandwidth) will have no trouble maintaining dominance here.

    With fp_rate, Intel has a lot further to go to catch a (2-socket) 400-430 SpecFP_rate(base). But presumably this is where AVX comes in, as well as more cores/bandwidth.

    For single-to-low-threaded stuff, I expect Intel will win across the board, probably substantially.

    edit: obsessed? Isn't the whole point of these boards/threads speculation? Some people find it fun, you know. It's a challenge trying to decode these AMD performance projection slides, but the results can be informative, no?
    Last edited by terrace215; 06-23-2010 at 07:28 PM.

  20. #95
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    Quote Originally Posted by Hornet331 View Post
    Well it only can fix that much... we have seen how it turned out for the P4, the process was good (as the pentium-m has proven) but if the cpu design itself isn't up to the task, the best process can't help you.
    anecdote:
    back then their CEO was craig barret and he used to be an engineer for intel who started working for them in the 70's. he was on the materials side of things so he pushed process over arch. netburst uarch was a really bad idea from the start. even researchers then new about future power issues. ever since presscot/tejas intel has focused on making a good uarch.

    my point is that intel will almost always be ahead in process/physical design. amd can match or beat them in uarch but the only realistic way intel would lose is to fall behind in uarch (a la netburst). and fwiw hand optimized circuits can be up to 7x more power efficient over a synthesized counterpart.

  21. #96
    Xtreme Member
    Join Date
    Jan 2007
    Location
    Over the mountains and down in the valley
    Posts
    479
    At the end of the day who ever can encode my video faster wins. Currently this is Intel and has been for several years now.

    And if one person responds to this and mentions something like badaboom I'm going to !
    Quote Originally Posted by Manicdan View Post
    using a OCed quad for torrenting is like robbing your local video store with a rocket launcher.

  22. #97
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Singapore
    Posts
    970
    Quote Originally Posted by saint-francis View Post
    At the end of the day who ever can encode my video faster wins. Currently this is Intel and has been for several years now.

    And if one person responds to this and mentions something like badaboom I'm going to !
    hm....Jaguar for u?
    Main Rig:
    Processor & Motherboard:AMD Ryzen5 1400 ' Gigabyte B450M-DS3H
    Random Access Memory Module:Adata XPG DDR4 3000 MHz 2x8GB
    Graphic Card:XFX RX 580 4GB
    Power Supply Unit:FSP AURUM 92+ Series PT-650M
    Storage Unit:Crucial MX 500 240GB SATA III SSD
    Processor Heatsink Fan:AMD Wraith Spire RGB
    Chasis:Thermaltake Level 10GTS Black

  23. #98
    Xtreme Enthusiast
    Join Date
    Dec 2009
    Posts
    846
    Quote Originally Posted by -Sweeper_ View Post
    looks like they wont talk about clock speeds
    Nobody talks about clock speeds before lanuch.
    While I work for AMD, my posts are my own opinions.

    http://blogs.amd.com/work/author/jfruehe/

  24. #99
    Xtreme Enthusiast
    Join Date
    Dec 2009
    Posts
    846
    Quote Originally Posted by terrace215 View Post
    That figure appears to come from an AMD chart focusing on one projection, "Floating Point performance".

    It shows MC at "28", and Interlagos at "43" (at max, the line fades starting around 40). 43% to 53% improvement.

    But given that Interlagos/BD has AVX (including FMA), this really isn't all that impressive, is it?

    "Integer performance" on the same chart goes from "29" to "36-38.5" (from start fade to end fade)

    That's a 24% to 33% improvement for 33% more cores.

    Hmmm, doesn't really look like much single-threaded improvement there.

    In sum:

    From -7% to 0% performance loss per core on "Integer"
    From 7.5% to 15% performance gain per core on "Floating point" -- and that's with AVX!

    Can you imagine the outcry from saaya were SB to compare to Westmere in such a way? I can hear it now... "Epic Fail!!!!!111"

    Other than AVX helping FP perf, somewhat less than expected, I don't see any single-threaded gains from MC --> BD, not based on these performance projections, anyhow.

    No, you are completely wrong here.


    PLEASE DELETE THAT CHART.

    I have explained several times:

    1. The chart was drawn in powerpoint, the chart was not done in excel where you would have exact numbers
    2. The chart uses a fade to purposely hide the actual performance estimates because we were not making actual estimates at the time.

    Anyone that obsesses about that chart would also notice that the Magny Cours performance increase over Istanbul was also underestimated.

    If you want to refer to any performance estimate for bulldozer, there is one official one: 50% greater total throughput than Magny Cours.

    We won't be saying anything else for the forseeable future.

    Any other guess, no matter how complicated the math or methodology, will be wrong.
    Last edited by JF-AMD; 06-23-2010 at 09:41 PM.
    While I work for AMD, my posts are my own opinions.

    http://blogs.amd.com/work/author/jfruehe/

  25. #100
    Xtreme Enthusiast
    Join Date
    Aug 2008
    Posts
    577
    Quote Originally Posted by JF-AMD View Post
    If you want to refer to any performance estimate for bulldozer, there is one official one: 50% greater total throughput than Magny Cours.
    Mmmm juicy.
    --Intel i5 3570k 4.4ghz (stock volts) - Corsair H100 - 6970 UL XFX 2GB - - Asrock Z77 Professional - 16GB Gskill 1866mhz - 2x90GB Agility 3 - WD640GB - 2xWD320GB - 2TB Samsung Spinpoint F4 - Audigy-- --NZXT Phantom - Samsung SATA DVD--(old systems Intel E8400 Wolfdale/Asus P45, AMD965BEC3 790X, Antec 180, Sapphire 4870 X2 (dead twice))

Page 4 of 17 FirstFirst 123456714 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •