MMM
Page 6 of 29 FirstFirst ... 345678916 ... LastLast
Results 126 to 150 of 719

Thread: AMD cuts to the core with 'Bulldozer' Opterons

  1. #126
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by savantu View Post
    "Concerns about threshold voltage shifts and other performance problems with the gate-first approach to high-k/metal gate creation may cause GlobalFoundries (Sunnyvale, Calif.) and other members of the IBM-led Fishkill Alliance to shift to a gate-last technique, sources said at the International Electron Devices Meeting (IEDM), going on this week in Baltimore.

    "My understanding is that the subsequent thermal steps are causing problems with the gate-first approach," said a senior vice president at Qualcomm Corp. (San Diego) who was attending IEDM. "GlobalFoundries seeks a gate-last approach, and if necessary they could drop in a gate-last module independent of IBM," the Qualcomm executive said.
    http://www.semiconductor.net/article...igh_k-full.php

    Translation : yeah, it might work at 32nm, but obviously yields are down the drain. Exactly what foundries want to hear.
    It is one thing to show a few transistors or a small SRAM from R&D, it's a completely different thing to get a production process ready to roll. IBM utterly sucks at the later.
    Same site, same topic, different result:
    http://www.semiconductor.net/article...Generation.php

    Seems, we have to wait and watch how the story unfolds.
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  2. #127
    Xtreme Cruncher
    Join Date
    May 2009
    Location
    Bloomfield
    Posts
    1,968
    @dresdenboy. this is from your blog.
    Does someone have an idea, what EBO is?
    perhaps endian byte order?

  3. #128
    Xtreme Member
    Join Date
    Jul 2004
    Location
    Berlin
    Posts
    275
    Quote Originally Posted by Chumbucket843 View Post
    @dresdenboy. this is from your blog.

    perhaps endian byte order?
    Ah, OK. This riddle is already solved. Please have a look at the comments to my blog entry.
    Now on Twitter: @Dresdenboy!
    Blog: http://citavia.blog.de/

  4. #129
    Xtreme Addict
    Join Date
    Jun 2002
    Location
    Ontario, Canada
    Posts
    1,782
    Quote Originally Posted by Oliverda View Post
    Hey, someone has to assume the role of the optimist!

    Hopefully AMD is not going down the same road as Intel tried to with the P4 Netburst path or if they are, hopefully they've
    learned enough to not make the mistakes Intel did. I had a Prescott that I could literally heat up my computer room with.
    Last edited by freeloader; 01-27-2010 at 01:22 PM.

  5. #130
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by Macadamia View Post
    Hey lemme the same 1 "doom"y sentence from the same article written by the same source!

    I can be an anal-yst now ey? XD


    Oh BTW Intel can't even get the mobile Arrandales to perform properly (stuck at 1.8Ghz) on battery load, so much for static logic, high-k, and such a superior gate-first approach with smart core gating.
    IMC design is not good for battery performance

    while you shut down cpu, you cannot power down imc because (i)gpu needs it

  6. #131
    Xtreme Addict
    Join Date
    Jan 2009
    Posts
    1,445
    something about this design makes me believe that ipc is going to be really high? intel has 4 per cycle right? or do i have this mixed up with something else?
    [MOBO] Asus CrossHair Formula 5 AM3+
    [GPU] ATI 6970 x2 Crossfire 2Gb
    [RAM] G.SKILL Ripjaws X Series 16GB (4 x 4GB) 240-Pin DDR3 1600
    [CPU] AMD FX-8120 @ 4.8 ghz
    [COOLER] XSPC Rasa 750 RS360 WaterCooling
    [OS] Windows 8 x64 Enterprise
    [HDD] OCZ Vertex 3 120GB SSD
    [AUDIO] Logitech S-220 17 Watts 2.1

  7. #132
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by god_43 View Post
    something about this design makes me believe that ipc is going to be really high? intel has 4 per cycle right? or do i have this mixed up with something else?
    the max. ipc per module (dual-core):
    INT: 4
    64-bit FP: 4
    128-bit FP: 2
    256-bit FP: 1

    Total issue: 4



    if the ALU and FPU are clocked as the same
    Last edited by qcmadness; 01-27-2010 at 11:24 PM.

  8. #133
    Xtreme Addict
    Join Date
    Jan 2009
    Posts
    1,445
    Quote Originally Posted by qcmadness View Post
    the max. ipc per module (dual-core):
    INT: 4
    64-bit FP: 4
    128-bit FP: 2
    256-bit FP: 1

    if the ALU and FPU are clocked as the same
    oh.......that is certainly interesting? is this on par with what intel has now then?
    [MOBO] Asus CrossHair Formula 5 AM3+
    [GPU] ATI 6970 x2 Crossfire 2Gb
    [RAM] G.SKILL Ripjaws X Series 16GB (4 x 4GB) 240-Pin DDR3 1600
    [CPU] AMD FX-8120 @ 4.8 ghz
    [COOLER] XSPC Rasa 750 RS360 WaterCooling
    [OS] Windows 8 x64 Enterprise
    [HDD] OCZ Vertex 3 120GB SSD
    [AUDIO] Logitech S-220 17 Watts 2.1

  9. #134
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by god_43 View Post
    oh.......that is certainly interesting? is this on par with what intel has now then?
    Not much info for Sandy Bridge except 256-bit AVX extensions (SSEx is 128-bit)

    Nehalem / Westmere:


    IPC:
    INT: 3 (+2)
    FP / SSE: 3

    Total issue: 4
    Last edited by qcmadness; 01-27-2010 at 11:25 PM.

  10. #135
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    According to AMD's Rick Bergam,each core inside the module is 4 issue wide,so at the module level INT capability is 2x of that,while FP is also 4 issue with 256bit AVX and FMAC capability(FMAC practically stomps anything we have today in x86 world as far as fp/sse stuff goes).

  11. #136
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by informal View Post
    According to AMD's Rick Bergam,each core inside the module is 4 issue wide,so at the module level INT capability is 2x of that,while FP is also 4 issue with 256bit AVX and FMAC capability(FMAC practically stomps anything we have today in x86 world as far as fp/sse stuff goes).
    http://www.anandtech.com/cpuchipsets...oc.aspx?i=3674
    Within each integer “core” are four pipelines, presumably half for ALUs and half for memory ops. That’s a narrower width than a single Phenom II core, but there are two integer clusters on a single Bulldozer core.
    Currently each K10 integer pipeline can do both ALU and memory ops.

    Last edited by qcmadness; 01-28-2010 at 12:49 AM.

  12. #137
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Note the key word in AT's speculative article:
    Within each integer “core” are four pipelines, presumably half for ALUs and half for memory ops.
    In other words he presumes. You have a diagram from AMD stating 4 pipelines(non segmented into ALUs and AGUs). I'm not saying it's 4 issue being made of grouped 4 ALUs/AGUs ,it may be 2 issue with throughput of 4 issue(look at dresdenboy's blog),but I'm still sticking to the image and the post analyst call interview with Rick Bergam who stated that each core inside the module is a 4 issue core.

    PS Note that in the diagram of 10h you posted,the ALUs and AGUs are grouped together,in pairs... 3 issue made up of 3 groups of ALUs and AGUs. Very same methodology may apply to BD cores .
    Last edited by informal; 01-28-2010 at 01:15 AM.

  13. #138
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by informal View Post
    Note the key word in AT's speculative article:

    In other words he presumes. You have a diagram from AMD stating 4 pipelines(non segmented into ALUs and AGUs). I'm not saying it's 4 issue being made of grouped 4 ALUs/AGUs ,it may be 2 issue with throughput of 4 issue(look at dresdenboy's blog),but I'm still sticking to the image and the post analyst call interview with Rick Bergam who stated that each core inside the module is a 4 issue core.

    PS Note that in the diagram of 10h you posted,the ALUs and AGUs are grouped together,in pairs... 3 issue made up of 3 groups of ALUs and AGUs. Very same methodology may apply to BD cores .
    speculations here

    if each core in the module has 4 independent ALU and AGUs, the core will be "swelled" and may get too big.

    my image for 2 ALU arguments comes from
    1. post above (average integer IPC is about 2)
    2. bobcat diagram (very similar to half of a bulldozer module)


    if the architecture is for server space, then it sounds possible to have only 2 ALU per core as each thread for server application is of low IPC but highly-threaded (>24 thread).

  14. #139
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by qcmadness View Post
    speculations here

    if each core in the module has 4 independent ALU and AGUs, the core will be "swelled" and may get too big.

    my image for 2 ALU arguments comes from
    1. post above (average integer IPC is about 2)
    2. bobcat diagram (very similar to half of a bulldozer module)


    if the architecture is for server space, then it sounds possible to have only 2 ALU per core as each thread for server application is of low IPC but highly-threaded (>24 thread).
    BD and bobcat were developed by 2 different teams with 2 different goals in mind... While they do share some common ideas,Bobcat is targeted at very low power and good mainstream performance ,while BD is targeted for very high performance with advanced power gating technologies in place to control the power draw in various scenarios. Note also that Bobcat has no support for various ISA extensions BD will support and has a much weaker fp unit with no FMAC capability(which is normal for low power design).

    As for the "swelled" comment,you seem to forget that BD is going to be produced @ 32nm node,so no,the cores won't be that big in any case. The module itself is probably even going to be smaller than one SB core(with SMT),courtesy of the way AMD designed the very BD module ...
    Last edited by informal; 01-28-2010 at 01:53 AM.

  15. #140
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Hong Kong
    Posts
    526
    Quote Originally Posted by informal View Post
    BD and bobcat were developed by 2 different teams with 2 different goals in mind... While they do share some common ideas,Bobcat is targeted at very low power and good mainstream performance ,while BD is targeted for very high performance with advanced power gating technologies in place to control the power draw in various scenarios. Note also that Bobcat has no support for various ISA extensions BD will support and has a much weaker fp unit with no FMAC capability(which is normal for low power design).

    As for the "swelled" comment,you seem to forget that BD is going to be produced @ 32nm node,so no,the cores won't be that big in any case. The module itself is probably even going to be smaller than one SB core(with SMT),courtesy of the way AMD designed the very BD module ...
    even with 32nm, the BD core is not small though.

    Shanghai / Deneb single core: ~19mm^2 / 55M transistor
    Calculated BD module: ~29mm^2 / 165M transistor (assuming 50% die shrink, ~150% die size claimed from AT's communication with AMD)

    Packing 4 modules with 8MB L3 cache, you still get about 180mm^2 die size.
    Last edited by qcmadness; 01-28-2010 at 02:06 AM.

  16. #141
    Xtreme Addict
    Join Date
    Nov 2006
    Posts
    1,402
    i think the two arrow on the diagram are very clear.

    AMD is doing big brute force ultra wide pipelines for BD cores.

  17. #142
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Singapore
    Posts
    970
    Quote Originally Posted by qcmadness View Post
    even with 32nm, the BD core is not small though.

    Shanghai / Deneb single core: ~19mm^2 / 55M transistor
    Calculated BD module: ~29mm^2 / 165M transistor (assuming 50% die shrink, ~150% die size claimed from AT's communication with AMD)

    Packing 4 modules with 8MB L3 cache, you still get about 180mm^2 die size.

    don't rely to a SINGLE source only
    Main Rig:
    Processor & Motherboard:AMD Ryzen5 1400 ' Gigabyte B450M-DS3H
    Random Access Memory Module:Adata XPG DDR4 3000 MHz 2x8GB
    Graphic Card:XFX RX 580 4GB
    Power Supply Unit:FSP AURUM 92+ Series PT-650M
    Storage Unit:Crucial MX 500 240GB SATA III SSD
    Processor Heatsink Fan:AMD Wraith Spire RGB
    Chasis:Thermaltake Level 10GTS Black

  18. #143
    Xtreme Member
    Join Date
    Oct 2009
    Location
    Bucharest, Romania
    Posts
    381
    Quote Originally Posted by qcmadness View Post
    even with 32nm, the BD core is not small though.

    Shanghai / Deneb single core: ~19mm^2 / 55M transistor
    Calculated BD module: ~29mm^2 / 165M transistor (assuming 50% die shrink, ~150% die size claimed from AT's communication with AMD)

    Packing 4 modules with 8MB L3 cache, you still get about 180mm^2 die size.
    1 BD module = 2 Deneb cores.


    So, 165 M vs 110 M for 2 deneb cores, 29mm vs 38mm. Performance should be higher as well, so i think they are doing a good job.

  19. #144
    Xtreme Addict
    Join Date
    Jan 2005
    Posts
    1,730
    Quote Originally Posted by Particle View Post
    That's a pretty bold statement given the current indications that Bulldozers are heading for an early rather than late release versus their official late 2010 release date.
    How exactly is it early when AMD slides themselves point out it is a 2011 product. Not only that but assuming perfect execution by AMD ( which I really doubt ), everything is at the mercy of GF. Which btw, has some issues with process technology.
    Quote Originally Posted by Heinz Guderian View Post
    There are no desperate situations, there are only desperate people.

  20. #145
    Xtreme Member
    Join Date
    Oct 2009
    Location
    Bucharest, Romania
    Posts
    381
    Savantu, does intel pay your bills? Does it make breakfast for you? Because from how you post, it looks that way.

    Somehow, you want AMD to fail, you want them to be late with this product. Nothing from what we know about bulldozer can make us think they will be late, they seem to move fast into the right direction.

  21. #146
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by savantu View Post
    How exactly is it early when AMD slides themselves point out it is a 2011 product. Not only that but assuming perfect execution by AMD ( which I really doubt ), everything is at the mercy of GF. Which btw, has some issues with process technology.
    AMD is targeting BD class cores for desktop and server for Q2 next year.Think Thuban time frame + 1 year away(Thuban is May this year).

  22. #147
    Xtreme Enthusiast
    Join Date
    Oct 2007
    Location
    Singapore
    Posts
    970
    Quote Originally Posted by savantu View Post
    How exactly is it early when AMD slides themselves point out it is a 2011 product. Not only that but assuming perfect execution by AMD ( which I really doubt ), everything is at the mercy of GF. Which btw, has some issues with process technology.

    the AMD today is not the AMD you knew yesterday
    lolz
    Main Rig:
    Processor & Motherboard:AMD Ryzen5 1400 ' Gigabyte B450M-DS3H
    Random Access Memory Module:Adata XPG DDR4 3000 MHz 2x8GB
    Graphic Card:XFX RX 580 4GB
    Power Supply Unit:FSP AURUM 92+ Series PT-650M
    Storage Unit:Crucial MX 500 240GB SATA III SSD
    Processor Heatsink Fan:AMD Wraith Spire RGB
    Chasis:Thermaltake Level 10GTS Black

  23. #148
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    BD cores get sported by HWinfo app.
    Gen. 15h,code name Bulldozer. Since BD samples are in AMD's hands,it's logical they get support from some HW identification app. out there.

    http://www.hwinfo.com/

    Intel Nehalem family: Core i7 (Bloomfield), Xeon 55xx (Nehalem-EP/Gainestown), Xeon W35xx (Nehalem-WS), Nehalem-EX (Beckton), Core i5/i7 (Lynnfield), Core i7 (Clarksfield), Havendale, Auburndale, Xeon (Jasper Forest)
    Intel Westmere family: Core i3/i5 Clarkdale, Core i5/i7 Arrandale (+ECC), Westmere-EP, Westmere-WS, Gulftown, Westmere-EX (Eagleton)
    Intel EP80579 (Tolapai), Atom Z5xx (Silverthorne), Atom N2xx/2xx/3xx (Diamondville), Atom N4xx/D4xx/D5xx (Pineview), Atom (Lincroft)
    Intel Sandy Bridge family
    AMD Family 10h (DR-Ax/Bx, RB-Cx, BL-Cx, DA-Cx, HY-Dx, PH-Ex):
    - AM2r2/AM3: Athlon/Sempron (Lima, Sargas), Athlon II/Neo (Kuma, Regor, Rana, Propus), Phenom II/Neo (Agena, Deneb, Callisto, Heka, Toliman), TWKR Black Edition
    - Fr2/Fr4: Opteron (Barcelona, Shanghai, Suzuka, Budapest, Istanbul)
    - G34: Opteron 61xx (Magny Cours)
    - S1g3/g4: Turion II/Ultra, Athlon II, Phenom II
    - ASB2: Athlon II Neo K, Turion II Neo K
    AMD Family 11h (LG-Ax/Bx): S1g2: Sempron SI/NI/X2, Athlon QI/QL (Sable), Turion X2 Ultra/RM (Griffin)
    AMD Family 12h (FS1)
    AMD Family 15h: Bulldozer
    VIA C7 and Nano

  24. #149
    Xtreme Addict
    Join Date
    Jan 2009
    Posts
    1,445
    intel must know about bulldozers supposed capabilities already eh? i mean intel must have like spy's inside amd (corporate espionage is a big deal you know) im sure? should amd have released any info about these new cpus? is sandyB a new arch, or is it a refresh of nehielem? wont intel be able to counter amd with this info? and why is it snowing out side?

    sometimes i think about these things...
    [MOBO] Asus CrossHair Formula 5 AM3+
    [GPU] ATI 6970 x2 Crossfire 2Gb
    [RAM] G.SKILL Ripjaws X Series 16GB (4 x 4GB) 240-Pin DDR3 1600
    [CPU] AMD FX-8120 @ 4.8 ghz
    [COOLER] XSPC Rasa 750 RS360 WaterCooling
    [OS] Windows 8 x64 Enterprise
    [HDD] OCZ Vertex 3 120GB SSD
    [AUDIO] Logitech S-220 17 Watts 2.1

  25. #150
    Xtreme Mentor
    Join Date
    Jul 2008
    Location
    Shimla , India
    Posts
    2,631
    Amm Intel did delay Sandy bridge and they did it to try and implement something which SB does not have and BD does. Tough the implementation is very hard to accomplish i have been told 3-4 teams are working on it right now, one may call it the v1.1 the v1.0 of SB is all ready for the fight....

    If the v1.1 works out it will get relesed in stead of the v1.0 otherwise v1.0 will be used, in tis case IB will come sooner than later.
    Coming Soon

Page 6 of 29 FirstFirst ... 345678916 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •