Page 2 of 48 FirstFirst 1234512 ... LastLast
Results 26 to 50 of 1198

Thread: AMD "Piledriver" refresh of Zambezi - info, speculations, test, fans

  1. #26
    Registered User
    Join Date
    Apr 2010
    Posts
    84
    I am surprised that no one has talked about this more in regards to power consumption and also clocking ability. So here goes..

    First how many transistors does the L3 cache account for? Anandtech has mentioned that it will be taken out of bulldozer derivatives which includes trinity, Piledriver, and all future cores because it serves no purpose. Only benefits server type workloads. Here is a link of Anandtech and VR-zone mentioning its non importance with regards to trinity and future derivatives, and techreport mentioning how it didnt provide much more bandwidth.

    http://www.anandtech.com/show/4955/t...x8150-tested/2

    http://vr-zone.com/articles/report-a...led/13807.html

    http://techreport.com/articles.x/21813/6

    But my question is how many of those 2 billion transistors are the 8mb l3 cache? Everyone knows by the die size the L3 takes up alot of space. Also how much would power consumption decrease, and clocking ability increase with the L3 cutout. I think that is how they are getting the power reductions in trinity. I am curious because not just one site and many here even say that bulldozer is a server chip just released for us also with pushed up clocks. Also most know the l3 cache serves only servers because they can always recompile programs quickly for server systems internally.

    The consumer programs dont nearly get updated and recompiled as quickly as server workload programs do. Is AMD likely to have a two production lines? One low demand with L3 cache for servers, second high demand mainstream no L3 cache?

  2. #27
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    L3 has no purpose for client uses?
    then why is Phenom stronger than Athlon with the 45nm chips?
    2500k @ 4900mhz - Asus Maxiums IV Gene Z - Swiftech Apogee LP
    GTX 680 @ +170 (1267mhz) / +300 (3305mhz) - EK 680 FC EN/Acteal
    Swiftech MCR320 Drive @ 1300rpms - 3x GT 1850s @ 1150rpms
    XS Build Log for: My Latest Custom Case

  3. #28
    Xtreme Enthusiast
    Join Date
    Mar 2005
    Posts
    644
    Quote Originally Posted by ga1ve1an View Post
    But my question is how many of those 2 billion transistors are the 8mb l3 cache? Everyone knows by the die size the L3 takes up alot of space. Also how much would power consumption decrease, and clocking ability increase with the L3 cutout. I think that is how they are getting the power reductions in trinity. I am curious because not just one site and many here even say that bulldozer is a server chip just released for us also with pushed up clocks.
    As far that I remember Cache L3 was never power hungry. It is actually a "cheap" way of increasing performance with minor increase in power consumption, for as long as you don't care having ridiculous huge and expensive dies, though Intel always gave itself such a luxury in their Server lines with Cores with tons of Cache L3.
    To know how much of an impact Cache L3 got in Frequency scalability you could compare Propus BL-C3 to Deneb RB-C3. I find it pretty weird that I don't even remember having saw Athlons II getting on average higher Frequencies that Phenoms II, actually, they usually are under it, something that I wasn't expecting. I suppose that this must be related to something that I recall JF-AMD mentioning that they can tweak the Wafer manufacturing process to increased yields at the expense of Frequency headroom or the other way around. It makes sense that the Propus Wafers, being for a value Processor, are going to go to maximize yields, while Deneb being pretty much a mainstream or AMD high end part for Frequency.


    Quote Originally Posted by ga1ve1an View Post
    Also most know the l3 cache serves only servers because they can always recompile programs quickly for server systems internally.

    The consumer programs dont nearly get updated and recompiled as quickly as server workload programs do. Is AMD likely to have a two production lines? One low demand with L3 cache for servers, second high demand mainstream no L3 cache?
    Cache usually helps performance in everything, not only Servers. On some applications it may not help, but on general it does. However, Cache L3 is expensive in die size. Because Servers lines got higher margins due to the premium of being Server parts, they can give the luxury of adding Cache L3 to increase performance, but on Desktop it makes no real sense considering it die size cost. Besides, AMD lineup was precisely like that, you had a value part with no Cache L3 (Propus) and another one with it (Deneb).


    Quote Originally Posted by Manicdan View Post
    L3 has no purpose for client uses?
    then why is Phenom stronger than Athlon with the 45nm chips?
    Actually it says so on the Anandtech Review he linked...

    There's a single 8MB L3 cache that's shared among all Bulldozer modules on a chip. In its first incarnation, AMD has no plans to offer a desktop part without an L3 cache. However AMD indicated that the L3 cache was only really useful in server workloads and we might expect future Bulldozer derivatives (ahem, Trinity?) to forgo the L3 cache entirely as a result.

  4. #29
    Registered User
    Join Date
    Apr 2010
    Posts
    84
    So basically cutting the L3 cache will only increase yields (due to a smaller die), and will have no effect on the power comsumption or clocking ability? I would think the faster you are clocking a die that has all kinds of complex L2 and large L3 caches, the more voltage it would need to have everything working properly, therefore increasing consumption and limiting clocking ability. The inverse of this is not true? Didnt AMD directly say they had planned on the latencies being lower based on higher clocks they were hoping for. That means that the caches have to be running faster as well consuming more, no?

    New question based on a reply. If yields are effected based on the size of the die(especially as rough as GF yields on 32nm seems to be), and the cache is a large percentage of the die, is it possible if there are errors or problems with parts of the cache due to yield that they are closing modules based on cache yield also, therefore giving good 8150's with bad partial caches resold as 6100's and 4100's?

  5. #30
    Xtreme Addict
    Join Date
    Mar 2005
    Location
    Rotterdam
    Posts
    1,553
    AMD should focus on getting their cache latency lower instead...
    And why are AMDs transistors denser than intels at the same node but yet their cache isnt? I seem to recall that intel can fit a lot more cache within the same die area than AMD.
    Gigabyte Z77X-UD5H
    G-Skill Ripjaws X 16Gb - 2133Mhz
    Thermalright Ultra-120 eXtreme
    i7 2600k @ 4.4Ghz
    Sapphire 7970 OC 1.2Ghz
    Mushkin Chronos Deluxe 128Gb

  6. #31
    Xtreme Addict
    Join Date
    Dec 2008
    Location
    Sweden, Linköping
    Posts
    2,034
    Quote Originally Posted by Manicdan View Post
    L3 has no purpose for client uses?
    then why is Phenom stronger than Athlon with the 45nm chips?
    That was true back then, but this is a whole new architecture. We shall see, but derivates will most likely have different dies for consumers and servers where things such as cache (and unnessary amounts) will be adjusted depending on market.
    SweClockers.com

    CPU: Phenom II X4 955BE
    Clock: 4200MHz 1.4375v
    Memory: Dominator GT 2x2GB 1600MHz 6-6-6-20 1.65v
    Motherboard: ASUS Crosshair IV Formula
    GPU: HD 5770

  7. #32
    Xtreme Mentor
    Join Date
    May 2008
    Location
    cleveland ohio
    Posts
    2,879
    Quote Originally Posted by Smartidiot89 View Post
    That was true back then, but this is a whole new architecture. We shall see, but derivates will most likely have different dies for consumers and servers where things such as cache (and unnessary amounts) will be adjusted depending on market.
    slightly off topic, here
    which why i want to see opteron 6220 vs FX 8120 95 watt .

    also isn't SOI have 10% bigger transistors the bulk ?
    Last edited by demonkevy666; 10-28-2011 at 02:19 PM.
    HAVE NO FEAR!
    "AMD fallen angel"
    Quote Originally Posted by Gamekiller View Post
    You didn't get the memo? 1 hour 'Fugger time' is equal to 12 hours of regular time.

  8. #33
    Xtreme Enthusiast
    Join Date
    May 2008
    Posts
    612
    Quote Originally Posted by Manicdan View Post
    L3 has no purpose for client uses?
    then why is Phenom stronger than Athlon with the 45nm chips?
    Phenom prefetcher is weak and only works on the L1 cache. Bulldozer has a huge L2 cache and strong prefetchers for it. That will remove some of L3 cache advantage for a cpu that don't have prefetchers or prefetchers only working with L1 cache.

  9. #34
    Xtreme Member
    Join Date
    Jul 2008
    Posts
    260
    300-900MHz with TC 3.0 for pelidriver-trinity
    http://semiaccurate.com/2011/11/04/g...r-trinity-yet/

    But TC 3.0 will have only more frecuencies? or maybe a thirth state of turbo core?

  10. #35
    I am Xtreme FlanK3r's Avatar
    Join Date
    May 2008
    Location
    Czech republic
    Posts
    6,823
    its not more frequencies, because example FX 4170 has 300 mHz max boost and example FX 8120 has boost 900 MHz. So, the same
    ROG Power PCs - Intel and AMD
    CPUs:i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350,1x AMD FX-8320,1x AMD FX-8300, 2x AMD FX-6300,2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K,AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K,2x i7-4770K, Intel i7-3930KAMD Cinebench R10 challenge AMD Cinebench R15 thread Intel Cinebench R15 thread

  11. #36
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    GPU got the Turbo now,that's what's new .

  12. #37
    Xtreme Member
    Join Date
    Jan 2011
    Location
    145.21.4.???
    Posts
    319
    Trinity engineering sample spec list and other info

    http://www.computerbase.de/news/2011...urchgesickert/

  13. #38
    Xtreme Mentor
    Join Date
    Jun 2008
    Location
    France - Bx
    Posts
    2,601
    Thanks undone !





    FM1 & FM2
    Last edited by Olivon; 11-05-2011 at 02:20 PM.

  14. #39
    I am Xtreme
    Join Date
    Dec 2007
    Posts
    7,750
    does this mean FM2 socket will support FM1 chips?
    2500k @ 4900mhz - Asus Maxiums IV Gene Z - Swiftech Apogee LP
    GTX 680 @ +170 (1267mhz) / +300 (3305mhz) - EK 680 FC EN/Acteal
    Swiftech MCR320 Drive @ 1300rpms - 3x GT 1850s @ 1150rpms
    XS Build Log for: My Latest Custom Case

  15. #40
    Xtreme Mentor
    Join Date
    Jun 2008
    Location
    France - Bx
    Posts
    2,601
    FM2 processors could fit in FM1 socket but FM1 processors on FM2 socket seems impossible if the red circle got no hole in it :





    10% better CPU between the donanimhaber document and the bigpao007's one.
    Last edited by Olivon; 11-05-2011 at 02:59 PM.

  16. #41
    Xtreme Enthusiast
    Join Date
    Apr 2006
    Location
    Bulgaria
    Posts
    544
    Wait a little, I see some other differencies between FM1 and FM2



    Bottom right hole (hole being equal to "no pins there") is shifted "one pin" to the left, compared to FM1.
    Top left hole is shifted one pin to the left.

    I see them physically incompatible.

    On your second picture you should flip the cpu vertially first. Then there are some other difference's.



    Pins marked with red on the cpu side will interfere with the socket notches. Correct me if I'm wrong.
    Last edited by I.nfraR.ed; 11-08-2011 at 03:56 AM.
    X6 1090T (1010MPMW) @ 4267MHz 1.44V NB@3229MHz 1.33V | Asus Crosshair IV Formula | 2x2GB Corsair Dominator GTX2 @ 1845MHz 6-6-5-17-22 1T | Asus HD4890 @ 1050/4800 | Corsair HX850W | HAF 932
    EK Supreme HF | EK-FC4890LT | MCP655 + Koolance D5 Top | ThermoChill PA120.3 + 3x Enermax Magma | XSPC Bay res | Masterkleer 1/2" UV Red

  17. #42
    Xtreme Mentor
    Join Date
    Jun 2008
    Location
    France - Bx
    Posts
    2,601
    You're totally right I.nfraR.ed, I didn't noticed ! My bad !

    Well, so, no FM2 processors on FM1 socket :/
    Last edited by Olivon; 11-05-2011 at 03:21 PM.

  18. #43
    Xtreme Enthusiast
    Join Date
    Apr 2006
    Location
    Bulgaria
    Posts
    544
    Yes . We just need to see FM2 socket if it's incompatible both ways. Is there any picture already available?
    If the corresponding notches will be shifted as well...then no love from AMD this time. Seems like they failed again.
    FM1 seemed useless to me from the beginning.
    Last edited by I.nfraR.ed; 11-05-2011 at 03:36 PM.
    X6 1090T (1010MPMW) @ 4267MHz 1.44V NB@3229MHz 1.33V | Asus Crosshair IV Formula | 2x2GB Corsair Dominator GTX2 @ 1845MHz 6-6-5-17-22 1T | Asus HD4890 @ 1050/4800 | Corsair HX850W | HAF 932
    EK Supreme HF | EK-FC4890LT | MCP655 + Koolance D5 Top | ThermoChill PA120.3 + 3x Enermax Magma | XSPC Bay res | Masterkleer 1/2" UV Red

  19. #44
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    The guy with Trinity A0 says that it just matches Llano at 2.9Ghz in 3dmark and c11.5 ... He didn't say what model but I assume top 3.8Ghz. If so then there is no improvement versus bdver1...

  20. #45
    Xtreme 3D Team
    Join Date
    Jan 2009
    Location
    Ohio
    Posts
    8,499
    Quote Originally Posted by informal View Post
    The guy with Trinity A0 says that it just matches Llano at 2.9Ghz in 3dmark and c11.5 ... He didn't say what model but I assume top 3.8Ghz. If so then there is no improvement versus bdver1...
    There is no way that it can be an improvement...
    AMD is now 80% behind SandyBridge in Cinebench CPC, uses ~20% more power than the old architecture...giving a generous 10% leeway in a 30% gate shrink, 30% behind in WinRAR, which happens to be the only place so far that FX really "shines".
    I hope Piledriver comes with some "I'm on acid" 50% IPC increase...

    They need 20% more clock vs Llano so 700 Mhz over Llano, 3.8 Ghz should beat Llano by a few percent
    Last edited by BeepBeep2; 11-05-2011 at 05:26 PM.
    Smile

  21. #46
    Xtreme Enthusiast
    Join Date
    Mar 2005
    Posts
    644
    After Bulldozer unsuccessfull launch I'm quite sceptical about its successors because I still don't see if the architecture can be improved to become truely competitive or this entire experimental concept isn't useful for real world usage and should be canned, at least on Desktop and Mobile. The point is that until Bulldozer doesn't prove itself to be able to provide consistent, all-around better performance than a K10, getting brainwashed about the benefits of the next generations isn't going to do anything helpful considering current AMD credibility when you can simply sit where you are and not miss anything important until you get actual launch date results. What worries me is that if Bulldozer starts to replace K10 along all the product range and it still doesn't deliver, we are going to miss K10 architecture a lot, as I suppose many did when Northwoods started to become scarse as Prescott production had its take off. And, compared to Prescott replacing Northwood, AMD DOES have a 32nm K10 shrink design ready as part of Llano, while Northwood didn't had any type of 90nm shrink.
    Thuban die size is 346 mm^2 at 45nm while Zambezi is 315 mm^2 at 32nm. A hyphothetical 32nm shrink of Thuban should be around 242 mm^2 (346 * 0.7). You could have end with something that is smaller than Zambezi thus cheaper to manufacture, lower power consumption that either Thuban or Zambezi, a bit higher Frequency headroom, and were it incorporating Llano's K10 improvements, slighty higher performance per Hz, meaning that such 32nm Thuban could make Bulldozer have no reason to exist on Desktop. As I asumme that AMD should have knew with enough anticipation that Bulldozer wasn't going to shine, how much doing a simple die shrink of a current design could have cost in time and money to NOT have doing so instead of all the worthless Bulldozer hype that just hurted AMD?


    I think that AMD strategy was a total screw up of things: Bulldozer is quite dissapointing for general Desktop performance, generally not worth being called an upgrade over a 45nm K10. However, considering that on Servers Bulldozer could actually be much more competitive due to the abundance of Multithreaded workloads, it would have been better if AMD efforts concentrated on providing Bulldozer for Servers and still rely on K10 for Desktop yet another generation. Its what AMD did back on K8 launch: Opteron first, Athlon 64 almost half a year later while Bartons were getting owned by Northwood C. I don't understand what AMD had to win by releasing Bulldozer for Desktop in its current incarnation considering that is the market where less chances it had to shine, though I didn't saw Reviews of how Bulldozer does on Server to know if at least it holds it ground on that market.
    Last edited by zir_blazer; 11-05-2011 at 06:01 PM.

  22. #47
    Xtreme 3D Team
    Join Date
    Jan 2009
    Location
    Ohio
    Posts
    8,499
    Even for servers, I dont see how BD is a win.
    16 core server chips, they could have shrunk Magny Cours and improved clockspeed a few hundred Mhz or added 4 more K10 cores.
    They should have added Llano 3-5% improvements plus 32nm shrink on X6, add in 200-300 Mhz headroom and called it a day.

    Anyone saying Bulldozer isn't as bad as Phenom I launch needs to stop drinking Kool-Aid and check facts unfortunately.

    I'm not saying it isnt fun to OC though, but other than that there is no logical reason to buy the product.
    Smile

  23. #48
    Registered User
    Join Date
    Dec 2005
    Location
    New Zealand
    Posts
    63
    Those projected % increases vs llano, if you actually read the slides they have a 1 next to both of them. In both cases it states this is for digital media workloads, NOT ipc.

    Still it has magically increased. Personally I think it is just smokes and mirrors marketing bs. I bet there will basically minimal change in performance for piledriver vs bulldozer. These large % differences AMD are claiming I think are just selected benchmarks. I.E the ones that already do well for Bulldozer vs Thuban.

    I doubt GFs process has improved that much, maybe a little but certainly not a lot. When the supply of AMD chips starts rapidly improving then you will see they are getting on top of yields and process peformance.
    If they get the process issues really sorted, then I think we could start seeing some really good results.

  24. #49
    Xtreme Cruncher
    Join Date
    Apr 2008
    Location
    Ohio
    Posts
    3,119
    Quote Originally Posted by BeepBeep2 View Post
    Even for servers, I dont see how BD is a win.
    16 core server chips, they could have shrunk Magny Cours and improved clockspeed a few hundred Mhz or added 4 more K10 cores.
    They should have added Llano 3-5% improvements plus 32nm shrink on X6, add in 200-300 Mhz headroom and called it a day.
    BD added new Instruction sets which in proper bench marks, show how much better BD is then Phenom II. I know, not a lot of applications take advantage of these yet, but soon will. Now it Thuban could have been shruck, added Instruction sets and bumbed the clock speed... It would have been a nice chip.
    ~1~
    AMD Ryzen 9 3900X
    GigaByte X570 AORUS LITE
    Trident-Z 3200 CL14 16GB
    AMD Radeon VII
    ~2~
    AMD Ryzen ThreadRipper 2950x
    Asus Prime X399-A
    GSkill Flare-X 3200mhz, CAS14, 64GB
    AMD RX 5700 XT

  25. #50
    Xtreme Member
    Join Date
    Jul 2008
    Posts
    260
    Quote Originally Posted by Olivon View Post
    Thanks undone !





    FM1 & FM2
    I think that models ES of Trinity don't have the improved of Pelidriver core. I can see in the slide the date of October and in this month AMD launched BD, then these ES models of Trinity have Bulldozer core not PD core.

    The best of these ES models have the perfomance of FX-4100 and this is equal at the performance of A8-3850 in CPU.

    I hope with the improved of 10-15% IPC and little better more MHz (or better Turbo core) We will see the 20 % more performance than Llano. 30% is much and I think is a old prediction of AMD when they expected more performance for BD.
    Last edited by cesariuth; 11-05-2011 at 10:27 PM.

Page 2 of 48 FirstFirst 1234512 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •