Ryzen 9 on the test bench... Sometimes older generation/tech as well.
Interesting reads here, but 3 questions:
1.) why would AMD do such a thing (you wrote for better clock-scaling, that means higher clocks or better scaling?)
2.) can it be optimized further
3.)this contradicts somewhatWrite Trough means that every write to the cache causes a synchronous write to the backing store. Because L2 is slower than L1, L1 must wait for L2 to write out data.or you mean this can be due to WCC? I am not a professional, but I would guess WCC doesnt take that long to preserve coherence... If anything at all, I'd guess the waiting of L1D for L2 can be a problem, but if stuff is written to L1D fast, and then some cycles later WCC writes to L2, I guess there will only be rare cases whereThe L3 is a non-inclusive victim cache (L2 data are evicted to the L3) with data transfered from L3 to the L1D of the expected core without being copied to the L2.really matters...The L1 is in Write-through, but you're not sure a data not in L2 is not the L1D of another core.
I guesswill first speed up things, thendata transfered from L3 to the L1D of the expected core without being copied to the L2.will need some time. All, to my eyes, depends on how this stuff is used, it can be faster in one case, and slow in the other... Mhm somewhere we came to that conclusion beforesnoop traffic in order to keep the coherency correct
Maybe the problem is because of, one core might need to wait for WCC to complete to ensure coherency?The L1 is in Write-through, but you're not sure a data not in L2 is not the L1D of another core.
Ahhhh btw... CONGRATZ FOR NEW WR![]()
1. ASUS Sabertooth 990fx | FX 8320 || 2. DFI DK 790FXB-M3H5 | X4 810
8GB Samsung 30nm DDR3-2000 9-10-10-28 || 4GB PSC DDR3-1333 6-7-6-21
Corsair TX750W | Sapphire 6970 2GB || BeQuiet PurePower 450w | HD 4850
EK Supreme | AC aquagratix | Laing Pro | MoRa 2 || Aircooled
When will the first reviews come out?
INTEL 2600K @ 4.5ghz 24/7 Corsair H100
ASUS P8Z68-V PRO
2 x CORSAIR 4GB DDR3 1600 (CL8)
4TB Seagate SATA2
SAPPHIRE 7950 (GPU 1100 | MEM 1500)
Cosmos S
Asus XONAR DX
Corsair 850W PSU
How you can claim such things ? Are you chip architect of BD, or some experienced with low level code optimisation programmer to claim that the WCC is a joke or good enough patch for WT policy?
So, on 10h is also non-inclusive, or exclusive, and K10 with L3 cache is faster than K10 without cache. Gain from such large L3 may be bigger if cache is faster, but K10 core run very well withot L3. Non-inclusive L3 isn't reason for low performance, especially in FP intensive, or integer intensive, number crunching apps like wprime or CB. With BD module there is 2MB of 16-way L2 cache which is enough for achieving acceptable performance levels. I don't think so that the SB will run so great without L3 cache. I also think that cache hieararchy of AMD K10 and BD is some kind of tradeoff for having good enough performance of low end, and APU products, like Llano and Trinity.The L3 is a non-inclusive victim cache (L2 data are evicted to the L3)
L2 is mostly inclusive of L1D caches, so the data from L1D caches in most cases is in L2, because of WT and mostly inclusive cache policy. But to have complete picture of the puzzle, we need to know how they solved problems with snoop traffic and how much from for eg. core 0 must snoop core 1 L1D through the L3.with data transfered from L3 to the L1D of the expected core without being copied to the L2. That mean high snoop traffic in order to keep the coherency correct. And snoop traffic is something really unwanted from a bandwidth/performance pov. There is a pardox here : The L1 is in Write-through, but you're not sure a data not in L2 is not the L1D of another core.
The question for AMD is what is the average hit rate of L1 copy in L2.
"That which does not kill you only makes you stronger." ---Friedrich Nietzsche
PCAXE
I told that before. For WB cache in opposite from WT, writes are not immediately mirrored to the store.
From RWT Article about BD architecture:
Stores from both L1D caches go through the WCC, where they are buffered and coalesced. The purpose of the WCC is to reduce the number of writes to the L2 cache, by taking advantage of both spatial and temporal locality between stores. For example, a memcpy() routine might clear a cache line with four 128-bit stores, the WCC would coalesce these stores together and only write out once to the L2 cache.
"That which does not kill you only makes you stronger." ---Friedrich Nietzsche
PCAXE
CPU: - I7 4930K (EK Supremacy )
GPU: - 2x AMD HD7970 flashed GHZ bios ( EK Acetal Nickel Waterblock H2o)
Motherboard: Asus x79 Deluxe
RAM: G-skill Ares C9 2133mhz 16GB
Main Storage: Samsung 840EVO 500GB / 2x Crucial RealSSD C300 Raid0
Chew*, happy testing others chips, nice hand choice for 4th 8.4GHz+ chip! Only this 4th was the first!
![]()
ROG Power PCs - Intel and AMD
CPUs:i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350,1x AMD FX-8320,1x AMD FX-8300, 2x AMD FX-6300,2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K,AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K,2x i7-4770K, Intel i7-3930KAMD Cinebench R10 challenge AMD Cinebench R15 thread Intel Cinebench R15 thread
If that's just four chips, I want to see the best chip of a batch. I'm pretty sure that hitting 8.6+ in the first two to three weeks after launch won't be hard and with time, breaking 9 should be possible.
ing awesome achievement guys.
Well, if they don`t do marketing, who planned today`s surprise for them? On Intel`s day as well ;-).Originally Posted by Manicdan
http://www.youtube.com/watch?v=5AB-G...layer_embedded
Even if an FX8150 needs to go 6.5Ghz to compete with 5.8Ghz 2600K in 3Ds...it still better considering how easy to find 58X 2600K
but...its speculation still...Maybe not!
matla:oh, I forgot at Intel day, today is FX suprised OC day
ROG Power PCs - Intel and AMD
CPUs:i9-7900X, i9-9900K, i7-6950X, i7-5960X, i7-8086K, i7-8700K, 4x i7-7700K, i3-7350K, 2x i7-6700K, i5-6600K, R7-2700X, 4x R5 2600X, R5 2400G, R3 1200, R7-1800X, R7-1700X, 3x AMD FX-9590, 1x AMD FX-9370, 4x AMD FX-8350,1x AMD FX-8320,1x AMD FX-8300, 2x AMD FX-6300,2x AMD FX-4300, 3x AMD FX-8150, 2x AMD FX-8120 125 and 95W, AMD X2 555 BE, AMD x4 965 BE C2 and C3, AMD X4 970 BE, AMD x4 975 BE, AMD x4 980 BE, AMD X6 1090T BE, AMD X6 1100T BE, A10-7870K, Athlon 845, Athlon 860K,AMD A10-7850K, AMD A10-6800K, A8-6600K, 2x AMD A10-5800K, AMD A10-5600K, AMD A8-3850, AMD A8-3870K, 2x AMD A64 3000+, AMD 64+ X2 4600+ EE, Intel i7-980X, Intel i7-2600K, Intel i7-3770K,2x i7-4770K, Intel i7-3930KAMD Cinebench R10 challenge AMD Cinebench R15 thread Intel Cinebench R15 thread
Ok guys,clock speed is great and all but don't expect too much from FX (versus i7 or even thuban). That way you won't be that much disappointed. ..
"That which does not kill you only makes you stronger." ---Friedrich Nietzsche
PCAXE
2500k @ 4900mhz - Asus Maxiums IV Gene Z - Swiftech Apogee LP
GTX 680 @ +170 (1267mhz) / +300 (3305mhz) - EK 680 FC EN/Acteal
Swiftech MCR320 Drive @ 1300rpms - 3x GT 1850s @ 1150rpms
XS Build Log for: My Latest Custom Case
Gigabyte Z77X-UD5H
G-Skill Ripjaws X 16Gb - 2133Mhz
Thermalright Ultra-120 eXtreme
i7 2600k @ 4.4Ghz
Sapphire 7970 OC 1.2Ghz
Mushkin Chronos Deluxe 128Gb
http://valid.canardpc.com/show_oc.php?id=1874555
i'm sorry for being OT, but does the above link indicate a processor oc'ing all its 6 cores to 7+ Ghz? I don't think so. I think it is mostly the fact that CPU-Z has all information and is showing all the information, i.e number of cores etc.
On topic, let us hope that BD has more to it than just clocks. When informal and all are losing hope, i wonder what remains for me...
mhm informal by what you write in your blog, even if ipc per bulldozer core will not or even slightly be higher then per thuban core, I think this processor gonna rock (dont care much if its slower the i7 on stock clocks given that OC numbers)
only question mark is at power consumption still.. though the shared design and clock gating should do its work..
hehe maybe we will thank xsecret one day, when we were all laying down depressed and then really the boom boom bulldozer breaks in with thuban ipc or more and 5ghz on air![]()
Last edited by Oese; 09-13-2011 at 12:33 PM.
1. ASUS Sabertooth 990fx | FX 8320 || 2. DFI DK 790FXB-M3H5 | X4 810
8GB Samsung 30nm DDR3-2000 9-10-10-28 || 4GB PSC DDR3-1333 6-7-6-21
Corsair TX750W | Sapphire 6970 2GB || BeQuiet PurePower 450w | HD 4850
EK Supreme | AC aquagratix | Laing Pro | MoRa 2 || Aircooled
________________
Main:
Phenom II x6 1090T BE|Crosshair IV Formula|Corsair 4x2GB DDR3|Sapphire HD5870|Adaptec 2405 + Hitachi Ultrastar 15k 450GB SAS, Toshiba MBD2147RC 146GB 10k SAS, Samsung F3 1TB, Seagate Barracuda Green 2TB 5900RPM, WD Black 2TB, Seagate Barracuda ST2000M001 2TB|Asus Xonar Essence ST + HD600|Corsair HX850|HPZR24w|Fractal Define XL Black|Windows 7 X64 Pro
Backup/Storage server:
HP Proliant ML350 G4|2 x Xeon "Nocona" 3GHz|4GB DDR1 ECC|Storage (SCSI): 3x10k 72GB + 10k 300GB + 15k 300GB + Ultrium460 tape drive|Storage (SATA): Adaptec 2810SA + 2 x WD Caviar 250GB RAID0 + Seagate 250GB|Windows Server 2008r2 Datacenter
Other:
HP Proliant DL380 G5|Xeon 5150|4GB FB DDR2 ECC|HP Smart Array P400-256MB cache|3x10k 146GB SAS in RAID 0 + 10k 146GB SAS|2x800W|ATi FireGL V7700|Samsung 226BW|Windows Server 2008r2 Enterprise
HP DL320 G5|Xeon 3150 2.13GHz|1GB DDR2 ECC|2x80GB RAID 0|Windows Server 2008r2 Standard
Laptop:
HP 8560w|i5-2540M|2x4GB DDR3|AMD FirePro M5950|Samsung 840 Pro 256GB|Windows 7 X64 Pro
If that is true, BD per core is faster than K10. BD-Interlagos Opterons is almost same clocked as Magny Cours. How much is 12-core Opteron faster than 8-core? For 50% more cores, 35% or something like that?There is no chance that Zambezi could be slower on average than Thuban.
I know!but i think informal is just playing it safe.![]()
"That which does not kill you only makes you stronger." ---Friedrich Nietzsche
PCAXE
yes, 3.6ghz/4.2ghz turbo 8core in 125W TDP seems not too bad... especially if these ipc numbers hold true.
BTW post no. 1000time for an avatar
![]()
1. ASUS Sabertooth 990fx | FX 8320 || 2. DFI DK 790FXB-M3H5 | X4 810
8GB Samsung 30nm DDR3-2000 9-10-10-28 || 4GB PSC DDR3-1333 6-7-6-21
Corsair TX750W | Sapphire 6970 2GB || BeQuiet PurePower 450w | HD 4850
EK Supreme | AC aquagratix | Laing Pro | MoRa 2 || Aircooled
If I remember correct the first 8 samples tested did 8Ghz or more with LN2, also some very high on water. Clocks is good, and architecture is...
Like another said, "some" guys without mension any in here may be just great pay'd off by Intel who don't know whay they're talking about or have too much spare time. In here BD is kind of underestimated from guys who does'nt know. Even with an eventuelly game from AMD (ohh can it be?), they buy it like a child. Wait for release... It will not perform bad, but compared to whay you've wait and see.
Ivy Bridge 3770K @ ????MHz
6c Intel Xeon X7460 24MB cache 16GB RAM 22TB HDD fileserver
Dual Intel Xeon E5620 workstation
SB 2600K @ 5016MHz 1.37v HT on AIR primestable
AMD Athlon X3 425 @ B25 4GHz+ AIR
AMD Athlon X2 6400+ @ 3811MHz AIR
AMD Athlon X2 3600+ @ 3200MHz AIR
AMD Athlon XP 1700+ @ 2714MHz AIR
Thermalright Ultra-120 Extreme
Corsair 8GB XMS3 2000MHz
ATI Radeon HD5850 @ 1000MHz+/1200MHz+
Windows 7 Enterprise x64
Corsair HX750W
Bookmarks