MMM
Page 3 of 7 FirstFirst 123456 ... LastLast
Results 51 to 75 of 166

Thread: ATI Aiming For R300 Deja-Vu: Officially Launches 4800 Campaign

  1. #51
    Xtreme Mentor
    Join Date
    Apr 2007
    Location
    Idaho
    Posts
    3,200
    Doesn't look too bad so far. Have to keep in mind it's early silicon.
    ATi is making good gains in the value segment, which helps a lot.
    "To exist in this vast universe for a speck of time is the great gift of life. Our tiny sliver of time is our gift of life. It is our only life. The universe will go on, indifferent to our brief existence, but while we are here we touch not just part of that vastness, but also the lives around us. Life is the gift each of us has been given. Each life is our own and no one else's. It is precious beyond all counting. It is the greatest value we have. Cherish it for what it truly is."

  2. #52
    Engineering The Xtreme
    Join Date
    Feb 2007
    Location
    MA, USA
    Posts
    7,217
    Quote Originally Posted by 003 View Post
    Yeah, with a Q9450 at 3.6GHz and my 2900Pro at 874/1000, I score 14.5K in 3DMark06. The 2900Pro has 320SPs and 16TMUs.

    The 4850 is paired with a QX6850 at 4.0GHz, and the 4850 has clock speeds of 670/1100, and it has 800 SPs and 32 TMUs. It's score is not impressive.

    but 2900s have always had drastically inflated 06 scores compared with their game performance, ever occurred to you that the 4850 is a different architecture...?

  3. #53
    Xtreme Cruncher
    Join Date
    Aug 2006
    Location
    Denmark
    Posts
    7,747
    Quote Originally Posted by Extelleron View Post
    HD 4870 does not have 160SPs. It has 800, just like R600 has 320. They are just different than nVidia's; for one they are clocked much lower.

    If HD 3870 has 64SPs, then why do those 64SPs @ 775MHz outperform the 9600GT's 64SPs @ 1.625GHz by a huge margin?
    Wrong. You just ate the marketing raw. Does the GTX280 have 480?

    800 sounds better than 160, specially if your competitor got 240.

    Your 3870 outperform the 9600GT because they are 1:2 SPs and the 3870 are 1:5. So even the doubled clock on 9600GT leaves it around 1:4.

    Why you think a 112-128 SP easily beat a 320 SP 3870 card? Even with doubled SP clock it would still be 256 vs 320 at peak.
    Crunching for Comrades and the Common good of the People.

  4. #54
    Xtreme Cruncher
    Join Date
    Aug 2006
    Location
    Denmark
    Posts
    7,747
    Quote Originally Posted by SNiiPE_DoGG View Post
    but 2900s have always had drastically inflated 06 scores compared with their game performance, ever occurred to you that the 4850 is a different architecture...?
    What makes 4800 series different? Its all absed on R600 base designs. And 4800 series is nothing more than a glorified 3800 series that was nothing more than a slightly changed R600.
    Crunching for Comrades and the Common good of the People.

  5. #55
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by SNiiPE_DoGG View Post
    but 2900s have always had drastically inflated 06 scores compared with their game performance, ever occurred to you that the 4850 is a different architecture...?
    It is, but not that different. And 4 GHz Yorkfield, really screws up the 3DMark06 score. Compare a Q6600 at stock with a 2900XT, and then with it clocked at 4 GHz. The difference is obvious to anyone.

    R6xx and high CPU OC rapes my system in 3DMark06, but in games, at 2560 * 1600 and with just 4 * AA / 16 * AF, that is not so... 3DMark06 sucks for compare to anything but same GPU architecture, and CPU ... Not any good comparisson with games and high quality settings and resolutions.
    Last edited by Seraphiel; 06-12-2008 at 04:14 PM.

  6. #56
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803
    IMO nobody should be benching cards on '06 anymore; it is 99% CPU dependent
    BS!

    anyway, gaming performance needed. let's see AA/AF in action with 4800 series please
    Last edited by adamsleath; 06-12-2008 at 04:24 PM.
    i7 3610QM 1.2-3.2GHz

  7. #57
    Xtreme Member
    Join Date
    Apr 2006
    Posts
    290
    Quote Originally Posted by Shintai View Post
    Wrong. You just ate the marketing raw. Does the GTX280 have 480?

    800 sounds better than 160, specially if your competitor got 240.

    Your 3870 outperform the 9600GT because they are 1:2 SPs and the 3870 are 1:5. So even the doubled clock on 9600GT leaves it around 1:4.

    Why you think a 112-128 SP easily beat a 320 SP 3870 card? Even with doubled SP clock it would still be 256 vs 320 at peak.
    Why does a 112SP easily beat a 320SP 3870 card? Well, it doesn't. http://www.firingsquad.com/hardware/...ance/page8.asp

    The HD 3870 has more shading power than a 8800GT, as seen by the Perlin Noise test.

    If we take the theoretical....

    8800GT = 112SP * 1.5 = 168
    HD 3870 = 320SP * .775 = 248

    HD 3870 should be 47% faster, in real world it is 16% faster. So yes, R600's 320 stream processors are a bit less efficient than those of G92. But by no means is it fair to compare R600 as having 64 SPs and G92 as 128. It certainly is a means of comparison, but it does not at all reflect real world performance. It is much more accurate to describe the HD 3870 as 320 SP @ 775MHz than describe it as 64SP @ 775MHz, in comparison to the 8800GT's 112SP @ 1.5GHz.

  8. #58
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by Extelleron View Post
    Why does a 112SP easily beat a 320SP 3870 card? Well, it doesn't. http://www.firingsquad.com/hardware/...ance/page8.asp

    The HD 3870 has more shading power than a 8800GT, as seen by the Perlin Noise test.

    If we take the theoretical....

    8800GT = 112SP * 1.5 = 168
    HD 3870 = 320SP * .775 = 248

    HD 3870 should be 47% faster, in real world it is 16% faster. So yes, R600's 320 stream processors are a bit less efficient than those of G92. But by no means is it fair to compare R600 as having 64 SPs and G92 as 128. It certainly is a means of comparison, but it does not at all reflect real world performance. It is much more accurate to describe the HD 3870 as 320 SP @ 775MHz than describe it as 64SP @ 775MHz, in comparison to the 8800GT's 112SP @ 1.5GHz.
    Nice FUD ... haven't you understood the numbers?

    ATI's way of counting SP is in terms of execution units = 320, regardless of structure, which really is 64 SP * 5 EU = 320 EU total

    NVIDIA was more correct and only counted SP and not EU = 128, but if counted like ATI did it, then the number is 128 SP * 2 EU = 256 EU... If the "missing MULL" is counted with nVIDIA, then it is 384 EU, but that one is shadowed area, and shouldn't be counted, as I have yet to see it to be effective.

    Do the math and clocks with these numbers... but also remember shaders aren't everything, and only one part of effective performance, relative to the program executed.

    By your numbers, R600 destroys G80... but that isn't the reality, now is it? Especially, when both solutions are pressed: 4 MP resolutions, and with maxed AA and AF... or even just 4 * MSAA + 16 * AF.
    Last edited by Seraphiel; 06-12-2008 at 04:31 PM.

  9. #59
    I am Xtreme zanzabar's Avatar
    Join Date
    Jul 2007
    Location
    SF bay area, CA
    Posts
    15,871
    Quote Originally Posted by Shintai View Post
    Wrong. You just ate the marketing raw. Does the GTX280 have 480?

    800 sounds better than 160, specially if your competitor got 240.

    Your 3870 outperform the 9600GT because they are 1:2 SPs and the 3870 are 1:5. So even the doubled clock on 9600GT leaves it around 1:4.

    Why you think a 112-128 SP easily beat a 320 SP 3870 card? Even with doubled SP clock it would still be 256 vs 320 at peak.

    then how can u boot up something that uses streams and get 320 simultaneous operations, it counts them like a cell, NV counts per unit snce it cant always execute 2 operation at a time (when acting like a pixle shader or using cuda)

    NV can score better since it has faster use of emulated vertex shaders it has a much faster texture fill rate than its pixle fill rate so when u have something optimized for 128 shaders then also using texture overlays instead of rendering from scratch then the NV card will score much better, but if u have full 3d rendering and texture creation (3d mark 06, maya, cad, solid works, COJ, mass effect somewhat) then ati will score much better than the NV card. thats why vantage scores better on NV it uses vary little texture rendering on gpu but handles 3d interactions on cpu/ppu
    Last edited by zanzabar; 06-12-2008 at 04:31 PM.
    5930k, R5E, samsung 8GBx4 d-die, vega 56, wd gold 8TB, wd 4TB red, 2TB raid1 wd blue 5400
    samsung 840 evo 500GB, HP EX 1TB NVME , CM690II, swiftech h220, corsair 750hxi

  10. #60
    Xtreme Addict
    Join Date
    Dec 2007
    Posts
    1,326
    I guess this pretty much confirms the 800 shader units.

    Sweet.

  11. #61
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by ToTTenTranz View Post
    I guess this pretty much confirms the 800 shader units.

    Sweet.
    Oh really? How come it does that, what makes you believe that it has 800 EU?

    My own 8800 Ultra can match that score, with a 4 Ghz Yorkfield... It only has 128 SP ... or counted like ATI does: 256 SP.

  12. #62
    Xtreme Member
    Join Date
    Mar 2005
    Posts
    232
    Quote Originally Posted by ToTTenTranz View Post
    I guess this pretty much confirms the 800 shader units.

    Sweet.
    How the hell did you come to that conclusion? It performs about the same as the 128 shader cores on the 9800GTX, so it must have 6.25x more shaders? o.0

    t

  13. #63
    Registered User
    Join Date
    Apr 2008
    Posts
    23
    Quote Originally Posted by Seraphiel View Post
    Does the 4870 really have that many? I thought, they had a 1:5 ratio, in term of their FUD about SP count? R600 had 320 SP / 5 = 64 SP vs G80 GTX/Ultra that had 256 SP / 2 = 128 SP

    I thought, the 4850 / 4870 only really had 96, increased from 64, but still with more execution units per SP than nVIDIA ... 1:5 vs 1:2.

    In terms of execution units ... GTX 280 and 4870 is the same: 480(1:2) vs 480(1:5). Meaning 240 SP for nvidia and 96 for ATI, and there is the clock difference.

    I thought, ATI's 320 SP for R600 was FUD and not the same as nVIDIA's 128 SP, which has two execution units (and counted as ATI does, that would be 256 SP).

    Please someone explain.
    Hmm. Where do we start...I'll use the R600 as baseline since you seem to be familiar with that. The R600 had 64 Shader Processors, with each Shader processor having 5 scalar ALUs (the Stream Processors). One of the 5 ALUs is capable of doing transcedentals(sin, cos etc.). 64*5=320(the magic number).

    Based on utilization achieved by the compiler(it's generally around 80% on average nowadays, it was somewhat worse at launch), you can at worse have 64 ALUs doing work(worst case scenario, exclusively dependent serial code, doesn't happen in practice), or at best have full utilization and manage to schedule 5 ops per Shader processor and thus get 320 ALUs doing work. Each ALU is MADD capable so you get 2 FLOPS (floating point ops) per ALU.

    nV's 128 ALUs for the G80 are also MADD capable, and they were also supposed to be able to coissue a MUL alongside that scalar MADD, but that never quite materialized (the GT200 is supposedly capable of doing this). For all intents and purposes they're also pegged at 2 FLOPS per ALU. What's peculiar for nV's ALUs is that they work at a far higher frequency, so you can either claim you have 128 ALUs running at that frequency, or 256 ALUs running at frequency/2. One of the advantages of nV's independent scalar ALU arrangement is that scheduling is easier, and they're less sensitive to the type of code they're running.

  14. #64
    Xtreme Mentor
    Join Date
    Oct 2005
    Posts
    2,788
    Quote Originally Posted by SNiiPE_DoGG View Post
    but 2900s have always had drastically inflated 06 scores compared with their game performance, ever occurred to you that the 4850 is a different architecture...?
    No, the 4850 is a very much enhanced HD3000, and the HD3000 was just a die shrink of the R600. If anything, it should have an even more inflated scored than the 2900, the underlying architecture is the same. It just has more, more, more.
    Asus Rampage II Gene | Core i7 920 | 6*2GB Mushkin 998729 | BFG GTX280 OCX | Auzentech X-Fi Forte | Corsair VX550
    —Life is too short to be bound by the moral, ethical and legal constraints imposed on us by modern day society.

  15. #65
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803
    http://www.theinquirer.net/gb/inquir...ormance-posted
    According to the site’s post, the DAAMIT benchmarketing shows its single-GPU newborns thoroughly thrashing Graphzilla’s 9800GTX (4870) and 8800GT (4850). In the first instance, the 4870 outguns the 9800GTX in all the games with anything from a 36 per cent to 48 per cent lead, in the second instance (4850 vs. 8800GT) DAAMIT has anything from 26 per cent to a 48 per cent lead. Even if these numbers are only benchmarketing, it does look like ATI took special care on improving AA, AF and DX10 performance – which also means it’s been paying attention to its customers.
    oh yeah, could be fake...
    i7 3610QM 1.2-3.2GHz

  16. #66
    Xtreme Member
    Join Date
    Mar 2005
    Posts
    232
    Quote Originally Posted by adamsleath View Post
    Bull was already called on those benchies, not enough variation.

    t

  17. #67
    Xtreme Member
    Join Date
    Apr 2006
    Posts
    290
    Quote Originally Posted by Seraphiel View Post
    Nice FUD ... haven't you understood the numbers?

    ATI's way of counting SP is in terms of execution units = 320, regardless of structure, which really is 64 SP * 5 EU = 320 EU total

    NVIDIA was more correct and only counted SP and not EU = 128, but if counted like ATI did it, then the number is 128 SP * 2 EU = 256 EU... If the "missing MULL" is counted with nVIDIA, then it is 384 EU, but that one is shadowed area, and shouldn't be counted, as I have yet to see it to be effective.

    Do the math and clocks with these numbers... but also remember shaders aren't everything, and only one part of effective performance, relative to the program executed.

    By your numbers, R600 destroys G80... but that isn't the reality, now is it? Especially, when both solutions are pressed: 4 MP resolutions, and with maxed AA and AF... or even just 4 * MSAA + 16 * AF.
    AMD has 5-way shaders, yes. They chose to count their stream processors a different way than nVidia, because in comparing G92 vs. RV670, we see that it reflects performance quite nicely.

    By my numbers, R600 loses to G80 (8800GTX), but wins against the castrated parts like 8800GT / G80 GTS. In games, is this the case? No, definitely not in the case of the 8800GT. Games take into account much more than pure ALU performance. G92/G80 has more texture power and in the case of G80, more pixel power. Also, R600's SPs are not as efficient in real world usage as they are in benchmarks like Perlin Noise - the R600 design (1 "fat" SP + 4 weaker ones) proves less efficient in games than nVidia's G80 design. In theoretical ALU benchmarks like Perlin Noise, this inefficiency is not present and R600 presents itself well.

    Doing the numbers your way makes no more sense than doing the numbers AMD's way. If I do what you are saying, then I compare HD 3870 with 64 SPs @ 775MHz to 9600GT with 64 SPs @ 1625MHz.... yet the 3870 wins by nearly a factor of 2x in that battle. So is that a fair comparison? Heck no.

    G80/G92 and R600/RV670 have VERY different shader designs, so they deserve very different ways of counting them. AMD's way is certainly preferrable for marketing, but when you factor in that AMD's parts have less than 1/2 the clock of nVidia's, they make sense from a technical standpoint. Comparing 320 RV670 SPs @ 775MHz to 64 G94 SPs @ 1625MHz is just as valid of a comparison as it would have been to compare 16 R520 pipes @ 625MHz to 24 G70 pipes @ 430MHz..... in other words, not a good comparison because of the different architecture, but a fair one nevertheless.

  18. #68
    Xtreme Addict
    Join Date
    May 2007
    Posts
    2,125
    When will people learn that shaders across architectures aren't meant to be literally compared and that not all shaders are made equal!

    Great explanations Extelleron and Morgoth

  19. #69
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by Morgoth Bauglir View Post
    *snip*
    Thank you, that is the kind of insight, I have seen from the Beyond3D forum

    Quote Originally Posted by Extelleron View Post
    *snip*
    Sorry, I didn't notice your comparison (G92 vs RV670).

    I only assumed G80 A2 (GTX) / G80 A3 (Ultra) vs ATI's offers at that time. I'll be the first to admit that G92 has its flaws vs G80 A2 / A3. I can still find benchmarks, especially with Crysis, where my two Ultras destroys 2 * G92 (GX2) ... 2 * G80 A3 vs 4 * G92.

    My apologizes
    Last edited by Seraphiel; 06-12-2008 at 04:56 PM.

  20. #70
    Xtreme Addict
    Join Date
    Dec 2007
    Posts
    1,326
    Quote Originally Posted by Seraphiel View Post
    Oh really? How come it does that, what makes you believe that it has 800 EU?

    My own 8800 Ultra can match that score, with a 4 Ghz Yorkfield... It only has 128 SP ... or counted like ATI does: 256 SP.
    You can't compare the G80's shaders with R6xx ones. RV670/R600 only has 64 shader processors "equivalent" to the G80/G92 ones, and nVidia has them at twice the clock speed. It's everything so different that you can't compare them that way. Besides, the benchmarks' score won't tell you the exact ammount of shaders.

    I don't remember where I saw that, but I remember seeing some calculations proving that if ATI said it could do 1 TeraFlop, the RV770 had to have 800SPs for the numbers to match. Maybe it was in Beyond3D forums.. If I find it again I'll post in here.

  21. #71
    Xtreme Addict
    Join Date
    May 2007
    Posts
    2,125
    We know that the 4850 comes stock at 625 MHz

    ATI Flops: Frequency * SP count * 2 (for MADD)

    so 625 * 480 * 2 = 600GFlops

    625 * 800 * 2 = 1000 GFlops = exactly 1 Tflop

    The only way 480SPs reaches 1TFlop is if it has a ~1000MHz shader clock but we do not have any evidence it still has a shader domain

  22. #72
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by ToTTenTranz View Post
    You can't compare the G80's shaders with R6xx ones. RV670/R600 only has 64 shader processors "equivalent" to the G80/G92 ones, and nVidia has them at twice the clock speed. It's everything so different that you can't compare them that way. Besides, the benchmarks' score won't tell you the exact ammount of shaders.

    I don't remember where I saw that, but I remember seeing some calculations proving that if ATI said it could do 1 TeraFlop, the RV770 had to have 800SPs for the numbers to match. Maybe it was in Beyond3D forums.. If I find it again I'll post in here.
    Fair enough, but aren't you sure it wasn't the R700 / 2 * RV770? Even if ATI could provide that theoretical max in terms of flops, the reality of it would hard would be same. No architecture could ever get 100% of theoretical max, and ATI's architecture is very limited in this regard.

    But then again, ATI still claims 320 / 480 SP vs NVIDIA's 128 / 240 SP from papers, and that alone should tell anyone, that ATI is full of it...

    As soon as they called their part 320 SP vs nVIDIAs G80 GTX / Ultra's 128 SP... then anyone should know the depths this company has been lowered to, in terms of PR / FUD.

    In terms of single card / single GPU, I would love ATI to be competitive in terms of raw performance and effective as well, but that is not going to happe, and thus this can never be a "R300 Deja-Vu" or even something like it.

    ATI may well have the performance vs cost, in some markets, but in terms of raw effective performance, I can't see how nVIDIA can't win with the GTX 280.

    If anyone wonders, then yes, only the best in terms of performance per GPU chip matters to me, no matter the cost. The GTX 280 bought from the US with tax, is about %60 the cost of each of my current Ultras...

    That is my personal view and wants, but I respect and accept others see it differently

  23. #73
    Xtreme Addict
    Join Date
    May 2007
    Posts
    2,125
    they call them stream processors anyways, so its not like they're using the same term

    And 800 SP's is the only way 1TFlop works for the 4850 at those clocks

    And while I want the best performing GPU because i run at 2560x1600, only Crysis is limited at all at this moment

  24. #74
    I am Xtreme zanzabar's Avatar
    Join Date
    Jul 2007
    Location
    SF bay area, CA
    Posts
    15,871
    when u talk about the shaders though NV isnt always doing 2 operation, and its more like hyper threading, while ati is always doing the 5 per physical unit, i always thought that ati has something equivalent to a multi core shader like a cell, and when i asked their rep at "heroes happen" he also said like a cell (thats were i got that from) so i thought that it didnt matter how they got there so long as they all were working at the same time

    u wouldent say that a p4 HT had 2 cores but u would say that a 6 stream cell has 6 cores

    and NV says SP = shader processor, ati and MS says SP = stream processor
    5930k, R5E, samsung 8GBx4 d-die, vega 56, wd gold 8TB, wd 4TB red, 2TB raid1 wd blue 5400
    samsung 840 evo 500GB, HP EX 1TB NVME , CM690II, swiftech h220, corsair 750hxi

  25. #75
    Xtreme Member
    Join Date
    May 2007
    Posts
    341
    Quote Originally Posted by zerazax View Post
    And while I want the best performing GPU because i run at 2560x1600, only Crysis is limited at all at this moment
    How does Supreme Commander and Call of Juarez work for you at that resolution, with HQ settings (in game maxed + 4 * MSAA / 16 * AF)...

    Not that I really disagree with your point. Aside from those 3 games, I have wonder why I would want to upgrade my two XFX 8800 Ultra XXX 685M.

    Still, I want more... and the new offerings from NVIDIA and ATI + Intel's Nehalem, is the closest alternative for more than a year to what I have now.

Page 3 of 7 FirstFirst 123456 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •