Results 1 to 25 of 66

Thread: Horrible ati yeilds (3%!!)

Hybrid View

  1. #1
    Xtreme Addict
    Join Date
    Jan 2005
    Location
    Grand Forks, ND (Yah sure, you betcha)
    Posts
    1,266
    Word, so did I. Here we were expecting 90nm to bring super high clocks from ATi, but 32pp and a high clock (the best of both worlds) may have been asking too much.

    In theory, if it was MADE to be 16pp, the room to clock to 700mhz would be there, but with a chip made to have 32pp, couldn't that hamper it's ability to have a high clock? This makes sense to me as when nvidia moved to 110nm you'd think the chip would clock a good deal better than the 130nm gf6, but it doesn't, just a little. It makes me think this is hampered by the addition of pipes. I'm not an engineer though, so don't take that as fact.

    Also, obviously ATi is having yield problems. Some chips probably do clock really well, but they have to set a standard somewhere so they can get an acceptable amount of chips out to sell. Maybe it's 3% that can do 700mhz with 24pipes? Maybe 700mhz @ 16pp was possible with 16 quads not working at an acceptable, but they figured 500 @ 24pp was a optimal design that also had an acceptable yield? That would make sense then they'd release it at 500mhz if a greater yield could operate at that clock with 24pipes, and still beat G70. Who knows what they did and why they did it, maybe those 3% will become the xtpesupermegadeluxeultra's to beat the G70 ultra. That would also give you a clue how high the R520's will clock (If 3% hit target speeds...say 600/700/800...we'll obviously hit a wall in our overclocking much below that under normal cooling...but who knows what that target was...

    It still looks to me though as the refresh will look more exciting.

    ATi R580 - 32pp, clock prolly similar to R520 but really anyone's guess, more TMUs? (500x32?)
    Nvidia G71 - 90nm G70...so 24pp, prolly a super high clock. (24x650?)
    ^^anyone's guess on clocks

    That'll be a fun one...
    Last edited by turtle; 07-24-2005 at 06:12 AM.
    That is all.

    Peace and love.

  2. #2
    Xtreme Addict
    Join Date
    Apr 2004
    Posts
    1,640
    Yeah, more pipes would definitely limit the clockspeed. Think of it this way, more pipes means more transistors. More transistors means more heat, and power consumption, plus the signal integrity suffers (communication via wires between transistors). I mean if you've ever overclocked before you know that heat can be a real obstacle when you're trying to get up there in clockspeed.

    What I find funny is that people who buy the RV530 are gonna have a chip with 2/3 of the pipelines that the R520 has, less transistors, while being built on the same process. Now see, last gen the X700 series had only half as many pipes as the X8x0, but they were built on a less mature process, 110nm. As the X800XL proved, this 110nm wasn't as efficient in clocking as the refined 130nm low-k process. This time around mainstream users are going to get all the clocking benefits of the highend card, without the added heat. I mean the R520 doesn't just have 16 or 24 pipes creating heat, it has all 32 contributing thermal waste, whether they're all enabled or not. People who buy the RV530 are going to be able to clock MUCH higher than the R520 can achieve, and get very similar (though not quite as much, as you observed turtle) performance, for like half the price.

    NVIDIA will probably make the G72 (7600 series) on a 90nm process. I'm eager to see if this line has 16 pipes or not. Most likely 12, I'm guessing.

    BTW, what are TMUs?
    DFI LANParty DK 790FX-B
    Phenom II X4 955 BE (1003GPMW) @ 3.8GHz (19x200) w/1.36v
    -cooling: Scythe Mugen 2 + AC MX-2
    XFX ATI Radeon HD 5870 1024MB
    8GB PC2-6400 G.Skill @ 800MHz (1:2) 5-5-5-15 w/1.8v
    Seagate 1TB 7200.11 Barracuda
    Corsair HX620W


    Support PC gaming. Don't pirate games.

  3. #3
    Xtreme Addict
    Join Date
    Jan 2005
    Location
    Grand Forks, ND (Yah sure, you betcha)
    Posts
    1,266
    Quote Originally Posted by Cybercat
    Yeah, more pipes would definitely limit the clockspeed. Think of it this way, more pipes means more transistors. More transistors means more heat, and power consumption, plus the signal integrity suffers (communication via wires between transistors). I mean if you've ever overclocked before you know that heat can be a real obstacle when you're trying to get up there in clockspeed.

    What I find funny is that people who buy the RV530 are gonna have a chip with 2/3 of the pipelines that the R520 has, less transistors, while being built on the same process. Now see, last gen the X700 series had only half as many pipes as the X8x0, but they were built on a less mature process, 110nm. As the X800XL proved, this 110nm wasn't as efficient in clocking as the refined 130nm low-k process. This time around mainstream users are going to get all the clocking benefits of the highend card, without the added heat. I mean the R520 doesn't just have 16 or 24 pipes creating heat, it has all 32 contributing thermal waste, whether they're all enabled or not. People who buy the RV530 are going to be able to clock MUCH higher than the R520 can achieve, and get very similar (though not quite as much, as you observed turtle) performance, for like half the price.

    NVIDIA will probably make the G72 (7600 series) on a 90nm process. I'm eager to see if this line has 16 pipes or not. Most likely 12, I'm guessing.

    BTW, what are TMUs?

    1. Your thing on transistors, heat, ect...Exactly what I was thinking. Makes perfect sense.

    2. 2/3 the pipes, 16 broken pipes, 16 working pipes, same amount of transistors (RV530 sounds like it'll be a R520 with 4 broken quads, 2 more broken quads than R520, which already has 2 because it was meant to be 32pp...that's what i've gathered anyway...IF THIS WHOLE THING IS TRUE) who knows how well they'll clock. True, they may clock much better because only 4 quads have to hit that number instead of 6...who knows? Yes, it being on 90nm, just like 520 helps it, but there still will be all the "waste", at least the way I understand it at the moment...there will actually be more, % wise. My guess is RV530 will be a great mainstream part and compete against 7800gt well, and you'll pay more than the last gen high-end mainstream card. It'll prolly be $100 less than R520 or something.

    3. No idea about 7600gt. Hope it kicks ass. Everyone that only had $150-$200 to spend on a gfx card last gen really got a good deal on the 6600gt, hope they keep up that tradition.

    4. Texture Mapping Units.
    That is all.

    Peace and love.

  4. #4
    Xtreme Addict
    Join Date
    Jan 2005
    Location
    Grand Forks, ND (Yah sure, you betcha)
    Posts
    1,266
    Alexio, you're right. In SOME instances one will be faster, and others it'll be reversed, won't it? I would think in some instances a greater fillrate would be to much an advantage, as some benchies/games R420 beats up on the NV40 rather well...like HL2 and Far Cry for example. I imagine those use the higher fill rate? Granted there are times it doesn't help it, and nv40 beats up on R420.

    Admittedly, I don't know a lot about greater mhz vs more pipes. I just know the fillrate would be higher, and it seemed logical as a possible reason why they'd do it. Couldn't ATi's pipe architecture be more efficient than nvidia's when doing this comparison, especially in this upcoming generation? That would be one reason to do with 24pp@500 rather than 16@700 right?

    Also 200x16 = 3200, 400x8 = 3200. They're equal. The later is more efficient, but would it be enough to make up the differance in fillrate assuming the pipe architectures are similar (11,200 as opposed to 12,000). Feel free to enlighten me...I'm intrigued.
    That is all.

    Peace and love.

  5. #5
    Xtreme Addict
    Join Date
    Jan 2005
    Location
    Grand Forks, ND (Yah sure, you betcha)
    Posts
    1,266
    Well, I assume RV530 is the pro model...and it's supposedly confirmed that it's 16pp...who knows if it's a different chip or just just a failed R520. Considering the yields though, it'd make sense they'd use failed R520's to make the lower-end chips, as otherwise (according to the yield rumours) they'd be wasting a WHOOOLE lot of money on those wafers.

    And remember, 7800GT's are prolly just gtx's with a failed quad and they're using them to use the failed yields of the gtx, and they will probably overclock pretty well...I mean, look at how close the core clockspeed is to the GTX...and even if the mem is rated at 1100, it probably uses the same stuff as GTX, so you can expect a similar clock there as well. I truely only think the 20p vs 24 pipes is going to make a 2-3% differance anyway...(based on a 5% differance of a GTX vs 6800GT/U at same clocks..16pp vs 24pp) so assuming the GT can hit similar clocks to gtx, it'll be a good deal just like the 6800gt was. Who's to say RV530 couldn't do the same, it'd especially be true under Alexio's reasoning. I do think that ATi's pipes are better than nvidia's though, and it might make a bigger differance than 5%. I imagine RV530 will clock overall just slightly better than a x850xt or something, with the same amount of pipes, but probably better pipes then that previous gen. I expect RV530 and 7800gt to fight tooth and nail, but who knows...There will obviously have to be a winner. We'll know when we see some benchies for it...We've already seen 7800gt benchies, and they're smack between 68U and 78GTX, I expect RV530 to be the same with x850xt and R520...so again, it'll be close, might come down to price and availability.

    But yes, things are shakey. We should know in the next month or so though...
    Last edited by turtle; 07-24-2005 at 07:07 AM.
    That is all.

    Peace and love.

  6. #6
    The Blue Dolphin
    Join Date
    Nov 2004
    Location
    The Netherlands
    Posts
    2,816
    Quote Originally Posted by turtle
    Alexio, you're right. In SOME instances one will be faster, and others it'll be reversed, won't it? I would think in some instances a greater fillrate would be to much an advantage, as some benchies/games R420 beats up on the NV40 rather well...like HL2 and Far Cry for example. I imagine those use the higher fill rate? Granted there are times it doesn't help it, and nv40 beats up on R420.

    Admittedly, I don't know a lot about greater mhz vs more pipes. I just know the fillrate would be higher, and it seemed logical as a possible reason why they'd do it. Couldn't ATi's pipe architecture be more efficient than nvidia's when doing this comparison, especially in this upcoming generation? That would be one reason to do with 24pp@500 rather than 16@700 right?

    Also 200x16 = 3200, 400x8 = 3200. They're equal. The later is more efficient, but would it be enough to make up the differance in fillrate assuming the pipe architectures are similar (11,200 as opposed to 12,000). Feel free to enlighten me...I'm intrigued.
    It's all about optimizing really. You can optimize a game to be the fastest at 1 pipeline but also for 24 pipelines. Todays games are made to also run on slower cards with less pipelines. This is the part where the unified shader model becomes interesting for high-end cards.

    On a normal fixed 16 pipes 6 vertexes card there are one or more pipes and/or vertexes doing nothing, waithing for a new task. With the unified shadermodel every task can be handled by all the pipes and vertexes; that way there will never be a scenario where for example the vertexshaders are working there asses of while the pixel pipelines are hardly doing anything, or the opposite ofcourse.

    A combination of different game executables for different types of hardware (graphicscards, cpu's, amount of RAM) and the unified shadermodel will make games run much better. For example in a game all physics are being handled by the cpu, this is fixed. If you can make an executable that gives this task to the graphicscard when the cpu has to much tasks to handle while the gpu has to little than you can achieve much better performance in the game. The first step will ofcourse be making games multithreaded for use with multi core cpu's. In the beginning this might be fixed, than for example the A.I in the game is being handled by the second core (only). Further on in the development of games this process can be made dynamic, just like the unified shadermodel.
    Blue Dolphin Reviews & Guides

    Blue Reviews:
    Gigabyte G-Power PRO CPU cooler
    Vantec Nexstar 3.5" external HDD enclosure
    Gigabyte Poseidon 310 case


    Blue Guides:
    Fixing a GFX BIOS checksum yourself


    98% of the internet population has a Myspace. If you're part of the 2% that isn't an emo bastard, copy and paste this into your sig.

  7. #7
    Xtreme Addict
    Join Date
    Apr 2004
    Posts
    1,640
    Alexio got me interested in doing some runs in 3DMark05 to see how the NV40 architecture scales with clockspeed. He said he scored higher at 400x8 than at 200x16. Well I've got a 6600 so I don't have the luxury of of 16 pipes but I can go lower with higher clockspeeds.

    So at 200MHz with all 8 pipes enabled, this is what I got:


    At 560MHz (my max core OC) with 4 pipes, this is what I got:


    I was pretty surprised with the results. Having higher clockspeeds versus more pipes gives about a 19% overall improvement. Here's how they do in each game.

    Return to Proxycon: 7%
    Firefly Forest: 28%
    Canyon Flight: 23%

    It could have something to do with the architecture, or it could have something to do with software optimizations like Alexio said. Can't know for sure. But this is very interesting, and it could mean that lower end cards with 16 pipes at higher clockspeeds could perform the same or better than a 24-pipe card at lower clockspeeds (not taking into account other differences and dis/advantages).
    DFI LANParty DK 790FX-B
    Phenom II X4 955 BE (1003GPMW) @ 3.8GHz (19x200) w/1.36v
    -cooling: Scythe Mugen 2 + AC MX-2
    XFX ATI Radeon HD 5870 1024MB
    8GB PC2-6400 G.Skill @ 800MHz (1:2) 5-5-5-15 w/1.8v
    Seagate 1TB 7200.11 Barracuda
    Corsair HX620W


    Support PC gaming. Don't pirate games.

  8. #8
    Xtreme 3DTeam Member
    Join Date
    Jan 2005
    Location
    AMD factory
    Posts
    2,288
    Grrrrrrrr........., me wants a X900XT PE with 32 pipes 0.09...

    Previous system:


    DFI NF4 ULTRA 0453A3 KOREA CHIPSET / BIOS 510-2FIX / FX-57 0516WPMW@3.62GHZ / 2x256 CORSAIR 3200LLPT BH-5@13x278MHZ 2-2-2-5@3.69VDIMM / MACH II GT@MOD BY BERKUT / ACTIVE COOLING FOR RAM - MOSFETS - GPU RAM / CHIPSET & GPU CORE WATERCOOLED / OCZ POWERSTEAM 600W / BUILT BY ATI X850XT@660/651 - VGPU@1.73-VDD@2.26-VDDQ@2.21 PENCIL MOD / WIN XP 2x80GB SAMSUNG SPINPOINT SP80 SATA - RAID 0 & WIN 2K 40GB SAMSUNG SPINPOINT SP40 IDE BENCH DRIVE / PIC


    ----------------><------------------

  9. #9
    The Blue Dolphin
    Join Date
    Nov 2004
    Location
    The Netherlands
    Posts
    2,816
    Can you run at 200mhz 8 pipes and at 400mhz 4 pipes for a better comparation? The results a got when I tested 200mhz 16 pipes vs 400mhz 8 pipes was with a card having some problems so I'm not 100% sure that I got the right scores. I don't have 3dmark on my hdd right now so can you please test it for me?
    Blue Dolphin Reviews & Guides

    Blue Reviews:
    Gigabyte G-Power PRO CPU cooler
    Vantec Nexstar 3.5" external HDD enclosure
    Gigabyte Poseidon 310 case


    Blue Guides:
    Fixing a GFX BIOS checksum yourself


    98% of the internet population has a Myspace. If you're part of the 2% that isn't an emo bastard, copy and paste this into your sig.

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •