MMM
Results 1 to 17 of 17

Thread: is there a difference in the l2 cache latency between allendales and conroes?

  1. #1
    Xtreme Addict
    Join Date
    Oct 2004
    Posts
    1,838

    is there a difference in the l2 cache latency between allendales and conroes?

    the performance is just too close considering one has double the cache!
    the only explanation i can fathom is lower latency L2 cache on the allendales
    or i could be totally wrong....does anyone know?
    DFI P965-S/core 2 quad q6600@3.2ghz/4gb gskill ddr2 @ 800mhz cas 4/xfx gtx 260/ silverstone op650/thermaltake xaser 3 case/razer lachesis

  2. #2
    Xtreme Addict
    Join Date
    Jun 2005
    Location
    TAIWAN
    Posts
    1,257
    i test a abit with pi 1m, super pi 1m reflect cache , cache doesn't affect 32m



    3.6G 2M VS 3.2G 4M

    i think per clock/performance , L2 4m 20% faster than 2m

  3. #3
    Xtreme Addict
    Join Date
    Jun 2005
    Location
    TAIWAN
    Posts
    1,257
    super pi 32m and cpumark99 doesn't reflect cache issue

    E6400@3.2G (8*400) 32M & cpumark99


    X6800@3.2G(8*400) 32M & cpumark99

  4. #4
    I am Xtreme
    Join Date
    Nov 2002
    Location
    South FL, USA
    Posts
    4,892
    i read a review which showed an underclock 6800 at allendale speeds and then running benches between that and an actual Allendale cpu..there were only, the most, a 5-6% difference in bench performance and some were only 1-2%

  5. #5
    Xtreme Addict
    Join Date
    Jun 2005
    Location
    TAIWAN
    Posts
    1,257
    Quote Originally Posted by Ace-a-Rue
    i read a review which showed an underclock 6800 at allendale speeds and then running benches between that and an actual Allendale cpu..there were only, the most, a 5-6% difference in bench performance and some were only 1-2%
    really, can u show the link?

  6. #6

  7. #7
    Xtreme Addict
    Join Date
    Oct 2004
    Posts
    1,838
    yea, the conclusion they got was that there was barely any difference
    thats what started me thinking
    cause the situation now seems to parrallel when intel released the 2mb prescott and performed almost the same as the 1mb prescott
    DFI P965-S/core 2 quad q6600@3.2ghz/4gb gskill ddr2 @ 800mhz cas 4/xfx gtx 260/ silverstone op650/thermaltake xaser 3 case/razer lachesis

  8. #8
    I am Xtreme
    Join Date
    Nov 2002
    Location
    South FL, USA
    Posts
    4,892
    Quote Originally Posted by hicookie
    really, can u show the link?
    unfortunately, i did not bookmark the review....sorry!

    but, it is clear to me that L2 cache is important but the jump from 2 Mb to 4 Mb is probably not as significant as it looks on paper or feels in our mind.

  9. #9
    Registered User
    Join Date
    Jul 2006
    Posts
    13
    Have any websites looked at how the cache plays a role as the frequency increases? I'd like to see an X6800 with a 7x multiplier in comparison to a 6300 from stock to 2.8Ghz.

    Also, does anyone know the associativity of the cache on the 2mb vs. 4mb? Is it only 4-way on 2mb or is it 8-way for both?

  10. #10
    Xtreme Addict
    Join Date
    Oct 2004
    Posts
    1,838
    look at the link plaicd posted lol
    yea, the difference isnt as big as i think it should be, which leads me to believe the l2 cache on the allendale has a slightly lower latency than that of the conroe...it would be interesting to find out if this was true
    DFI P965-S/core 2 quad q6600@3.2ghz/4gb gskill ddr2 @ 800mhz cas 4/xfx gtx 260/ silverstone op650/thermaltake xaser 3 case/razer lachesis

  11. #11
    Registered User
    Join Date
    Jul 2006
    Posts
    13
    Quote Originally Posted by grimREEFER
    look at the link plaicd posted lol
    yea, the difference isnt as big as i think it should be, which leads me to believe the l2 cache on the allendale has a slightly lower latency than that of the conroe...it would be interesting to find out if this was true
    I looked at the techreport article earlier, but all they have is the overclocking page which does not give you any idea of how the cache size matters at higher clocks. What i'm saying is take an X6800, set the multiplier at 7. Then increase the fsb in increments along with an E6300 all the way upto a high clock such as 2.8GHz. Then compare how the cache influences the performance in various tasks. I believe multithreaded performance for encoding would be most interesting. So a line graph with performance deltas between the two processors. I'll look into other reasons for possible latency improvements on the 2mb. I believe that these are the same physical cores as the conroes with part of the cache disabled. The E4000 series will have a smaller reduced core. Even then I dont know if thats true because economically its cheaper to produce unused silicon and disable it than to make a separate mask and production process.

  12. #12
    Xtreme Member
    Join Date
    May 2005
    Posts
    392
    Quote Originally Posted by aashburns
    Have any websites looked at how the cache plays a role as the frequency increases? I'd like to see an X6800 with a 7x multiplier in comparison to a 6300 from stock to 2.8Ghz.

    Also, does anyone know the associativity of the cache on the 2mb vs. 4mb? Is it only 4-way on 2mb or is it 8-way for both?
    allendale L1 and L2 is 8-way.
    Intel Pentium G3258 | ASRock Z97M OC Formula | Samsung 2x4GB DDR3-1600 | Galaxy GTX660 GC | Samsung PM840 Pro 128GB, PM830 250GB FDE |

    "I Love To Camp!"
    "Comensing Shock Therapy!"

    HeatWare

  13. #13
    Xtreme Enthusiast
    Join Date
    Jul 2006
    Posts
    650
    I did a thread on this. Apparently for 3D graphics Intel did not short change us L2 cache on the allendales (unlike my celeron where its L2 deficient)

    As far as other applications go Anandtech reports anywhere from 3-10% improvement.

    However I really want to know how 4 mb L2 cache scales in the high 3.5+ GHz clock speeds? Does it make more difference or less under those conditions?

  14. #14
    Xtreme Member
    Join Date
    May 2005
    Posts
    392
    Quote Originally Posted by Phosphate
    I did a thread on this. Apparently for 3D graphics Intel did not short change us L2 cache on the allendales (unlike my celeron where its L2 deficient)

    As far as other applications go Anandtech reports anywhere from 3-10% improvement.

    However I really want to know how 4 mb L2 cache scales in the high 3.5+ GHz clock speeds? Does it make more difference or less under those conditions?
    if the clocks speeds are equal on both 2mb and 4mb the difference is still going to be the same as quoted above. if your running something like superpi for caculations all day long get the 4mb by all means.
    Intel Pentium G3258 | ASRock Z97M OC Formula | Samsung 2x4GB DDR3-1600 | Galaxy GTX660 GC | Samsung PM840 Pro 128GB, PM830 250GB FDE |

    "I Love To Camp!"
    "Comensing Shock Therapy!"

    HeatWare

  15. #15
    Registered User
    Join Date
    Jul 2006
    Posts
    13
    Quote Originally Posted by Evilsizer
    allendale L1 and L2 is 8-way.
    Since Conroe is 16-way and if Allendale is 8-way, then the associativity of the cache is lower. Hence, there will be a slight performance hit from an increase in the rate of cache misses. I can't think of any reason why the allendale cache would have less latency is this case since it is the same die as conroe.

    Quote Originally Posted by Evilsizer
    if the clocks speeds are equal on both 2mb and 4mb the difference is still going to be the same as quoted above. if your running something like superpi for caculations all day long get the 4mb by all means.
    I suppose this depends on what you are doing. Remember that Memory Bandwidth does not increase proportionaly with the FSB (e.g. C2D can double its speed with careful overclocking, but memory cannot). Applications that frequently access more than 2MB data will definitely benefit from the larger cache. Not to mention at 3.5GHz the number of CPU cyles the C2D waits for memory (latency) is almost twice as much as a stock E6300. So the effectiveness of prefetchers is reduced as clock speeds increase. My guess is that Intel will include an on-die memory controller once the clock speeds reach a level at where prefetchers cannot hide the horrible latency of memory.
    Last edited by aashburns; 08-10-2006 at 11:52 PM.

  16. #16
    Xtreme Mentor
    Join Date
    Aug 2006
    Location
    HD0
    Posts
    2,646
    you have to realize that fullspeed cache is a lot easier to do without upping the latency on a lower clocked system than it is on a higherclocked netburst system. Atleast that is my guesse.

  17. #17
    I am Xtreme
    Join Date
    Nov 2002
    Location
    South FL, USA
    Posts
    4,892
    Quote Originally Posted by Ace-a-Rue
    i read a review which showed an underclock 6800 at allendale speeds and then running benches between that and an actual Allendale cpu..there were only, the most, a 5-6% difference in bench performance and some were only 1-2%
    Quote Originally Posted by hicookie
    really, can u show the link?
    i found the answer and i posted it here:

    http://www.xtremesystems.org/forums/...03#post1645603

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •