Page 16 of 29 FirstFirst ... 61314151617181926 ... LastLast
Results 376 to 400 of 713

Thread: K10 Scores starting to surface

  1. #376
    Xtreme Member
    Join Date
    Dec 2006
    Posts
    139
    Also has anybody noticed how small the page file was during these benchmarks?
    http://forum.coolaler.com/showthread...161127&page=12

  2. #377
    V3 Xeons coming soon!
    Join Date
    Nov 2005
    Location
    New Hampshire
    Posts
    36,363
    Almost puts on Mod hat but decides not to..yet:
    I can see this heading downhill in a hurry..
    Crunch with us, the XS WCG team
    The XS WCG team needs your support.
    A good project with good goals.
    Come join us,get that warm fuzzy feeling that you've done something good for mankind.

    Quote Originally Posted by Frisch View Post
    If you have lost faith in humanity, then hold a newborn in your hands.

  3. #378
    Xtreme Mentor
    Join Date
    Apr 2007
    Location
    Idaho
    Posts
    3,200
    Quote Originally Posted by Movieman View Post
    Almost puts on Mod hat but decides not to..yet:
    I can see this heading downhill in a hurry..
    All the K10 threads go downhill in a hurry
    "To exist in this vast universe for a speck of time is the great gift of life. Our tiny sliver of time is our gift of life. It is our only life. The universe will go on, indifferent to our brief existence, but while we are here we touch not just part of that vastness, but also the lives around us. Life is the gift each of us has been given. Each life is our own and no one else's. It is precious beyond all counting. It is the greatest value we have. Cherish it for what it truly is."

  4. #379
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by mstp2009 View Post
    Yeah, AMD isn't a souce.
    First it's not a souce for sure .
    Second AMD never ever mentioned the L3 cache latency in your linked document,so no,they did not say what the value is in that document.
    Like i said,no source on your side that disputes my link.

  5. #380
    Xtreme Enthusiast
    Join Date
    Apr 2007
    Posts
    772
    Quote Originally Posted by informal View Post
    First it's not a souce for sure .
    Second AMD never ever mentioned the L3 cache latency in your linked document,so no,they did not say what the value is in that document.
    Like i said,no source on your side that disputes my link.
    And your link could be written by a 12 yo expressing his OPINION.

    MY POINT was that AMD themselves have NOT specified that latency is variable.


    Since variable latency would be a significant deviation from current CPU design specs (with questionable if any benefit), the assumption should be made that the L3 cache is a fixed latency (i.e. number of cycles).


    But in AMD Fanboi land, anything is possible.

    How's that REVERSE Hyperthreading coming along?

  6. #381
    Banned
    Join Date
    Aug 2007
    Posts
    1,014
    Quote Originally Posted by informal View Post
    First it's not a souce for sure .
    Second AMD never ever mentioned the L3 cache latency in your linked document,so no,they did not say what the value is in that document.
    Like i said,no source on your side that disputes my link.
    Is it me or did you missed an essential part of his sentence?

    "We can see what AMD might have done in K10's L3 cache."

    he doesn't claim anything...

  7. #382
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by mstp2009 View Post
    And your link could be written by a 12 yo expressing his OPINION.

    MY POINT was that AMD themselves have NOT specified that latency is variable.


    Since variable latency would be a significant deviation from current CPU design specs (with questionable if any benefit), the assumption should be made that the L3 cache is a fixed latency (i.e. number of cycles).


    But in AMD Fanboi land, anything is possible.

    How's that REVERSE Hyperthreading coming along?
    So the techarp quote is not good enough??You still think it was 12 yo kid who wrote it(and not the 12 yo who is disagreeing with it)??

    Let's see,would one man named David Kanter,a little older than the supposed techarp kid of 12 years,be a good enough source for you??

    Start rolling on that floor and laughing at yourself,as of now:

    http://www.realworldtech.com/page.cf...1607033728&p=7

    Quote Originally Posted by David Kanter
    A round-robin algorithm is used to give access to one of the four cores each cycle. The latency to the L3 cache has not been disclosed, but it depends on the relative northbridge and core frequencies – for reasons which we will see later.
    I made it in red in case you can't see it from that big avatar...

    In gamers terms:pwnd.

    Edit:
    even more pwng coming along:

    From a circuit level perspective, the changes between the K8 and Barcelona were extremely significant. Barcelona is specified to operate at a wide range of voltages, from 0.8-1.4V. However, unlike its predecessor, each core in Barcelona has a dedicated clock distribution system (including PLL) and power grid. The frequency for each core is independent of both the other cores, and the various non-core regions; the voltage for all four cores is shared, but separate from the non-core. As a result, power can be aggressively managed by lowering frequency and voltage whenever possible. To support independent clocking and modular design, asynchronous dynamic FIFO buffers are used to communicate between different cores and the northbridge/L3 cache. These FIFOs absorb any global skew or clock rate variation, but the latency for passing through depends on the skew and frequency variance – which is why the L3 cache latency is variable. The northbridge and L3 cache compose roughly 20% of the die and share a voltage and clock domain that is independent of the four cores, which is essential for mobile applications. Previously, the northbridge clock and voltage was tied to the processors, so systems with integrated graphics could not reduce the processor voltage or frequency to deep power saving states. Separate sleep states, voltages and frequencies for the northbridge and processors should lower AMD’s average power dissipation which will help in the mobile market.
    Last edited by informal; 09-01-2007 at 01:14 PM.

  8. #383
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by BeardyMan View Post
    Is it me or did you missed an essential part of his sentence?

    "We can see what AMD might have done in K10's L3 cache."

    he doesn't claim anything...
    I think you quoted the wrong man,since i did gave the link for that patent...

  9. #384
    Xtreme Enthusiast
    Join Date
    Apr 2007
    Posts
    772
    Quote Originally Posted by informal View Post
    So the techarp quote is not good enough??You still think it was 12 yo kid who wrote it(and not the 12 yo who is disagreeing with it)??

    Let's see,would one man named David Kanter,a little older than the supposed techarp kid of 12 years,be a good enough source for you??

    Start rolling on that floor and laughing at yourself,as of now:

    http://www.realworldtech.com/page.cf...1607033728&p=7



    I made it in red in case you can't see it from that big avatar...

    In gamers terms:pwnd.
    WTF? That's not pwnage you n00b.

    It said "The latency to the L3 cache has not been disclosed . . .".

    "but it depends on the relative northbridge and core frequencies" simply means THEY DON'T KNOW. Not that it is variable.

    Go back and pass your GMAT and then we will talk. Reading comprehension seems to be a problem for you.


    We're still waiting on the glorious Reverse Hyperthreading, BTW.

  10. #385
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by mstp2009 View Post
    WTF? That's not pwnage you n00b.

    It said "The latency to the L3 cache has not been disclosed . . .".

    "but it depends on the relative northbridge and core frequencies" simply means THEY DON'T KNOW. Not that it is variable.

    Go back and pass your GMAT and then we will talk. Reading comprehension seems to be a problem for you.


    We're still waiting on the glorious Reverse Hyperthreading, BTW.
    LMAO,you really are cluless,aren't you.And having a nerve to call me a "noob".Sure thing.
    Go ahead genius,read the next page:
    From a circuit level perspective, the changes between the K8 and Barcelona were extremely significant. Barcelona is specified to operate at a wide range of voltages, from 0.8-1.4V. However, unlike its predecessor, each core in Barcelona has a dedicated clock distribution system (including PLL) and power grid. The frequency for each core is independent of both the other cores, and the various non-core regions; the voltage for all four cores is shared, but separate from the non-core. As a result, power can be aggressively managed by lowering frequency and voltage whenever possible. To support independent clocking and modular design, asynchronous dynamic FIFO buffers are used to communicate between different cores and the northbridge/L3 cache. These FIFOs absorb any global skew or clock rate variation, but the latency for passing through depends on the skew and frequency variance – which is why the L3 cache latency is variable. The northbridge and L3 cache compose roughly 20% of the die and share a voltage and clock domain that is independent of the four cores, which is essential for mobile applications. Previously, the northbridge clock and voltage was tied to the processors, so systems with integrated graphics could not reduce the processor voltage or frequency to deep power saving states. Separate sleep states, voltages and frequencies for the northbridge and processors should lower AMD’s average power dissipation which will help in the mobile market.
    Pwnd!

  11. #386
    Xtreme Enthusiast
    Join Date
    Apr 2007
    Posts
    772
    Quote Originally Posted by informal View Post
    And having a nerve to call me a "noob".Sure thing.
    If the shoe fits . . .





    You know the one thing you accomplished here - to really show off your interpersonal skills.

    You are almost as annoying as a Mac user. Almost.


    Reminds me of an argument I saw you have last year about the SSE units of K10.

    You swore up and down that they were double in number and not bit size (64-128). You got pwned there and still would never admit you were wrong.

  12. #387
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    Quote Originally Posted by mstp2009 View Post
    If the shoe fits . . .





    You know the one thing you accomplished here - to really show off your interpersonal skills.

    You are almost as annoying as a Mac user. Almost.
    And you still don't have the guts to accept the fact that you were wrong!And when you can't counter any more you start using name calling ..Really shows a lot about your character,doesn't it?
    Talk about annoying,when i argue with a man that accept no facts unless they fit in his prearranged view.

    PS Aren't you a guy who got banned at AMDzone for heavy trolling every thread?Was there another forum[H]?
    PPS Did you gave a proof for 3Ghz 25W Penryns for laptops?Oh wait,it is the same "God" analogy coming again,no?
    Last edited by informal; 09-01-2007 at 01:27 PM.

  13. #388
    Xtreme Addict
    Join Date
    Apr 2005
    Location
    Wales, UK
    Posts
    1,195
    Why can't everyone just wait the week or two left? Then a lot of arguments can be settled...

  14. #389
    Banned
    Join Date
    Aug 2007
    Posts
    1,014
    Quote Originally Posted by onewingedangel View Post
    Why can't everyone just wait the week or two left? Then a lot of arguments can be settled...
    Indeed, many are selling the skin of the bear before it's even nailed down by a nice winchester

  15. #390
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by informal View Post
    LMAO,you really are cluless,aren't you.And having a nerve to call me a "noob".Sure thing.
    Go ahead genius,read the next page:
    Pwnd!
    Informal, becareful and read Kanter's note carefully, this is beginning to focus up .... it was a good link... but you may be misinterpreting what Kanter is saying....

    I am, myself, trying to understand at a detail that makes sense, this is much more complicated than what we are assuming....

    Each core (that is execution core and the dedicated cache) will be clocked independently, a major power saving feature of K10. So in order to share a cache at L3 level, it will need to send data asynchronously to differently clocked cores... wow, this is complicated.... so what AMD has done (per Kanter) is build a 'translator', or a FIFO buffer to send data to and from the L3 -- this is not the same as dynamically adjusting L3 clock or latency, what it is doing is dynamically adjusting a clock divider to synch L3 with variable speed cores, now this variable L3 latency makes much much more sense.

    Any asynchronous communication will incur extra latency (over a simple 1:1) simply as a result of clock mismatch ... this is a given ... (this is why C2D shows a dip in performance in DDR2-533 to DDR2-667 to DDR2-800 as dividers beyond 1:1 introduce extra latency).

    So with this understanding, the observed latency (which is actually the important part) will be variable, not because L3 cache has variable latency but because it has to be sychronized through the FIFO buffers to cores of variable clocks.

    Damn should have paid more attention to Kanter's article too....

    Guys I am learning a lot here... thanks.

    Jack

  16. #391
    Xtreme Enthusiast
    Join Date
    Apr 2007
    Posts
    772
    Quote Originally Posted by informal View Post
    PS Aren't you a guy who got banned at AMDzone for heavy trolling every thread?Was there another forum[H]?

    Nope. Never graced that forum.

    PPS Did you gave a proof for 3Ghz 25W Penryns for laptops?Oh wait,it is the same "God" analogy coming again,no?
    My statement was that they were possible w/ 45nm high-K. 35W 2.8GHz ones were listed. And they weren't even low-voltage.

    Pwned AGAIN.

  17. #392
    I am Xtreme
    Join Date
    Jul 2004
    Location
    Little Rock
    Posts
    7,204
    Quote Originally Posted by Zytek_Fan View Post
    All the K10 threads go downhill in a hurry
    Can't blame it on me LOL!
    Quote Originally Posted by Movieman
    With the two approaches to "how" to design a processor WE are the lucky ones as we get to choose what is important to us as individuals.
    For that we should thank BOTH (AMD and Intel) companies!


    Posted by duploxxx
    I am sure JF is relaxed and smiling these days with there intended launch schedule. SNB Xeon servers on the other hand....
    Posted by gallag
    there yo go bringing intel into a amd thread again lol, if that was someone droping a dig at amd you would be crying like a girl.
    qft!

  18. #393
    V3 Xeons coming soon!
    Join Date
    Nov 2005
    Location
    New Hampshire
    Posts
    36,363
    moving this thread to the AMD section per several requests..
    Crunch with us, the XS WCG team
    The XS WCG team needs your support.
    A good project with good goals.
    Come join us,get that warm fuzzy feeling that you've done something good for mankind.

    Quote Originally Posted by Frisch View Post
    If you have lost faith in humanity, then hold a newborn in your hands.

  19. #394
    Xtreme Enthusiast
    Join Date
    Apr 2007
    Posts
    772
    Quote Originally Posted by JumpingJack View Post
    Informal, becareful and read Kanter's note carefully, this is beginning to focus up .... it was a good link... but you may be misinterpreting what Kanter is saying....

    I am, myself, trying to understand at a detail that makes sense, this is much more complicated than what we are assuming....

    Each core (that is execution core and the dedicated cache) will be clocked independently, a major power saving feature of K10. So in order to share a cache at L3 level, it will need to send data asynchronously to differently clocked cores... wow, this is complicated.... so what AMD has done (per Kanter) is build a 'translator', or a FIFO buffer to send data to and from the L3 -- this is not the same as dynamically adjusting L3 clock or latency, what it is doing is dynamically adjusting a clock divider to synch L3 with variable speed cores, now this variable L3 latency makes much much more sense.

    Any asynchronous communication will incur extra latency (over a simple 1:1) simply as a result of clock mismatch ... this is a given ... (this is why C2D shows a dip in performance in DDR2-533 to DDR2-667 to DDR2-800 as dividers beyond 1:1 introduce extra latency).

    So with this understanding, the observed latency (which is actually the important part) will be variable, not because L3 cache has variable latency but because it has to be sychronized through the FIFO buffers to cores of variable clocks.

    Damn should have paid more attention to Kanter's article too....

    Guys I am learning a lot here... thanks.

    Jack
    Thank you Jack.

    So absolutely latency - the transfer from FIFO buffer to L3 is constant, it would just be the fill time of the FIFO buffer that is variable b/c it has async communication with each of the 4 cores (unless they are at full speed, one would assume).

    The observed latency would be:

    Core to FIFO Buf latency + FIFO Buf to L3 latency

    Correct?

  20. #395
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by onewingedangel View Post
    Why can't everyone just wait the week or two left? Then a lot of arguments can be settled...
    Nahhhh, we would get bored.

  21. #396
    Banned
    Join Date
    Aug 2007
    Posts
    1,014
    Quote Originally Posted by informal View Post
    I think you quoted the wrong man,since i did gave the link for that patent...
    I know, you may slap me for that

  22. #397
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by mstp2009 View Post
    Thank you Jack.

    So absolutely latency - the transfer from FIFO buffer to L3 is constant, it would just be the fill time of the FIFO buffer that is variable b/c it has async communication with each of the 4 cores (unless they are at full speed, one would assume).

    The observed latency would be:

    Core to FIFO Buf latency + FIFO Buf to L3 latency

    Correct?
    Yeah, and I cannot believe I was so ignorant not to think this through. The L3 cache is shared, but each core is throttled independently. The overall intrinsic latency of the L3 will be like any cache, it is fixed by the size and quality of the process technology as well as the speed paths set at design.

    However, since each core will throttle depending on load, the clock for each core can be different than L3... necessitating an asynchronous bus such that each core can still access the data....

    Now, simple asynchronous communications will always have variable latency (as a function of the ratio or divider) as one agent will need to wait on the other at some point. Example, say you have a divider 6:5 let's call that agent a and b, so it is 6:5 A:B, to make this easy lets say 1 bit line so in 5 clock tickes it will send 5 bits for agent B, but agent A has put 6 clock ticks into the queue, one cycle will be left hanging until the next revolution around.... temporally this whould make no difference3, but agent A is only as fast as agent B.....

    But you also have to add into the mix the physical latency of the circuit to do this work.... it is a trade off, one that AMD obviously believes is better in the long run... so long as L3 'observed' latency is much less than that to main memory, there is a benefit.
    Last edited by JumpingJack; 09-01-2007 at 01:48 PM.

  23. #398
    Xtreme Cruncher
    Join Date
    Jun 2006
    Posts
    6,215
    However, since each core will throttle depending on load, the clock for each core will be different than L3... necessitating an asynchronous bus such that each core can still access the data....

    Now, simply asynchronous communications will always have variable latency as one agent will need to wait on the other at some point. Example, say you have a divider 6:5 let's call that agent a and b, so it is 6:5 A:B, to make this easy lets say 1 bit line so in 5 clock tickes it will send 5 bits for agent B, but agent A has put 6 clock ticks into the queue, one cycle will be left hanging until the next revolution around.... temporally this whould make no difference3, but agent A is only as fast as agent B.....
    Yes Jack,this is a simplified reason why it occurs.
    I thought you saw Kanter's article long ago(since it is online since the middle of May i think)

  24. #399
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by informal View Post
    Yes Jack,this is a simplified reason why it occurs.
    I thought you saw Kanter's article long ago(since it is online since the middle of May i think)
    I did, and I read it to... it just did not register enough to turn on lights as this topic has evolved.

  25. #400
    Xtreme Mentor
    Join Date
    Nov 2005
    Location
    Devon
    Posts
    3,437
    Quote Originally Posted by JumpingJack View Post
    Informal, becareful and read Kanter's note carefully, this is beginning to focus up .... it was a good link... but you may be misinterpreting what Kanter is saying....

    I am, myself, trying to understand at a detail that makes sense, this is much more complicated than what we are assuming....

    Each core (that is execution core and the dedicated cache) will be clocked independently, a major power saving feature of K10. So in order to share a cache at L3 level, it will need to send data asynchronously to differently clocked cores... wow, this is complicated.... so what AMD has done (per Kanter) is build a 'translator', or a FIFO buffer to send data to and from the L3 -- this is not the same as dynamically adjusting L3 clock or latency, what it is doing is dynamically adjusting a clock divider to synch L3 with variable speed cores, now this variable L3 latency makes much much more sense.

    Any asynchronous communication will incur extra latency (over a simple 1:1) simply as a result of clock mismatch ... this is a given ... (this is why C2D shows a dip in performance in DDR2-533 to DDR2-667 to DDR2-800 as dividers beyond 1:1 introduce extra latency).

    So with this understanding, the observed latency (which is actually the important part) will be variable, not because L3 cache has variable latency but because it has to be sychronized through the FIFO buffers to cores of variable clocks.

    Damn should have paid more attention to Kanter's article too....

    Guys I am learning a lot here... thanks.

    Jack
    This is not entirely true...

    I read somewhere long time ago that L3 in K10 is acting more like memory layer. In other words it is clocked by IMC independently from all 4 cores and on diagram I would put it after CrossBar...
    That's why L3 latency can vary from core point of view (cache latency itself is probably constant). It is similar to how DDR2-800 latency (again from CPU point of view) is different compared to DDR2-667 (same timings of course ).


    Edit: JumpingJack you typing too fast I barely read page 16 and typed my response and here surprise! another page with new info making my post partially obsolete
    Last edited by Lightman; 09-01-2007 at 02:11 PM.
    RiG1: Ryzen 7 1700 @4.0GHz 1.39V, Asus X370 Prime, G.Skill RipJaws 2x8GB 3200MHz CL14 Samsung B-die, TuL Vega 56 Stock, Samsung SS805 100GB SLC SDD (OS Drive) + 512GB Evo 850 SSD (2nd OS Drive) + 3TB Seagate + 1TB Seagate, BeQuiet PowerZone 1000W

    RiG2: HTPC AMD A10-7850K APU, 2x8GB Kingstone HyperX 2400C12, AsRock FM2A88M Extreme4+, 128GB SSD + 640GB Samsung 7200, LG Blu-ray Recorder, Thermaltake BACH, Hiper 4M880 880W PSU

    SmartPhone Samsung Galaxy S7 EDGE
    XBONE paired with 55'' Samsung LED 3D TV

Page 16 of 29 FirstFirst ... 61314151617181926 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •