Page 18 of 29 FirstFirst ... 81516171819202128 ... LastLast
Results 426 to 450 of 713

Thread: K10 Scores starting to surface

  1. #426
    Xtreme Enthusiast
    Join Date
    Apr 2006
    Location
    Brasil
    Posts
    534
    At Barcelona's presentation it was said that the L3 cache runs at Core clock frequency, not at northbridges' (wich runs a little lower. i.e: on 2GHz Barcelona northbridge runs at 1.6GHz on a uniplane board, and at 1.8GHz on a dual plane board)

    But the L3 cache beeing a part of the northbridge does make sense, by the latency numbers we've seen and all the async stuff...
    Last edited by doompc; 09-02-2007 at 03:27 PM.

  2. #427
    Xtreme Mentor
    Join Date
    May 2007
    Posts
    2,792
    Let's put it in simple words as guys speaking techno will lose most other people and that way no one learns or benefits accurately, which is the whole purpose of sharing and explaining.

    I mentioned about the L3 cache many pages back. AMD outlined it at "less than 38 cycles". 38 cycles is it's slowest latency. L3 cache is just a victim cache. It is very good for improving latency of data transfer to the core which needs it.

    Usually, the data (retained in cache lines) evicted from L2 cache, has to be thrown back to memory (RAM). If needed again, it has to be re-fetched. This distance->latency is very large in comparison to on-die distance->latency.

    Thus, by including "another" on-die cache but this time to retain L2 evicted datum in, they can refill the L2 or mostly, send it directly to L1D cache very quickly and any core can access the data (shared cache) far quicker than when going to memory.

    The inclusion of this benefits AMD K10 very much similar to how it does for Core 2/Penryn with the large L2 cache, and that's why it will be increased proportionally with time as die size decreases in K10.

    IIRC L3 populates itself from L1D cache, L2 cache and also memory based on prediction algorithms too.

    K8 is not K10, so don't be using it's architectural features to undermine or imply those to the K10 - just because one doesn't know any accurate information or any better.

    L2 and L3 cache use 64B lines. L3 cache controller is variable and flexible to support 8MB.

    Also "Size and associativity of the AMD Family 10h processor L2 cache is implementation dependent. See the appropriate BIOS and Kernel Developer’s Guide for details."

    L3 cache in not inclusive nor orthodoxly exclusive but tweaked - cache line can be fetched into the L1D and still be retained in L3 for core sharing based on sharing history which is tracked.

    L3 doesn't evict using the same LRU algorithm the K8 used for L2, but evicts lines based on the LRU unshared cache line.

    Northbridge and core frequency somewhat determines L3 cache relative latency.

    L3 cache and IMC (northbridge) runs at independent speeds and voltages from all cores.

    "Furthermore, the cache features bandwidth-adaptive policies that optimize latency when requested bandwidth is low, but allows scaling to higher aggregate L3 bandwidth when required (such as in a multi-core environment)." http://www.amd.com/us-en/assets/cont...docs/40546.pdf

    "L3 cache latency will evidently be higher than L2 cache latency. However, AMD materials suggest that the latency will vary adaptively depending on the workload. If the workload isn’t too heavy, latency will improve, and under heavy workload the bandwidth will rise. We still have to check what really stands behind this." http://www.xbitlabs.com/articles/cpu...amd-k10_8.html

    You already know about HT and it's implementational advantages, exactly as why Intel is developing CSI for high bandwidth, and some of their CSI design team left over this: http://www.realworldtech.com/page.cf...05-05-2006#361

    HT link bandwidth is dependent on the controller implemented and increasing this becomes a real power hungry system, which is why processor architects will withhold to do so but in limited very high-end and future shrunk node levels.

    In all honesty, Barcelona front end and out-of-order engine is a very complex architecture.

    Each clock cycle fetches 32B of data from the L1 instruction cache into the predecode/pick buffer - K8 does 16B - Core 2 does 16B.

    The data travels on a bi-directional bus which is 256-bit wide - K8 has it at 128-bit - Core 2 is at 128-bit.

    The predecode and pick buffer maybe increased to 48B.

    Direct branch prediction is improved > the global history register tracks the last 12 entries - K8 one tracked 8.

    New indirect branch prediction of 512-entries.

    The FIFO stack including the return addresses of function calls 3, 11 and 15 is 24 entries - K8 pushed 12 out.

    Core 2 uses a single execution cluster with a unified scheduler and reservation stations across multiple ports, like in Athlons - K10 has a split integer and floating point cluster with distributed scheduler and reservation stations.

    IDIV instructions are variable latency and not a fixed iteration as in K8 previously. This K10 32 bit divide latency is roughly 10 cycles faster than in K8.

    Third ALU is now for LZCOUNT/POPCOUNT.

    ALUs are most optimized for power efficiency rather than peak performance ATM.

    Out-of-order memory access prevents operation stalls seen in K8 (especially with load).

    In K10, each core has 8 prefetchers which fetch data into the L1D cache whereas in K8 they were prefetched to the L2 cache.

    That's some of it's better features over the K8 (let alone the DRAM controllers) which make it by no way equal. According to what I see, Barcelona quintessence is the Budapest core at higher clock speeds.

    Now to the argument of efficiency (a) can't be and can be correlated with frequency (b) in a format we don't expect - all based on what Gary said. Gary commented on what he saw, simply. Processors have an upper limit whereafter more frequency=little performance gain, a mid optimum band where more frequency=inclining gradient performance, a lower band whereby the frequencies=sub-optimal performance.

    I will provide you an example of an older CPU I have with some SPI mod 1.5XS 1M tests I ran 1-3 months or so back (P4 Celeron D DO S478, Malaysia week 21 of '04). Memory is synchronous 512MB.

    Clock (MHz) - Time (sec) - Scaling
    1802MHz = 116.547sec = 100%
    1997MHz = 104.110sec = 112%
    2104MHz = 97.480sec = 119.6%
    2205MHz = 73.455sec = 158.67%
    2302MHz = 88.307sec = 132%
    2402MHz = 85.009sec = 137%
    2504MHz = 80.486sec = 144.8%
    2604MHz = 77.161sec = 151%
    2701MHz = 73.541sec = 158.47%
    2798MHz = 70.431sec = 165.5%
    2999MHz = 65.044sec = 179%
    3201MHz = 60.396sec = 193%
    3301MHz = 60.036sec = 194%
    3360MHz = 48.460sec = 240.5%
    3503MHz = 47.849sec = 243.6%

    See what happened from 1800-2300 and from 3300-3360? Now let's place K10 in there assuming relative scaling like I witnessed with my P4 (not compared to P4 though, but K8). What would happen as a result?

    Base comparison: (P4 1800) K8 1900 = 100%

    (P4 2300) K10 1900 = 132%
    (P4 2400) K10 2000 = 137%
    (P4 2500) K10 2100 = 145%
    (P4 2600) K10 2200 = 151%
    (P4 2700) K10 2300 = 158%
    (P4 2800) K10 2400 = 166%
    (P4 2900) K10 2500 = 171%
    (P4 3000) K10 2600 = 179%
    (P4 3100) K10 2700 = 186%
    (P4 3200) K10 2800 = 193%
    (P4 3300) K10 2900 = 194%
    (P4 3400) K10 3000 = 240% **PEAK MOST STABLE PERFORMANCE**
    (P4 3500) K10 3100 = 243%
    (P4 3600) K10 3200 = 244%
    (P4 3700) K10 3300 = NA

    This scenario I've witnessed and is based on real life processor performance, so it's equally possible. By AMD and others touting 3GHz Phenom, again and again, I would expect this is loosely what we are to expect.

    Here's one Phenom MB BTW: http://img170.imageshack.us/img170/2...63c0cf7kk7.jpg

  3. #428
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803
    nice figures....but the equating of k10 to real figures is....purely hypothetical.

    how is the k8=100% (@1900) and k10=132% (@1900) derived? - does this assume a 32% gain for k10 over k8 baseline?

    and does it also assume identical scaling for k10 versus an old p4?
    Last edited by adamsleath; 09-02-2007 at 05:46 PM.
    i7 3610QM 1.2-3.2GHz

  4. #429
    Xtreme Mentor
    Join Date
    May 2007
    Posts
    2,792
    Quote Originally Posted by adamsleath View Post
    nice figures....but the equating of k10 to real figures is....purely hypothetical.
    It can get far worse believe me.

    One of the points I wanted to show is how it clearly and perilously casts doubts on this http://img.coolaler.com.tw/images/zm...mlwemyzmzk.jpg compared to this http://img527.imageshack.us/img527/6582/sp47cj7.jpg

    Bro, that was 1x 512MB RAM not even dual channel and an old CD. A P4 will get 10 seconds lower at least, beating the Barcelona quad core time they showed.
    how is the k8=100% (@1900) and k10=132% (@1900) derived? - does this assume a 32% gain for k10 over k8 baseline?
    Yes, the K8 1900 was baseline compared to a K10 1900 - as an assumption for better clock per clock performance. It could be far lower, obviously, but this is a vague explanatory comparison of what Gary stated rather than a "prediction".

    What I was saying is, Gary of Anandtech could be basing his statements on similar performance scaling he saw with the K10, as I did with the Celeron chip.
    and does it also assume identical scaling for k10 versus an old p4?
    Yes. It's not impossible is what I'm saying. Look at how the numbers fluctuate with the Celeron and where. Pure technical math cannot account for this, so we won't be able to explain it, but we'll experience it. It's possible that K10 at lower clocks does not scale as well as some higher clocks. I've just shown you in one application how my old Celeron did it, which means it's entirely possible.

  5. #430
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by adamsleath View Post
    nice figures....but the equating of k10 to real figures is....purely hypothetical.

    how is the k8=100% (@1900) and k10=132% (@1900) derived? - does this assume a 32% gain for k10 over k8 baseline?

    and does it also assume identical scaling for k10 versus an old p4?
    Don't believe his data, there are two typo's and he misinterpreted it.

  6. #431
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803
    ...but it is a real scaling example...ie p4.
    i7 3610QM 1.2-3.2GHz

  7. #432
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by adamsleath View Post
    ...but it is a real scaling example...ie p4.
    He is using to argue non-linear scaling, but he misinterprets the data..

    First, there are two typos, or atleast I think they are typos, the first is at 2205 Mhz, he says 73.455 but I think he meant 93.455, this is in line with what would be expected. The other is 3301 or 3201, I looks like he ran the same run twice at the same frequency but recorded a different speed. Here is a plot of his SP1M time vs frequency



    If you correct his typo (heck sometimes my 9's look like's 7), then it behaves more as expected. For example:


    The typo's or inconsistent data points are not what really matters, what does matter is that he normalizes to the slowest time, and converts to a percentage... he makes the most common mistake when one analyzes time to complete rather than rate ... SP1M is measured in time it takes to complete the task, but processor speed is measured in frequency ... he cannot calculate scaling factors when the units of one dimension is the inverse of the other.... he should have taken 1/time and plotted against fequency to check linearity, when you do that, it becomes completely linear:


    In short, he data shows nothing but a handful of mistakes and that indeed SP1M scales linearly with clock speed.

    Jack
    Last edited by JumpingJack; 09-02-2007 at 08:38 PM.

  8. #433
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803

    jumpingjack's scaling graph.
    the non-linear data does look a bit fishy.
    Last edited by adamsleath; 09-02-2007 at 08:30 PM.
    i7 3610QM 1.2-3.2GHz

  9. #434
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by adamsleath View Post
    the non-linear data does look a bit fishy.
    Yeah, every test I have ever performed on SP1M shows linear scaling with CPU clock... though I have not tested on a P4, I have tested on a K8 and C2D.

  10. #435
    c[_]
    Join Date
    Nov 2002
    Location
    Alberta, Canada
    Posts
    18,728
    he cannot calculate scaling factors when the units of one dimension is the inverse of the other.... he should have taken 1/time and plotted against fequency to check linearity, when you do that, it becomes completely linear:
    Is there a published scientific paper on that?

    Thats reminiscent of people pricing things at $24.99 instead of $25 to make people see the smaller number. Its not hard to make your numbers look good when graphing, same things goes for finding percentages.

    As to normalizing to a slower time we have to use stock as the baseline else you are going on prediction only. Using percentages wasnt needed but using either percentages of increase over stock or performance product factors would have had the same outcome, just maybe more confusing numbers for some.

    All along the watchtower the watchmen watch the eternal return.

  11. #436
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by STEvil View Post
    Is there a published scientific paper on that?

    Thats reminiscent of people pricing things at $24.99 instead of $25 to make people see the smaller number. Its not hard to make your numbers look good when graphing, same things goes for finding percentages.

    As to normalizing to a slower time we have to use stock as the baseline else you are going on prediction only. Using percentages wasnt needed but using either percentages of increase over stock or performance product factors would have had the same outcome, just maybe more confusing numbers for some.

    No, this is Jr. High Math... Just plot it... .... Super Pi measured in time is an inverse function of frequency as it should, because frequency is 1/time.... this is not rocket science... heck run the experiment yourself... here is an X6800 stretching out the range of time and frequency so it is easier to see the functionality:

    First the raw data:


    Next, SP in time domain vs Frequency:


    Finally, in the correct time domain (i.e. reciprocal time by taking 1/time, which makes it a linear function of frequency) for SP1,2,4,8M
    Last edited by JumpingJack; 09-02-2007 at 09:56 PM.

  12. #437
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803


    does not this indicate that performance increase is non linear and that (as i thought) scaling gives diminishing returns...?
    ie once you reach the shallower part of the curve.
    ie performance increase starts off sharp then plataeus. - such that higher and higher clocks give less and less performance increase delta.... for a given core at speeds plotted on the graph.

    it is a parabolic curve (i think)

    and it is a long time since i did school mathematics.

    the scaling is NOT linear. - extrapolating beyond 3500(speed) in this example yields negligible performance returns; indeed the difference between 2500 and 3500 is s. f. a.
    Last edited by adamsleath; 09-02-2007 at 10:11 PM.
    i7 3610QM 1.2-3.2GHz

  13. #438
    Xtreme Mentor
    Join Date
    Mar 2006
    Posts
    2,978
    Quote Originally Posted by adamsleath View Post

    does not this indicate that performance increase is non linear and that (as i thought) scaling gives diminishing returns...?
    ie once you reach the shallower part of the curve.
    ie performance increase starts off sharp then plataeus. - such that higher and higher clocks give less and less performance increase delta.... for a given core at speeds plotted on the graph.

    it is a parabolic curve (i think)

    and it is a long time since i did school mathematics.

    the scaling is NOT linear. - extrapolating beyond 3500(speed) in this example yields negligible performance returns; indeed the difference between 2500 and 3500 is s. f. a.
    Not exactly, that is the point of how a processor understands time..... a processor only understands a clock tick.. A signal that goes high then low. The total time for that is irrelevant. You and I perceive time not clock ticks, so in your words this is correct ... point of diminishing returns.

    Frequency is cycles per second, or cycles/second. As frequency increases the number of ticks increases within one second. However, calculating Pi in super PI only depends X number of clock ticks that performs x number of instructions. As you increase frequency, you increase the number of clock ticks per unit time, but you are observing unit time.... so the functionality of the observed time it takes to complete a task (lower is better) is inversely proportional to frequency. I.e. F(x) = 1/x, so plot a simple plot of 1/x, it approaches zero asymptotically, a paraboly is afunction of f(x) = x^2 this is different.

    Units of a quanity are important, and mathematically, they are treated like any variable or number. Super PI, Pi calculated and reported in time is not directly linear to Frequency because frequency is in 1/Time... to make one a direct finction of the other simply convert Super PI from time domain to frequncy domain and plot... wolla linear. This is the same as you will read around when people discuss how to calcuate % improvement for benches that are 'slower is better', slower is better benchmarks always approach a 'point of diminishing returns'.... go check it out, find any review of a series of processors varying as a function of frequency for the same core where the reported bench is in units of time... and plot time vs frequency, it will always be inversely proportional.... Very simple.

    KTE's data is also 'paraboloic' to use your word -- (actually not parabolic, that is a function of a quadratic equation), he just chose 1M on short time scales and over a smaller frequecy range in that he was on a 'flatter' area of the curve... if he did 8M and repeated the same data, assuming he makes no mistakes... he would get what you see in the X6800 data above as the 8M stretches out time to see the inverse proportionality.

    Now, how does this relate to K10... well not much, we first have to assume that the one data point is correct (i.e. ~39 second SP1M at 2.0 GHz), what people are arguing is that K10 'turns on' after 2.4-2.6 GHz range such that it scales 'better'.... this is odd way of arguing it, because the digital logic of a CPU is just that, it only knows a clock tick it does not care how long that clock tick is when all the transistors flip on and off to give the computational result for that tick... simply speeding up the ticks does not change the logistical arrangement of bits and the functional blocks that create the logic to actuate those bits.....

    From this data point, again assuming it is true, in the absense of extrenal bottlenecks (such as the memory, if that is even important), Super Pi should scale at best linear to frequency ... so, within a few %+/- due to noise (background processes, etc), SP1M for K10 would scale as such:

    2.0 Ghz == 39 seconds
    2.2 Ghz == 36.4 seconds
    2.4 GHz == 34.2 seconds
    2.6 GHz == 32.3 seconds
    2.8 Ghz == 30.7 seconds
    3.0 Ghz == 29.3 seconds

    But this is gross based on one data point, I personally have a hard time believing K10 will give this kind of super pi performance, it is barely better than a K8....

    Jack
    Last edited by JumpingJack; 09-03-2007 at 05:53 AM.

  14. #439
    Banned
    Join Date
    Jul 2007
    Posts
    264
    A question.
    If you were to slow a processor down, lower than its intended clockspeed, would it be possible that there comes a point at which performance suddenly takes a bigger hit than is expected by the decreased clockspeed?
    Is it possible that there are mechanisms in a processor that only contribute above a minimum clockspeed, and work against performance below a certain clockspeed?

  15. #440
    Xtreme Cruncher
    Join Date
    May 2007
    Posts
    570
    Quote Originally Posted by Lightman View Post


    @leoftw
    How your system is configured memory wise. Do you have DIMMs plugged for both CPU sockets?

    Can you run SuperPi? It is not x64 optimized so scores will be very comparable with your system. Same goes for CPU-Z cache latency test.
    Thanks for your effort!

    EDIT: I just noticed over 4x speedup in muliCPU test! Why is that?? have you done 1-CPU test at lower clocks???


    Hey guys my system is 100% OEM , nothing is overclocked or I have no special settings . Yes I do have dimms plugged in for both CPU sockets , each processor supposedly gets 2gigs from what I've read in the bios .

  16. #441
    Xtreme Guru adamsleath's Avatar
    Join Date
    Nov 2006
    Location
    Brisbane, Australia
    Posts
    3,803
    what people are arguing is that K10 'turns on' after 2.4-2.6 GHz range such that it scales 'better'....
    what cpu has ever done that?
    not that id be complaining if somehow that was built in to the cpu (not likely)

    multithreaded superpi anyone??

    or a chip design that executes many more instructions per cycle or somesuch.
    Last edited by adamsleath; 09-03-2007 at 12:55 AM.
    i7 3610QM 1.2-3.2GHz

  17. #442
    Xtreme Addict
    Join Date
    Jun 2002
    Location
    Ontario, Canada
    Posts
    1,782
    Quote Originally Posted by KTE View Post
    It can get far worse believe me.

    One of the points I wanted to show is how it clearly and perilously casts doubts on this http://img.coolaler.com.tw/images/zm...mlwemyzmzk.jpg compared to this http://img527.imageshack.us/img527/6582/sp47cj7.jpg

    Bro, that was 1x 512MB RAM not even dual channel and an old CD. A P4 will get 10 seconds lower at least, beating the Barcelona quad core time they showed.
    Yes, the K8 1900 was baseline compared to a K10 1900 - as an assumption for better clock per clock performance. It could be far lower, obviously, but this is a vague explanatory comparison of what Gary stated rather than a "prediction".

    What I was saying is, Gary of Anandtech could be basing his statements on similar performance scaling he saw with the K10, as I did with the Celeron chip.
    Yes. It's not impossible is what I'm saying. Look at how the numbers fluctuate with the Celeron and where. Pure technical math cannot account for this, so we won't be able to explain it, but we'll experience it. It's possible that K10 at lower clocks does not scale as well as some higher clocks. I've just shown you in one application how my old Celeron did it, which means it's entirely possible.
    That's definitely interesting. I guess I'm so used to seeing "linear" scaling, that I didn't give it much thought. It will definitely be cool to see otherwise with K10.

    Just a guess here concering SuperPI. Is it possible that we'll see non linear scores due to the L3 cache having decreased latency as the core clock scales higher or is the latency always the same, no matter what the core is clocked at?
    Last edited by freeloader; 09-03-2007 at 02:46 AM.

  18. #443
    Registered User
    Join Date
    Aug 2007
    Posts
    77
    The point Jack is trying to make is that, SuperPI, is LOOPED CODE and the IPC improvements of K10 are the same no matter what frequency the CPU is running at.

    And that's that problem with looped code; it will only give you the performance of a very specific scenario and nothing more.

  19. #444
    Registered User
    Join Date
    Aug 2007
    Posts
    77
    Quote Originally Posted by freeloader View Post
    That's definitely interesting. I guess I'm so used to seeing "linear" scaling, that I didn't give it much thought. It will definitely be cool to see otherwise with K10.

    Just a guess here concering SuperPI. Is it possible that we'll see non linear scores due to the L3 cache having decreased latency as the core clock scales higher or is the latency always the same, no matter what the core is clocked at?
    You're thinking in the time domain again. The number of clock cycles for code to execute will always be the same regardless of how long those clock cycles are.

    So to answer your question, no. It will not affect the scaling.

  20. #445
    Xtreme Mentor
    Join Date
    May 2007
    Posts
    2,792
    Nice analysis going on although it's becoming very offshoot to what I stated and did. Scaling wasn't shown in the true mathematical sense, little time to do it. This was data I already had for some time now, copied it over. All that was meant to be shown is the % change in SPI 1M times as frequency rises above the 1800MHz time, which is projected as 100%, or stock. Call is 0% if you like and anything on-top is the gained percentage.

    K10 comes in where people state affirmatively that something they do not own nor have seen perform from a released product, cannot react in a certain way, for whatever musings. I've seen erratic jumps before in processor performance, and I'm showing one of them right here.

    It''s not to "argue" anything but a viable possibility you cannot on any grounds reject yet. You have no evidence to. Or we'd like to see it.

    Quote Originally Posted by JumpingJack View Post
    He is using to argue non-linear scaling, but he misinterprets the data..

    First, there are two typos, or atleast I think they are typos, the first is at 2205 Mhz, he says 73.455 but I think he meant 93.455, this is in line with what would be expected.
    No typo, I clearly state it's experimental results. I predicted linear scaling, as most people would. But investigation speaks otherwise - something I cannot explain but I experience.
    Conditions are exactly the same for all but >3500 (unstable), in fact I'm sure they were ran one after another on the same day, with minimum services/processes running in the background. One thing I clearly stated I didn't and couldn't do, is keep the memory asynchronous. RAM timings/divider is kept the same. But the rest I'll show it you and I've repeated it time and time again from around 4 months back. All of the data stands final.

    I'll try and find what I still can, one second. Hmm... there's data missing here. I only seem to have a few of the run results saved but have more info in the text file saved instead. Anyway, the interesting ones are all still there, the rest of the missing +100MHz sampling points are progressive and linear as you would expect and I'll try finding them on the other drives. Here ya go.

    1800MHz = http://img212.imageshack.us/img212/5...00spi1mlx8.jpg
    2000MHz = http://img406.imageshack.us/img406/8...00spi1mus3.jpg
    2100MHz = http://img510.imageshack.us/img510/6...00spi1mgc7.jpg
    2200MHz = http://img205.imageshack.us/img205/1...00spi1mwn9.jpg
    2300MHz = http://img250.imageshack.us/img250/8...00spi1mqk4.jpg
    2500MHz = http://img508.imageshack.us/img508/7...00spi1mhm8.jpg
    2600MHz = http://img209.imageshack.us/img209/3...00spi1mlb0.jpg
    2800MHz = http://img510.imageshack.us/img510/2...00spi1map4.jpg
    3000MHz = http://img515.imageshack.us/img515/1...00spi1mod1.jpg
    3200MHz = http://img265.imageshack.us/img265/4...00spi1mnr5.jpg
    3300MHz = http://img250.imageshack.us/img250/8...00spi1mzk4.jpg
    3360MHz (3400 is the same) = http://img210.imageshack.us/img210/4379/spi48qg8.jpg
    3500MHz = http://img250.imageshack.us/img250/2747/sp47dm3.jpg

    3500MHz is quickest = after that, there's hardly any change if the processor runs it (memory bottleneck). If I had similar memory and system now, I'd repeat them now again just to refresh, but I don't and I didn't know this was coming to prepare but just did it for my own personal investigating back then.

    BTW, science doesn't equal "we expect this and this all that can be true." Broken logic is to expect linear scaling and when something other occurs you start the conspiracies. Science broadens your horizons to accept observational finding, like the new colossal area devoid of matter found in space, even devoid of dark energy, which was NEVER predicted nor expected at those sizes and changes acceptance of many beliefs and idea's held by physicists beforehand.

    Scientific experiment = controlled conditions + variable factor + experiment + observation + repetition + results

    I've just given you results of some of my findings, enough for a genuinely interested person to see what's happening for themselves.

    The other is 3301 or 3201, I looks like he ran the same run twice at the same frequency but recorded a different speed. Here is a plot of his SP1M time vs frequency
    Conspiracy theories and conjectures will get one no where. All you have to do is ask for evidence before the insinuating.
    In short, he data shows nothing but a handful of mistakes and that indeed SP1M scales linearly with clock speed.
    That's what you want to believe Jack, not what the evidence shows. I'm sorry but you haven't proved how my finding ties in with your belief.

  21. #446
    Registered User
    Join Date
    Apr 2005
    Posts
    2

    Smile Hi there

    I was just looking at the forums at coolaler and saw this. This is K10 in Pi B0 stepping I think. And from the picture it does Pi in 32 seconds at only 1800Mhz. By the way, I'm new here.

  22. #447
    Registered User
    Join Date
    Apr 2005
    Posts
    2

    sorry , forgot the link


  23. #448
    Xtreme Addict
    Join Date
    Jun 2002
    Location
    Ontario, Canada
    Posts
    1,782
    Quote Originally Posted by CoW]8(0) View Post
    You're thinking in the time domain again. The number of clock cycles for code to execute will always be the same regardless of how long those clock cycles are.

    So to answer your question, no. It will not affect the scaling.
    So basically you're telling me that a 2ghz Barcelona would have the same L3 cache latency (any cache latency for that fact) as a 3ghz Barcelona? Thanks for helping out with this stuff, it's interesting.

  24. #449
    Xtreme Legend
    Join Date
    Sep 2002
    Location
    Finland
    Posts
    1,707
    Quote Originally Posted by gbjorn View Post
    I was just looking at the forums at coolaler and saw this. This is K10 in Pi B0 stepping I think. And from the picture it does Pi in 32 seconds at only 1800Mhz. By the way, I'm new here.
    Quote Originally Posted by gbjorn View Post
    CPU-Z-window has different theme than SuperPi window? Yeah right, nice photoshop.
    Favourite game: 3DMark
    Work: Muropaketti.com - Finnish hardware site
    Views and opinions about IT industry: Twitter: sampsa_kurri

  25. #450
    Banned
    Join Date
    Aug 2007
    Posts
    1,014
    Quote Originally Posted by Sampsa View Post
    CPU-Z-window has different theme than SuperPi window? Yeah right, nice photoshop.
    i thought superpi theme was customizble

Page 18 of 29 FirstFirst ... 81516171819202128 ... LastLast

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •