Very interesting. I gotta say, I'm learning a LOT from this thread, although I also honestly, still don't understand alot of it as well only because I'm not a programmer. BUT...I THINK that I sorta get the jist of it though (and pity that you're so far otherwise I think that I can really learn programming from you), since you're one of the few people that I've EVER met who can explain it and explain it well!
Here's something interesting that I've noticed -- on the 48-core system that I've got at work - it's four sockets. I THINK that I have it set up so that there are four NUMA nodes.
And when I run y-cruncher on it, it doesn't really seem to quite take full advantage of the hardware and that's evident in the CPU utilization reported by Windows Task Manager and also by the program itself.
Is that (^ the above aforementioned ^) why it's like that?
I'm only used to commercially available software that's MPI and/or OpenMP and maybe it's cuz it's commercial, so it has very high CPU utilization. ?




Reply With Quote

Bookmarks