Quote Originally Posted by savantu View Post
You've shown the worst example for HT. Linpack uses Intel libraries which are assembler optimized, you cannot get any higher efficiency than Linpack on CPUs; obviously HT has nothing to help here.
Well, technically, he showed you what you asked for, and then you dismissed it.

If you are arguing that with perfectely optimized code, there will be no use for HT, then I agree with you. HT is designed to take advantage of gaps in the pipeline.

What happens when code gets more optimized, does HT become less relevant?

As code becomes more optimized, more physical cores will give you better throughput. As code becomes more optimized, HT would, theoretically, give you less throughput (or, more likely the same, but the main thread would be better and the HT thread would be worse, probably zero-sum game.)