is tehre differences between the SSE implementations between K10 and Kentsfield?

I just remembered that a few minutes before reading it being said here. It can be a possibility. That and intel compiler making huge use of caches by default.