i think i see what chew is getting at

if we overclock to the max with 2 CUs and 4 threads, vs 4 CUs and 4 threads, you can OC higher because heat wont be your issue nearly as quickly. we saw that in low threaded apps 5% gains are expected from not sharing CUs (in gaming for example), while he can get more than 5% higher clocks which will offset that value.

so i think the best thing is to just leave all cores there, OC high for full load, but then use turbo to OC lower threads even higher if possible.
so with a normal OCing air cooler, like ~4.7ghz on all CUs with a moderate voltage, but 5.0-5.2ghz for 1-2 CUs
i would like to know if using turbo with overclocking is as easy as it was with thuban, any insight on this guys?