A bump after a long time...

Here's a screenshot from a binary tuned for AMD Bulldozer using FMA4 and XOP instructions.
AMD FX-8350 @ 4.0 GHz (stock) with 16 GB @ 1333 MHz:



It's not close to done yet, as the large-size algorithms still need to be re-tuned.
I plan to release this binary in v0.6.4. But it may come earlier (in v0.6.3) if the stuff that's supposed to be in v0.6.3 drags on too long.

If all goes well (which it never does), the ETA for v0.6.3 is late December, with v0.6.4 following in January.

In the meantime, if you have a Bulldozer machine, I highly recommend running the "x64 SSE3 ~ Kasumi" binary instead of what the program auto-selects (which is "x64 AVX ~ Hina"). I've found Bulldozer's 256-bit AVX performance to be pretty crappy.
The author of Prime95 explains why in this thread: http://www.mersenneforum.org/showthread.php?t=17618
And as such, the FMA4/XOP binary will use 128-bit AVX, FMA4, and XOP instructions.

If the FMA4/XOP binary doesn't make it into v0.6.3, I'll have the version-selector choose "x64 SSE3 ~ Kasumi" instead of "x64 AVX ~ Hina" for AMD Bulldozer-line processors.
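
To illustrate that fallback rule, here's a rough sketch of the selection logic. It is not the actual version-selector code: `CpuInfo` and `select_binary` are made-up names, the feature flags are assumed to have been read from CPUID elsewhere, and Bulldozer-line chips are assumed to be identified by the "AuthenticAMD" vendor string plus family 15h. Only the two binaries mentioned in this post are covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CpuInfo:
    """Hypothetical container for values read from CPUID elsewhere."""
    vendor: str     # CPUID vendor string, e.g. "AuthenticAMD" or "GenuineIntel"
    family: int     # effective family (base + extended); 0x15 for the Bulldozer line
    has_sse3: bool
    has_avx: bool


def select_binary(cpu: CpuInfo) -> str:
    """Pick a binary name; only the two binaries named in this post are handled."""
    # Assumption: "Bulldozer line" means AMD family 15h.
    is_bulldozer = cpu.vendor == "AuthenticAMD" and cpu.family == 0x15

    # Bulldozer reports AVX support, but its 256-bit AVX is slow,
    # so prefer the SSE3 binary there even though AVX is available.
    if is_bulldozer and cpu.has_sse3:
        return "x64 SSE3 ~ Kasumi"

    # Everything else keeps the usual rule: best supported instruction set wins.
    if cpu.has_avx:
        return "x64 AVX ~ Hina"
    if cpu.has_sse3:
        return "x64 SSE3 ~ Kasumi"

    raise ValueError("no suitable binary in this sketch for the given CPU")


if __name__ == "__main__":
    fx_8350 = CpuInfo("AuthenticAMD", 0x15, has_sse3=True, has_avx=True)
    print(select_binary(fx_8350))
```

The only special case is the Bulldozer check ahead of the normal AVX test; every other processor is selected exactly as before.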

Other news: I burned my Sandy Bridge machine last week.
A careless short-circuit took out the motherboard and possibly the CPU as well. So I will no longer be able to do performance tuning for the "x64 AVX ~ Hina" binary. The binary will remain (for a while), but its tuning parameters can no longer be updated.