I just changed the file to write 0x00A00020 instead of 0x00200020 for the 2nd register. Performance decreased slightly, about in par with Fix 1 (without writing anything to the 2nd register).
So you should maybe try 0x00200020, it seems faster for me. No stability issues so far, been running benches and crunching for a while now.
Bookmarks