L3 latency is usually slightly lower on Bloomfield than L2 latency on Yorkfield. Also Bloomfields L3 is a bit smaller than Yorkfields L2. I guess Nehalems cache system will start to shine in 3+ threaded apps, where shared L3 will definitely outperform two L2s connected by FSB and with doubled data.