is it 32 clusters of 16 shaders or 16 clusters or 32 shaders?

Which is it?

And when did benchmarks become wildly theoretical?