The Problem
Outliers stretch the quantization range → slots wasted on tails → normal values lose precision
The Rotation Fix
WHT rotation spreads outlier energy → range collapses → all coordinates follow N(0, 1/d)
Optimal Slots
Known distribution → Lloyd-Max finds optimal centroids → more slots near zero → PolarQuant
The QJL Lesson
Residual correction reduces bias but increases variance → softmax amplifies variance → more centroids wins
Asymmetric Insight
K errors → softmax → exponential damage. V errors → linear scaling. Keep K precise, compress V freely.
Stack Everything
Boundary V + Sparse V + Block 128 → orthogonal optimizations → 3.8–6.4× compression + 22.8% faster