Belief manifolds, and how to steer along them
May 24, 2026A reproduction of Sarfati et al.’s “The Shape of Beliefs”
How LLMs encode in-context beliefs as curved manifolds, and how manifold-aware steering changes them with fewer side effects than linear steering.
BlueDot Technical AI Safety Project