[LG] Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders
[Yale University & Shanghai Jiao Tong University]
https://arxiv.org/abs/2506.14002