1Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (opens in new tab)(transformer-circuits.pub)1smaddox2y ago1Save
2The Matrix: A Bayesian learning model for LLMs (opens in new tab)(arxiv.org)arXiv3smaddox2y ago0Save