Everything filed under ml



What's different about a Matryoshka SAE?

Brief notes from the Matryoshka SAEs paper.

10 Autoencoders in a Trenchcoat, part 1

Notes on the core sections of Anthropic's Toy Models of Superposition.

Notes on "A Mathematical Framework for Transformer Circuits"

Close-reading a classic interpretability paper and trying to make sense of it.

Four Papers: Align and Translate, Seq2Seq, Pointer Networks, and Attention

My notes from the third meeting of the 90/30 Club, a paper-reading club (open to everyone!) in SF.