Everything filed under notes



Understanding the Parameter Decomposition papers

Understanding attribution-based and stochastic parameter decomposition methods

What's different about a Matryoshka SAE?

Brief notes from the Matryoshka SAEs paper.

10 Autoencoders in a Trenchcoat, part 1

Notes on the core sections of Anthropic's Toy Models of Superposition.

Notes on "A Mathematical Framework for Transformer Circuits"

Close-reading a classic interpretability paper and trying to make sense of it

(Roughly) 200 Words on Sartre

On Existentialism is a Humanism; selections from Being and Nothingness

Four Papers: Align and Translate, Seq2Seq, Pointer Networks, and Attention

My notes from the third meeting of the 90/30 Club, a paper-reading club (open to everyone!) in SF.

Building a transformer

Trying to figure out how to set up a transformer in PyTorch, before I knew any ML or PyTorch.

Information theory and statistics

Notes on the basics of information theory

Algorithmic information theory and friends

Incomplete notes on algorithmic information theory, from when I was younger.

Formal models of decision-making

Incomplete notes on decision theory.

Notes on Sorting Algorithms

A started-but-not-nearly-finished set of notes on sorting algorithms.

Notes on set theory and the foundations of math

Basic notes from getting comfortable with thinking about sets.