Research Interests
Hi! I'm Logan. I wanted to compile a document describing my current interests, in case it increased the likelihood of serendipitous coincidences — someone seeing this site, saying "oh, I'm working on that!" or "oh, I want to do a project like that!" and reaching out. As described before, a website is a long and complex search query (etc).
My main interest right now is mechanistic interpretability. I think mechanistic interpretability is both technically fascinating and philosophically, or something like "for the purposes of general world-modeling, " interesting. As of Spring 2025 I'm working on understanding learning differences in ViTs and ResNets using SAEs (it's for a vision models class, lol). Mechanistic interpretability is my intellectual priority right now, and I want this page to first of all reflect that; this, or adjacent things, is what I mainly want to work on. It's also something I actually have real technical skills in, which makes working on projects in it easier.
Note (May 20): I'm looking work in interpretability, or adjacent fields! Please reach out (me at logan graves dot com
) if you (or others you know) are hiring. :)
With that said, here are other research topics I'd totally still be interested in:
- Mathematical models of agent coordination
- Abstract models (e.g. game theory)
- In the wild models (e.g. mechanism design)
- Qualitative models (e.g. history)
- Computational neuroscience, particularly computational neuroscience projects that draw from effective methodologies from AI (e.g. mechanistic interpretability)
- Neuron modeling
- Modeling neural circuitry (this one especially)
- Intellectual history
- 20th century intellectual history; how ideology shaped response to crises
- Longer-term undercurrents in human thought, e.g. comparative classics research
- Theology in/and mathematics
- Cyborgism-y things, e.g. what Neuralink is doing, as well as Chalmers and Clark's "Extended mind"