Research Thoughts

Hello! This is where I keep general thoughts and mini explorations on interpretability research.

Analysis of a two-layer transformer trained to count unique tokens in a sequence.
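
For concreteness, the counting task can be sketched as below. This is only a minimal illustration, not the training setup from the analysis: the helper names `count_unique` and `make_example`, the sequence length, and the vocabulary size are all assumptions, and the actual model may instead be trained to predict a count at every position.

```python
import random


def count_unique(tokens):
    """Target label: the number of distinct tokens in the sequence."""
    return len(set(tokens))


def make_example(seq_len=10, vocab_size=5, seed=None):
    """Sample one (tokens, label) pair.

    seq_len and vocab_size are hypothetical values chosen for illustration.
    """
    rng = random.Random(seed)
    tokens = [rng.randrange(vocab_size) for _ in range(seq_len)]
    return tokens, count_unique(tokens)


if __name__ == "__main__":
    tokens, label = make_example(seed=0)
    print(tokens, "->", label)
```

For example, `count_unique([3, 1, 3, 2, 1])` gives 3, since the distinct tokens are 1, 2, and 3.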