ongoing projects Exploring claims about stable regions in activation space notes Why don’t SAEs solve superposition Ways to Extract Directions in Latent Space dump Improving Model Latent Visualization