Implementation and analysis of sparse autoencoders for neural network interpretability research. Includes an interactive visualization dashboard and Weights & Biases (W&B) integration.
sparse-autoencoders interpretability activation-functions neuron-activity wandb transformerlens mech-interp
Updated Feb 27, 2025 - Python