Activation Patching

2 articles tagged “Activation Patching”

Local MI Lab: How Controls Broke My First Induction Story

By GTCode.com • Jun 26, 2026

I built Local MI Lab because SELF-GROUND had become too heavy for the amount of mechanistic interpretability practice I actually had. The negation SAE work taught me a lot, but it also wrapped every …

My First Real Mechanistic Interpretability Attempt: SELF-GROUND, Controls, and the Discipline of Negative Results

By GTCode.com • Jun 26, 2026

I did not get the result I wanted from my first serious mechanistic interpretability attempt. Probably the best thing that could have happened. I started with an attractive idea: take sparse …