The field of mechanistic interpretability has matured rapidly over the past two years, transitioning from an academic curiosity to a critical component of AI safety research. As large language models …
AI Safety
1 article tagged “AI Safety”
1 article tagged “AI Safety”
The field of mechanistic interpretability has matured rapidly over the past two years, transitioning from an academic curiosity to a critical component of AI safety research. As large language models …