Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla paper page: https://
huggingface.co/papers/2307.09
458
… Circuit analysis is a promising technique for understanding the internal mechanisms of language models. However, existing analyses are done
Circuit Analysis Interpretability Scaling in Chinchilla Language Models
By
–
