What if we could not just understand an AI's "thoughts" but actually steer and improve them? A large collaborative team from HKU, Fudan, LMU Munich, Microsoft, Tencent, and others presents a new playbook. They reframe AI interpretability as an actionable pipeline: Locate the key
Steering AI Thoughts: New Interpretability Pipeline Framework