AI Dynamics

Global AI News Aggregator

Understanding Mixture of Experts Architecture and FFN Weights

Based on how Mixture of Experts (MoE) models work, I believe this is possible. Each expert is essentially a feed-forward network (FFN) with its own weights.
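To make the "each expert is an FFN with its own weights" idea concrete, here is a minimal pure-Python sketch of an MoE layer. It is an illustration only, not any particular model's implementation: the names (`make_ffn`, `moe_forward`, `router_w`), the tiny dimensions, the ReLU activation, and top-2 routing are all assumptions chosen for readability.

```python
import math
import random

def make_ffn(d_model, d_hidden, rng):
    """Create one expert: a two-layer FFN with its own randomly initialised weights."""
    w1 = [[rng.gauss(0, 0.1) for _ in range(d_hidden)] for _ in range(d_model)]
    w2 = [[rng.gauss(0, 0.1) for _ in range(d_model)] for _ in range(d_hidden)]
    return {"w1": w1, "w2": w2}

def matvec(x, w):
    """Compute x @ w for a vector x (length n) and a matrix w (n rows, m columns)."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]

def ffn_forward(expert, x):
    """Run one expert: linear -> ReLU -> linear (activation choice is an assumption)."""
    hidden = [max(0.0, h) for h in matvec(x, expert["w1"])]
    return matvec(hidden, expert["w2"])

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(x, experts, router_w, top_k=2):
    """Route one token to its top_k experts and mix their outputs by gate weight."""
    logits = matvec(x, router_w)  # one routing score per expert
    top = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    gates = softmax([logits[i] for i in top])  # renormalise over the chosen experts
    out = [0.0] * len(x)
    for gate, idx in zip(gates, top):
        y = ffn_forward(experts[idx], x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, top

# Hypothetical toy sizes, purely for illustration.
rng = random.Random(0)
d_model, d_hidden, n_experts = 4, 8, 4
experts = [make_ffn(d_model, d_hidden, rng) for _ in range(n_experts)]
router_w = [[rng.gauss(0, 0.1) for _ in range(n_experts)] for _ in range(d_model)]

token = [0.5, -1.0, 0.25, 2.0]
output, chosen = moe_forward(token, experts, router_w)
print("experts used:", chosen)
print("output:", output)
```

Every entry in `experts` holds separate `w1` and `w2` matrices, so each expert can be stored, inspected, or swapped independently of the others. Only the router and the surrounding layers are shared.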

→ View original post on X — @akshay_pachaar