AI Dynamics

Global AI News Aggregator


OLMoE: Open Sparse Mixture-of-Experts Language Model

OLMoE is a fully open LLM built on a sparse Mixture-of-Experts (MoE) architecture. The model has 7B total parameters but uses only 1B active parameters per input token. There is also an instruction-tuned version that claims to outperform Llama-2-13B-Chat and DeepSeekMoE 16B.
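To illustrate why a sparse MoE model can hold many more parameters than it uses per token, here is a minimal sketch of top-k expert routing in plain Python. It is not OLMoE's implementation: the expert count, the top-k value, the toy experts, and the helper names (`top_k_route`, `moe_layer`, `make_expert`) are all hypothetical, and applying softmax only over the selected experts is one common design choice among several.

```python
import math
import random


def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    # Rank experts by router logit and keep the k best.
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax over the selected logits only, so the k weights sum to 1.
    m = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - m) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_layer(x, experts, router_logits, k):
    """Weighted sum of the outputs of only the k selected experts."""
    out = [0.0] * len(x)
    for idx, weight in top_k_route(router_logits, k):
        y = experts[idx](x)  # experts that were not selected are never run
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out


def make_expert(scale):
    """Toy stand-in for an expert feed-forward block (hypothetical)."""
    return lambda x: [scale * xi for xi in x]


NUM_EXPERTS = 8  # hypothetical value, not OLMoE's configuration
TOP_K = 2        # hypothetical value, not OLMoE's configuration

experts = [make_expert(float(i + 1)) for i in range(NUM_EXPERTS)]

rng = random.Random(0)
token = [1.0, -2.0, 0.5]
# In a real model the router logits come from a learned linear layer.
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

routes = top_k_route(logits, TOP_K)
output = moe_layer(token, experts, logits, TOP_K)

# Fraction of expert parameters touched for this token.
active_fraction = TOP_K / NUM_EXPERTS
```

In this toy setup only 2 of 8 experts run for each token, so a quarter of the expert parameters are active. The same idea, at a different scale, is what lets a model with 7B total parameters use about 1B per token.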

→ View original post on X — @dair_ai