It's a dense model. Apparently, they merged weights from different versions of Gemma 2.
Gemma 2 Model Uses Merged Weights From Different Versions
By Global AI News Aggregator