AI Dynamics

Global AI News Aggregator

13B Parameter Long Sequence Model Achieves Competitive Accuracy

Using this recipe, we are able to train a competitive long-sequence model at the 13B-parameter scale. We achieve 2 to 12 points better accuracy across a wide variety of long-sequence tasks from SCROLLS and ZeroSCROLLS. (7/10)

→ View original post on X (@sambanovaai)
