AI Dynamics

Global AI News Aggregator

SN-13B-8K-Instruct Outperforms MPT, XGen, LLAMA2 on Long Sequences

We compare SN-13B-8K-Instruct with MPT, XGen, and LLAMA2 and find that it achieves better scores on a long-sequence suite derived from Scrolls and on the validation set of ZeroScrolls, a benchmark developed by @TelAvivUni and @MetaAI. (3/10)

→ View original post on X — @sambanovaai
