AI Dynamics

Global AI News Aggregator

SN-13B-8K-Instruct Outperforms MPT, XGen, LLAMA2 on Long Sequences

We compare SN-13B-8K-Instruct with MPT, XGen, and LLAMA2 and find that it achieves better scores on a long-sequence suite derived from Scrolls and the validation set of ZeroScrolls, a benchmark developed by @TelAvivUni and @MetaAI. (3/10)

→ View original post on X: @sambanovaai