We compare SN-13B-8K-Instruct with MPT, XGen and LLAMA2 and find that this model achieves better scores on long sequence suite derived from Scrolls and validation set of ZeroScrolls, a benchmark developed by @TelAvivUni and @MetaAI
. (3/10)
SN-13B-8K-Instruct Outperforms MPT, XGen, LLAMA2 on Long Sequences
By
–
Leave a Reply