Unlike existing techniques, this method requires no additional fine-tuning. By setting a few parameters in your Hugging Face config.json, you can instantly extend the performance of ALiBi models such as BTLM-3B-8K by ~2x: https://huggingface.co/cerebras/btlm-3b-8k-base#during-inference-without-fine-tuning
…
ALiBi Models Performance Extension Without Fine-Tuning
