If you have some (big) GPUs laying around there’re a lot of cool variant to train on top of Falcon btw. A longer context would be awesome and of course instruction finetuned (TII original IFT one is already #1 on the open leaderboard). Excited to see what you’ll share!
Training Falcon Model Variants with Extended Context Windows
By
–
Leave a Reply