LoRA vs Full Fine-tuning: Combining Techniques for Model Optimization
I've been experimenting with that recently. It seems pretty similar. You can do both, BTW: LoRA for the initial layers and full fine-tuning for the final ones.
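To make the mixed setup concrete, here is a rough, dependency-free sketch of how the trainable parameter count splits when the initial layers get LoRA adapters and the final layers are fully fine-tuned. Everything in it is an assumption for illustration: the `plan_mixed_finetune` helper and `LayerPlan` class are hypothetical names, each layer is modelled as a single weight matrix with no bias, and the sizes (12 layers of 768 x 768, rank 8, last 2 layers fully tuned) are made up rather than taken from any real run.

```python
from dataclasses import dataclass


@dataclass
class LayerPlan:
    index: int
    mode: str        # "lora" or "full"
    trainable: int   # number of trainable parameters in this layer


def plan_mixed_finetune(num_layers, d_in, d_out, rank, num_full_layers):
    """Assign LoRA to the initial layers and full fine-tuning to the final ones.

    Hypothetical helper: each layer is one d_in x d_out weight matrix, biases ignored.
    """
    if not 0 <= num_full_layers <= num_layers:
        raise ValueError("num_full_layers must be between 0 and num_layers")
    first_full = num_layers - num_full_layers
    plans = []
    for i in range(num_layers):
        if i < first_full:
            # LoRA: the weight matrix stays frozen; only the two low-rank
            # factors train, with rank * (d_in + d_out) entries in total.
            plans.append(LayerPlan(i, "lora", rank * (d_in + d_out)))
        else:
            # Full fine-tuning: every entry of the weight matrix is trainable.
            plans.append(LayerPlan(i, "full", d_in * d_out))
    return plans


# Made-up sizes: 12 layers of 768 x 768, LoRA rank 8, last 2 layers fully tuned.
plans = plan_mixed_finetune(num_layers=12, d_in=768, d_out=768, rank=8, num_full_layers=2)
total_trainable = sum(p.trainable for p in plans)
total_params = 12 * 768 * 768
print([p.mode for p in plans])
print(f"trainable: {total_trainable} of {total_params}")
```

In this toy setup the fully tuned final layers account for most of the trainable parameters, so `num_full_layers` is the main knob for trading memory against capacity. In a real framework the same split would usually come down to which layers get adapters and which have their weights left unfrozen.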