The most fun, but frustrating to do (due to data challenges) are open-ended agent fine-tunes. Easier ones are more chat/tool-use focused. A really fun one was experimenting with changing a LLM’s personality significantly. Made it “sassy” lol
Fine-tuning Open-ended LLM Agents: Challenges and Personality Experiments
By
–