AI Dynamics

Global AI News Aggregator

Clarification on supervised instruction-finetuning versus RLHF methods

Btw. was supervised instruction-finetuning vs RLHF instruction-finetuning what you had in mind, or were you more curious about supervised finetuning as in classification vs supervised instruction-finetuning?

→ View original post on X — @rasbt,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *