AI Dynamics

Global AI News Aggregator

LaMDA reinforcement learning approach differs from OpenAI RLHF

From what I gather, LaMDA is not per se using RLHF with capital letters as OpenAI has tersely discussed, but using some (other) sort of reinforcement learning with human feedback of their own that has not been disclosed.

→ View original post on X — @garymarcus,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *