AI Dynamics

Global AI News Aggregator

Detecting Additional Forces Beyond Token Prediction in Fine-tuned LLMs

Many LLMs have already been RLHFed and finetuned into activities other than "predict the next token a human would write". This being the case, how would you tell if the output was being driven by some extra force above that and all the finetuning?

→ View original post on X: @esyudkowsky
