Detecting Additional Forces Beyond Token Prediction in Fine-tuned LLMs

Many LLMs have already been fine-tuned and RLHF-trained for objectives other than "predict the next token a human would write". Given that, how would you tell whether the output was being driven by some additional force, over and above the base model and all that fine-tuning?