RLHF Annotation Bias and Model Vocabulary Development

AI Dynamics

Global AI News Aggregator


19 April 2024, 00:32

I don't know that RLHF would bias that kind of thing. My mental model is that annotators are shown two answers to the same prompt and asked which is "best", so if none of the test prompts happened to touch on the concept of a roadside kiosk, that vocabulary wouldn't be affected.

Original post on X: @simonw, 19 April 2024
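The mental model in the post can be sketched as a toy pairwise-preference setup. This is only an illustration under loud assumptions: the prompts, answers, the bag-of-words "reward model", and the names `comparisons`, `tokens`, `reward`, and `train` are all hypothetical, and real RLHF pipelines use neural reward models whose shared parameters may generalize in ways this sketch cannot capture. It shows only the narrow point that words never appearing in any compared answer receive no direct update.

```python
import math
from collections import defaultdict

# Hypothetical preference records: (prompt, chosen answer, rejected answer).
# None of them mention a roadside kiosk.
comparisons = [
    ("Where can I buy a newspaper?",
     "Try the shop on the corner.",
     "Buy it somewhere, obviously."),
    ("Suggest a quick lunch.",
     "A sandwich from the corner shop works.",
     "Lunch is food, obviously."),
]

def tokens(text):
    """Lowercase whitespace tokenizer that strips basic punctuation."""
    return [w.strip(".,?!").lower() for w in text.split()]

def reward(weights, answer):
    """Toy reward: sum of per-word weights (an assumption, not a real model)."""
    return sum(weights[t] for t in tokens(answer))

def train(comparisons, epochs=50, lr=0.1):
    """Fit per-word weights with a Bradley-Terry style pairwise loss."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for _prompt, chosen, rejected in comparisons:
            margin = reward(weights, chosen) - reward(weights, rejected)
            # Gradient step on -log(sigmoid(margin)) with respect to the margin.
            step = lr * (1.0 - 1.0 / (1.0 + math.exp(-margin)))
            for t in tokens(chosen):
                weights[t] += step
            for t in tokens(rejected):
                weights[t] -= step
    return weights

weights = train(comparisons)
touched = {t for t, w in weights.items() if w != 0.0}

print("kiosk weight:", weights.get("kiosk", 0.0))
print("words with a direct update:", sorted(touched))
```

In this toy, "kiosk" never receives a weight because no annotated pair contains it, which mirrors the reasoning in the post; whether the same holds for a full-scale model is exactly the open question the post raises.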


