AI Dynamics

Global AI News Aggregator

Answer-Based Evals Limitations and Conversation-Based Comparisons

Agreed, I don't like this behavior either. IMO, this is a limitation of our answer-based evals, where answers that contain more information are preferred over shorter ones. Conversation-based comparisons might prevent this, because you'd judge the entire experience rather than each answer in isolation.

→ View original post on X: @maximelabonne
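
To make the contrast in the post concrete, here is a toy sketch of the two judging styles. Nothing in it comes from the original post: the `info_score` proxy (distinct word count), the `word_budget` reading budget, the penalty weight, and the sample conversations are all hypothetical choices made only for illustration. Real evals would use human raters or a judge model rather than word counts.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    question: str
    answer: str


def info_score(answer: str) -> int:
    """Toy stand-in for a judge that rewards information: distinct word count."""
    return len(set(answer.lower().split()))


def answer_based_winner(a: str, b: str) -> str:
    """Compare two single answers in isolation; the more 'informative' one wins."""
    sa, sb = info_score(a), info_score(b)
    if sa == sb:
        return "tie"
    return "A" if sa > sb else "B"


def conversation_score(turns: list[Turn], word_budget: int = 20) -> float:
    """Toy whole-conversation score (assumption): information gained minus a
    penalty for every word the user must read beyond a fixed budget."""
    info = sum(info_score(t.answer) for t in turns)
    total_words = sum(len(t.answer.split()) for t in turns)
    overflow = max(0, total_words - word_budget)
    return info - 2.0 * overflow


def conversation_based_winner(conv_a: list[Turn], conv_b: list[Turn]) -> str:
    """Compare two full conversations instead of individual answers."""
    sa, sb = conversation_score(conv_a), conversation_score(conv_b)
    if sa == sb:
        return "tie"
    return "A" if sa > sb else "B"


# Hypothetical sample data: model A is verbose, model B is concise.
verbose = [
    Turn("Capital of France?",
         "Paris is the capital of France and it is also the largest city "
         "in the country with many museums"),
    Turn("Can I take a train to Lyon?",
         "Yes you can take a train from Paris to Lyon and the trip takes "
         "about two hours on the fast line"),
]
concise = [
    Turn("Capital of France?", "Paris"),
    Turn("Can I take a train to Lyon?", "Yes, about two hours by train"),
]

if __name__ == "__main__":
    for v, c in zip(verbose, concise):
        print("per-answer winner:", answer_based_winner(v.answer, c.answer))
    print("conversation winner:", conversation_based_winner(verbose, concise))
```

In this sketch the verbose model wins every per-answer comparison, while the conversation-level score can favor the concise model once the cumulative reading cost is counted. The outcome depends entirely on the assumed penalty, so it illustrates the argument rather than supporting it with evidence.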
