AI Dynamics

Global AI News Aggregator

About

HealthBench: New AI Evaluation Benchmark for Healthcare Settings

Evaluations are essential to understanding how models perform in health settings. HealthBench is a new evaluation benchmark, developed with input from 250+ physicians from around the world, now available in our GitHub repository.

→ View original post on X — @openai