AI Dynamics

Global AI News Aggregator

Pointwise vs Cumulative Metrics in Language Model Evaluation

Good catch! The numbers in the old table were pointwise estimates – pointwise performance is a bucketed estimate over context lengths, while the paper reports a cumulative average over context lengths. Pointwise and cumulative metrics are naturally incomparable and the pointwise

→ View original post on X — @oriolvinyalsml,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *