Very exciting article about the new MathFrontier benchmark, which they say LLMs are currently still failing at. The tasks are so difficult, so specialized, that perhaps PhDs from the special field can solve a task, but certainly not in general. It will be very exciting to see
MathFrontier Benchmark: New Challenge for LLM Capabilities
By
–
Leave a Reply