There are so few benchmarks the AI companies compete on outside of software and general knowledge benchmarks. They also fine tune obsessively to optimize software. Language, turn-taking, logical reasoning, lack of hallucinations, and other critical issues get less clear focus.
AI Companies Neglect Language and Reasoning Benchmarks for Software Optimization
By
–
Leave a Reply