that’s one particular benchmark that came up, not a universal claim by me (or anyone else, AFAIK). compare “Gary mentioned a particular benchmark that didn’t include reasoning models” (true) with “no studies have indicated hallucinations within reasoning models” (false). you