Next: @willkoh_kc from @tryramp tested GPT-5.5 inside their own harness.
— Romain Huet (@romainhuet) 5 mai 2026
“It was discovering ways to use the tools that we had given it… and figuring out novel ways to solve problems.”
He also tested it on financial-doc evals: GPT-5.5 hit their highest perfect extraction rate. pic.twitter.com/4C1N8QIrsc
Next: @willkoh_kc from @tryramp tested GPT-5.5 inside their own harness. “It was discovering ways to use the tools that we had given it… and figuring out novel ways to solve problems.” He also tested it on financial-doc evals: GPT-5.5 hit their highest perfect extraction rate.
Leave a Reply