We just open-sourced FinQA — an #RL environment for financial reasoning agents. Real SEC 10-K data, multi-step reasoning + tool use, constrained SQL, binary rewards. The whole 9 yards! The kicker: a 4B model fine-tuned with FinQA outperformed a 235B model from the same family on finance reasoning: 58x smaller!
→ View original post on X — @snorkelai, 2026-03-31 00:19 UTC
Leave a Reply