Rubrics have become widely accepted for evaluating agents and models, but how are we evaluating the rubrics themselves? In a new paper we’ll be presenting at the Data-FM workshop at @iclr_conf, we introduce RIFT: a taxonomy of 8 rubric failure modes across:
➜ reliability
➜ …
— @snorkelai
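The post doesn’t excerpt the paper, but as a rough illustration of what a “reliability” failure mode can mean in practice, here is a minimal sketch (hypothetical names, not RIFT’s actual method): re-score the same response under the same rubric with an LLM judge and flag the rubric when repeated scores disagree.

```python
# Hypothetical sketch of a rubric "reliability" probe: score the same
# response under the same rubric several times and flag high variance.
# `judge_score` stands in for any LLM-judge call; not RIFT's actual API.
import statistics
from typing import Callable

def reliability_check(
    judge_score: Callable[[str, str], float],  # (rubric, response) -> score
    rubric: str,
    response: str,
    n_trials: int = 5,
    max_stdev: float = 0.5,
) -> bool:
    """Return True if repeated judgments under this rubric are stable."""
    scores = [judge_score(rubric, response) for _ in range(n_trials)]
    return statistics.stdev(scores) <= max_stdev
```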
-
RIFT: Taxonomy of Rubric Failure Modes for AI Evaluation
-
Snorkel AI attending ICLR 2026 conference
We’ll be there. Come find us: http://snorkel.ai/iclr-2026/ – more soon.
-

Snorkel AI Happy Hour in London – Two Days Away
Only 2 days until we’re in London for @aiDotEngineer 🇬🇧 Come connect with the Snorkel AI team and fellow leaders shaping the future of AI agents – over drinks and conversation 🍻 Last chance to register for Tuesday’s Happy Hour at Bantof: luma.com/SnorkelVIPHappyHour…
→ View original post on X — @snorkelai, 2026-04-05 19:48 UTC
-
Building the Benchmark Factory: Harbor Framework’s Infrastructure Approach
ICYMI – How can we build the benchmark factory?
I'm very excited about the infra approach from @harborframework, because @alexgshaw @ryanmart3n & team obsess over researcher/developer UX (e.g. quality guardrails, low friction to RL/scaled rollouts)! pic.twitter.com/0MLD5Ggkzm
— vincent sunn chen (@vincentsunnchen), 3 April 2026
→ View original post on X — @snorkelai, 2026-04-03 18:14 UTC
-

Snorkel AI attending AIE Europe conference in London next week

We'll be in London next week for AIE. Come say hi (DMs open)!! 🇬🇧
Quoting swyx (@swyx): so AIE Europe is completely taking over 🇬🇧 London next week! very very hyped to showcase the best companies, research, and AI engineers in Europe! 3 COMPLETELY FREE ways to join in:
– there are a dozen side events around town! from Snorkel to GitHub to Arize to ClawCon and Claude Code meetups!
– subscribe on YouTube! everything will be livestreamed and published for free piped.video/@aidotengineer
– we are releasing 20 more volunteer slots here ai.engineer/associates meant for local, early career folks who otherwise could not afford a ticket!
join in/see you in london town! — https://nitter.net/swyx/status/2039398936423850017#m
→ View original post on X — @snorkelai, 2026-04-01 19:25 UTC
-

Snorkel AI Happy Hour in London on April 7

See you in London 🇬🇧 Snorkel AI is hosting a happy hour at Bantof on April 7 for folks working on AI agents, evals, datasets, and open source. Great chance to meet others building in the space (plus food & drinks 🍻) Request an invite: luma.com/SnorkelVIPHappyHour…
Quoting swyx (@swyx): https://nitter.net/swyx/status/2039398936423850017#m
→ View original post on X — @snorkelai, 2026-04-01 18:44 UTC
-
Need 1000x More Benchmarks for Coding AI Evaluation
“We need a thousand times more benchmarks than we have right now” is @alexgshaw of @LaudeInstitute's take on the current moment. “Coding is an extremely broad domain, 89 tasks isn’t nearly enough.”
Full Benchtalks interview posted by @vincentsunnchen; YouTube link in the replies. pic.twitter.com/BRxjq6uzR4
— Snorkel AI (@SnorkelAI), 1 April 2026
→ View original post on X — @snorkelai, 2026-04-01 16:35 UTC
-
Terminal-Bench 2 Scores Jump to 75-80% in 4 Months
Top scores on Terminal-Bench 2 went from ~25% → 75-80% in just 4 months.
For Benchtalks #1, @vincentsunnchen sat down with @alexgshaw to dig into what happens when your benchmark gets solved before you're ready for the next one.
Key takes:
→ The terminal is the right abstraction for agentic AI
→ Harbor exists because benchmarking and RL at scale are infra problems
→ "Benchmaxxing" is real; the defense is shipping harder tasks faster
→ TB3 is coming, and they want your hardest unsolvable problems
"We need 1000x more benchmarks than we have right now" — @alexgshaw pic.twitter.com/XkWlJT8SKT
— Snorkel AI (@SnorkelAI), 31 March 2026
→ View original post on X — @snorkelai, 2026-03-31 19:21 UTC
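The post doesn’t show Terminal-Bench internals; as a hedged sketch of why the terminal works as an eval abstraction, here is a generic harness (invented structure, not Terminal-Bench’s actual task format): run the agent’s shell command in a scratch directory, then grade by the exit code of a verifier command.

```python
# Generic sketch of a terminal-style eval task: run the agent's command,
# then a verifier command, and grade on the verifier's exit code.
# Assumes a POSIX shell; not Terminal-Bench's actual task format.
import subprocess
import tempfile

def run_terminal_task(agent_cmd: str, verify_cmd: str, timeout: int = 60) -> bool:
    """Execute the agent command in a scratch dir, then verify the result."""
    with tempfile.TemporaryDirectory() as workdir:
        subprocess.run(agent_cmd, shell=True, cwd=workdir, timeout=timeout)
        check = subprocess.run(verify_cmd, shell=True, cwd=workdir, timeout=timeout)
        return check.returncode == 0  # pass iff the verifier succeeds

# Toy example: the "agent" must create a file; the verifier checks it exists.
print(run_terminal_task("touch done.txt", "test -f done.txt"))
```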
-
BenchTalks Episode: Open Benchmark Grants Available Now
🎧 Full episode: snorkel.ai/blog/benchtalks-a…
💰 Building an open benchmark? Apply for Snorkel's Open Benchmark Grants ↓ snorkel.ai/open-benchmark-gr…
→ View original post on X — @snorkelai, 2026-03-31 19:21 UTC
-
Terminal-Bench 3.0 and the Benchmark Factory Revolution
Terminal-Bench 2.0 went from ~25% → 80% in four months and became the standard eval for frontier CLI agents. Now, TB3 is in the works.
I talked to @alexgshaw about what happens when model capabilities climb faster than we can measure them.
His answer: the benchmark factory (@harborframework) – infrastructure to develop hard, representative evals at the pace that the frontier moves. As Alex put it: "we need a thousand times more benchmarks than we have right now." pic.twitter.com/phNP7ni43t
00:23 – How quickly models hill-climbed TB2
01:46 – What rapid progress reveals about benchmarks vs. real-world capability
03:28 – What made Terminal-Bench stick
04:58 – Why the terminal is the right abstraction for agentic AI
07:14 – How TB2 maintains task quality at scale
09:23 – Managing benchmark integrity in a benchmaxxing world
10:47 – Harbor: from experiment to benchmark factory
12:19 – What Harbor does that nothing else did
14:37 – The invariants: what won't change as agent evals evolve
16:55 – The benchmark Alex most wants to see built
18:18 – The ideal human-in-the-loop task creation flywheel
20:32 – How to contribute to Terminal-Bench 3.0
— vincent sunn chen (@vincentsunnchen), 31 March 2026
→ View original post on X — @snorkelai, 2026-03-31 18:50 UTC
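The episode describes Harbor’s “quality guardrails” only at a high level; as one hedged sketch of the idea behind a benchmark factory, an intake step might validate every submitted task before it enters the pool (the schema below is invented for illustration, not Harbor’s actual format).

```python
# Hypothetical task-intake guardrail for a "benchmark factory": reject
# task specs missing a description, a verifier, or a sane timeout.
# Invented schema for illustration; not Harbor's actual task format.
from dataclasses import dataclass

@dataclass
class TaskSpec:
    name: str
    description: str
    verify_cmd: str    # command whose exit code grades the task
    timeout_sec: int

def validate_task(task: TaskSpec) -> list[str]:
    """Return guardrail violations; an empty list means the task is accepted."""
    problems = []
    if not task.description.strip():
        problems.append("missing description")
    if not task.verify_cmd.strip():
        problems.append("missing verifier command")
    if not (0 < task.timeout_sec <= 3600):
        problems.append("timeout must be in (0, 3600] seconds")
    return problems
```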