Congratulations to @StanfordLaw liftlab on the release of JudgmentBench, the first publicly available benchmark in a high-judgment domain where both methods for assessing quality are solicited over the same tasks. Snorkel was proud to contribute as research and data partners on
JudgmentBench: First Public Benchmark for AI Quality Assessment Released
By
–
