We introduce CURIE, a scientific long-Context Understanding, Reasoning and Information Extraction benchmark to measure the potential of large language models in scientific problem-solving and assisting scientists in realistic workflows. Learn more at https://goo.gle/4jah5Ds
CURIE: Scientific Long-Context Understanding Benchmark for LLMs
