EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements Paper: https://
pub.sakana.ai/edinet-bench/ We released a Japanese financial benchmark on @Huggingface
, designed to evaluate the performance of LLMs on financial tasks like fraud detection in Japan.
EDINET-Bench: Evaluating LLMs on Japanese Financial Tasks
By
–
Leave a Reply