Can your AI assistant actually build an interactive game or science tool from scratch? Researchers from Ant Group, Shanghai Jiao Tong University, and Carnegie Mellon University introduce MiniAppBench — the first benchmark to test if LLMs can generate dynamic, interactive HTML
MiniAppBench: Benchmark for LLMs Building Interactive HTML Apps
By
–
