References: – Finding Blind Spots in Evaluator LLMs with Interpretable Checklists https://
arxiv.org/abs/2406.13439
v1
…
– Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment https://
arxiv.org/abs/2402.14016
– On the Limitations of Fine-tuned Judge Models
LLM Evaluators: Blind Spots and Adversarial Vulnerabilities Exposed
By
–