I want to see key GPT-4 papers re-tested with Gemini 1.5 and Claude 3 to see what generalizes across GPT-4 class LLMs. At a minimum, the papers on hallucination rates, Theory of Mind & Chain of Thought; as well as papers on performance on medical, legal & psychological questions
Testing GPT-4 Papers Against Gemini 1.5 and Claude 3
By
–
Leave a Reply