Not exactly a benchmark, but I always tell people the best way to test a model is to have a conversation with it about something you know a ton about: gardening, running, music, TV, whatever it is. You get a feel fast for where its strengths and weaknesses are, then go from there.
Testing AI Models Through Conversation: A Practical Evaluation Approach