Feels to me like a bit of a reputation risk to @lmsysorg though if this is indeed a stealth model launch They're supposed to be a neutral benchmarking tool, it's not a great look if they're working behind-the-scenes with model vendors in an opaque manner like this
LMSYS Reputation Risk: Opaque Model Launch Concerns
By
–
Leave a Reply