We kept MRCR in the system card for scientific honesty, but we've actually been phasing it out slowly. Two reasons: (1) it's built around stacking distractors to trick the model, which isn't how people actually use long context, and (2) we care more about applied long-context
MRCR Phased Out: Focus Shifts to Applied Long-Context
By
–
Leave a Reply