I’d love to understand this better too… I thought it was just a quirk of the specifics of labeling instructions, but then multiple (what I think should be mostly independent) language models seem to all do this.
Understanding Consistent Behavior Across Independent Language Models
By
–
Leave a Reply