The first finding shocked the researchers. They replayed every prompt on both models to isolate what caused the improvement.

– Model effect: 51% of gains
– Prompting effect: 49% of gains

Nearly half came from human behavior.
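One way to picture the replay is as a simple two-step attribution. The sketch below is an assumption about how such a split could be computed, not the researchers' actual method: the function name, its parameters, and the scores in the usage note are all hypothetical. It treats the gain from switching models while holding prompts fixed as the model effect, and the remaining gain from changed prompts as the prompting effect.

```python
def decompose_gain(old_model_old_prompts, new_model_old_prompts, new_model_new_prompts):
    """Split a total quality gain into model and prompting shares.

    All three arguments are hypothetical average scores:
      old_model_old_prompts: old model, prompts users wrote for the old model
      new_model_old_prompts: new model, the same old prompts replayed
      new_model_new_prompts: new model, prompts users wrote for the new model
    """
    total_gain = new_model_new_prompts - old_model_old_prompts
    if total_gain == 0:
        raise ValueError("no overall gain to decompose")

    # Gain from swapping the model while the prompts stay fixed.
    model_effect = new_model_old_prompts - old_model_old_prompts
    # Gain from users changing their prompts while the model stays fixed.
    prompting_effect = new_model_new_prompts - new_model_old_prompts

    return model_effect / total_gain, prompting_effect / total_gain
```

For example, with made-up scores of 50, 60, and 70, `decompose_gain(50, 60, 70)` attributes half of the gain to each effect. The order of the split matters: holding prompts fixed first, as here, can give a different answer than holding the model fixed first.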