I got GPT-5 Pro to research situational awareness by @leopoldasch and assess whether his claims are True, Tracking True, Tracking False, False or Not Enough Info. So far it is doing pretty well 8 out of 13 True or Tracking True, only 1 tracking false. Including the link below
GPT-5 Pro Evaluates Situational Awareness Claims Accuracy
By
–
Leave a Reply