I'm excited about @OpenAI
's repository for frontier tasks that challenge even GPT-4. Yesterday I coded up my own symbolic reasoning eval in the spirit of http://
arxiv.org/abs/2303.03846. Took <2 hours from start to end!
As a bonus, submitters also get GPT-4 access.
OpenAI Frontier Tasks Repository and Custom Symbolic Reasoning Evaluation
By
–
Leave a Reply