Please ignore the deluge of complete nonsense about Q*.
One of the main challenges in improving LLM reliability is replacing auto-regressive token prediction with planning. Pretty much every top lab (FAIR, DeepMind, OpenAI, etc.) is working on that, and some have already published
LLM Reliability: Replacing Auto-Regressive Prediction with Planning