If AZ cannot rely on a pre-trained LM to communicate, it is impossible (there is no reason the model learns with self play to communicate with valid sentences). Other issue will of course be the infinite action space (which we also had in theorem proving) but this is manageable
Pre-trained Language Models Essential for Agent Communication
By
–
Leave a Reply