Why do you say we haven’t solved “hierarchical structure”? Transformers can learn parse trees & bracketing, each of which is full generalizable to arbitrary hierarchies. I think we totally machine learn hierarchical structures now.
Transformers Can Learn Hierarchical Structures Through Parse Trees
By
–
Leave a Reply