“(1D) Ordered Tokens Enable Efficient Test-Time Search” Most AR image models generate 2D grid tokens patch-by-patch, so partial generations are hard to verify. This paper shows that 1D coarse-to-fine tokens make search much easier, with early tokens encode global semantics,
1D Ordered Tokens Improve AR Image Model Test-Time Search
By
–
Leave a Reply