AI Dynamics

Global AI News Aggregator

T5 Span Corruption: Data Masking and Autoregressive Decoder Loss

Span corruption in t5 is just a data operation over regular text. Masking stuff in inputs and moving them to targets. It's still fundamentally autoregressive loss on the decoder end.

→ View original post on X — @yitayml,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *