Span corruption in t5 is just a data operation over regular text. Masking stuff in inputs and moving them to targets. It's still fundamentally autoregressive loss on the decoder end.
T5 Span Corruption: Data Masking and Autoregressive Decoder Loss
By
–
Leave a Reply