AI Dynamics

Global AI News Aggregator

Flan-T5-XXL Performance Compared to Decoder-Only Models

People also forget that Flan-T5-XXL runs at roughly the speed of a ~5B+ decoder-only model, because it's an encoder-decoder model: each token passes through only the encoder or only the decoder, not all 11B parameters.
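
To make the arithmetic behind that comparison concrete, here is a rough back-of-the-envelope sketch. It assumes a total of about 11B parameters and an even split between encoder and decoder; the even split is an illustrative assumption (the real split is not exactly even), and the function name `active_params_per_token` is hypothetical.

```python
def active_params_per_token(total_params: float, encoder_fraction: float = 0.5) -> dict:
    """Estimate how many parameters a single token touches in an encoder-decoder model.

    Assumption: input tokens only run through the encoder and output tokens
    only run through the decoder. `encoder_fraction` is an illustrative guess,
    not a measured value.
    """
    encoder_params = total_params * encoder_fraction
    decoder_params = total_params - encoder_params
    return {"input_token": encoder_params, "output_token": decoder_params}


FLAN_T5_XXL_PARAMS = 11e9  # approximate total size of Flan-T5-XXL

estimate = active_params_per_token(FLAN_T5_XXL_PARAMS)
for kind, params in estimate.items():
    print(f"{kind}: ~{params / 1e9:.1f}B parameters touched")
```

Under these assumptions each token touches about 5.5B parameters, which is where the "~5B+ decoder-only" comparison comes from. Actual speed also depends on sequence lengths, batching, and hardware, so treat this as an intuition aid rather than a benchmark.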

→ View original post on X — @yitayml