Meta presents Layer Skip – up-to 200% fast inference > Applies layer dropout: low rates for early layers, high rates for later layers
> Uses early exit loss with shared exit for all transformer layers Inference: > Increases early exit accuracy without auxiliary layers
>
Meta Layer Skip Enables 200% Faster Transformer Inference
By
–
Leave a Reply