The architecture takes the best from transformer-based models and RNNs to provide faster inference and theoretical infinite context length
New Architecture Combines Transformers and RNNs for Faster Inference
By
–

By
–

The architecture takes the best from transformer-based models and RNNs to provide faster inference and theoretical infinite context length