huggingface transformers remains extremely useful for getting started but has bloated beyond recognition and is no longer helpful for learning someone should make nano-transformers. implement all the models from scratch without all the dependencies. would be amazing
Nano-Transformers: Simplifying Model Implementation Without Dependencies
By
–
