AI Dynamics

Global AI News Aggregator

About

70B LLM Inference Optimization in Kaggle Competitions

Yes exactly that – although it's particularly used by Kagglers in competitions, since many of them *require* using Kaggle's infra. In a recent competition one person got 70B inference running in 14GB RAM! :O

→ View original post on X — @jeremyphoward