Yes exactly that – although it's particularly used by Kagglers in competitions, since many of them *require* using Kaggle's infra. In a recent competition one person got 70B inference running in 14GB RAM! :O
70B LLM Inference Optimization in Kaggle Competitions
By
–