2022:
– oh nooo!!! you can't run language models on cpu! you need an expensive nvidia GPU and special CUDA kernels and–
– *one bulgarian alpha chad sits down and writes some c++ code to run LLMs on cpu* – code works fine (don't need a GPU), becomes llama.cpp 2023:
– oh noo!!
Bulgarian Developer Makes LLMs Run on CPU with llama.cpp
By
–