AI Dynamics

Global AI News Aggregator

About

Using Ollama for Testing and vLLM for Production Serving

I use ollama most of the times for testing and prototyping and then vLLM when i need to serve it.

→ View original post on X — @akshay_pachaar,