That's forcing developers to get creative, including shifting their focus to fine-tuning models and moving more work over to CPUs. As one source told me about increasing CPU usage: "Would you prefer slow inference, or no inference?"
Developers Shift to CPU-Based Inference and Model Fine-Tuning