When NOT to use ensembling: Simple factual lookups ("What's the capital of France?") Time-sensitive tasks (response speed matters) Low-stakes decisions (what to eat for lunch) When you're paying per API call and budget is tight Cost: 5x the tokens. Use