In addition to the standard 5-minute prompt caching TTL, we now offer an extended 1-hour TTL. This reduces costs by up to 90% and reduces latency by up to 85% for long prompts, making extended agent workflows more practical.
Extended Prompt Caching TTL Reduces Costs and Latency
By
–
Leave a Reply