AI Dynamics

Global AI News Aggregator

1h Prompt Cache: Cost Analysis and Usage Patterns

The 1h prompt cache is a nuanced tradeoff: cache writes cost more, and cache reads cost less. Whether you benefit from the cheaper cache reads depends on your usage pattern: context window size, whether the query comes from the main agent or a subagent, etc. We have been testing a
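To make the write-versus-read tradeoff concrete, here is a minimal sketch of a cost model. It is not from the original post. The names (CacheTier, SHORT_TTL, LONG_TTL, session_cost), the comparison against a shorter-lived tier, and all multiplier values are illustrative assumptions; it also assumes a cache hit refreshes the TTL and ignores tokens appended between requests. Costs are in units of "one uncached pass over the cached prefix".

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class CacheTier:
    """Hypothetical cache tier; multipliers are relative to the base input price."""
    ttl_minutes: float
    write_multiplier: float  # cost of writing the prefix into the cache
    read_multiplier: float   # cost of reading the prefix from the cache


# Placeholder values for illustration only, not quoted prices.
SHORT_TTL = CacheTier(ttl_minutes=5, write_multiplier=1.25, read_multiplier=0.1)
LONG_TTL = CacheTier(ttl_minutes=60, write_multiplier=2.0, read_multiplier=0.1)


def session_cost(gaps_minutes: Sequence[float], tier: CacheTier) -> float:
    """Cost of one session: a first request plus one request per gap.

    Each gap is the idle time (in minutes) before the next request.
    Assumption: a hit refreshes the TTL, so only the most recent gap matters.
    """
    writes = 1  # the first request always pays for a cache write
    reads = 0
    for gap in gaps_minutes:
        if gap <= tier.ttl_minutes:
            reads += 1   # cache still warm: cheap read
        else:
            writes += 1  # cache expired: pay for the write again
    return writes * tier.write_multiplier + reads * tier.read_multiplier


def cheaper_tier(gaps_minutes: Sequence[float]) -> str:
    """Return which of the two hypothetical tiers is cheaper for this pattern."""
    short = session_cost(gaps_minutes, SHORT_TTL)
    long_ = session_cost(gaps_minutes, LONG_TTL)
    return "long" if long_ < short else "short"
```

Under these assumed numbers, a short burst of requests one minute apart (for example, a subagent that finishes quickly) never lets the short-lived cache expire, so cheaper_tier([1, 1, 1]) favors the short tier and the pricier long-TTL write is wasted. A session with pauses of 10 to 30 minutes (for example, a main agent waiting on a person) keeps re-paying writes on the short tier, so cheaper_tier([10, 20, 30]) favors the long one.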

→ View original post on X — @bcherny