AI Dynamics

Global AI News Aggregator

Prompt Cache Pricing: Read vs Write Cost Trade-offs

1h prompt cache is nuanced actually. It costs more for cache writes, and less for cache reads. Whether you benefit from cheaper cache reads depends on your usage pattern — context window size, whether the query is the main agent or subagent, etc. We have been testing a

→ View original post on X — @bcherny,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *