We just added thinking block preservation in the Claude API. You can now control how thinking blocks are managed in your context window, resulting in more cache hits and lower costs.
Claude API adds thinking block preservation for better caching
By
–
Leave a Reply