Thanks, glad it landed. For TTFT, the first thing I check is prompt length and whether the prefix is stable enough to hit prompt caching. Most wins come from trimming the system prompt and keeping volatile content at the end, not from model-level changes.