this is what saves model costs in a big way. dynamic prefix cache thrashing by relocating volatile dynamic context blocks to make cache invariants.
Dynamic prefix cache saves model costs
Dynamic prefix cache thrashing saves model costs significantly.