Optimizing token usage in long-context models