Don’t all the providers do this, though? Anthropic/Claude has different pricing based on if you’re caching for five mins vs one hour (which are the only two options for cache TTL). https://platform.claude.com/docs/en/about-claude/pricing
Copilot was uniquely awful at this, because up to until literally days before the switch to usage based billing there was no way for people to track token usage, despite repeated calls from the community.
Microsoft only added a billing “projection” feature on the admin page that was meant to download a spreadsheet (which straight up didn’t work for most people) less than a week before the new billing structure.
Don’t all the providers do this, though? Anthropic/Claude has different pricing based on if you’re caching for five mins vs one hour (which are the only two options for cache TTL). https://platform.claude.com/docs/en/about-claude/pricing
Copilot was uniquely awful at this, because up to until literally days before the switch to usage based billing there was no way for people to track token usage, despite repeated calls from the community.
Microsoft only added a billing “projection” feature on the admin page that was meant to download a spreadsheet (which straight up didn’t work for most people) less than a week before the new billing structure.
Well, the point here is the deception. So, if you can find a similar link from the past from Microsoft…