See live pricing
Up-to-date per-model rates live on deepshi.ai. They’re the source of truth and can change over time.
How a request is priced
Pricing is per model, in USD per 1M tokens, with separate rates for input (prompt) and output (completion) tokens:usage.prompt_tokens_details.cached_tokens).
You never have to guess the cost
Every successful response reports exactly what it cost inusage.cost.total_cost (USD), drawn from your balance:
"stream_options": {"include_usage": true} to get a final chunk with the same usage (including cost). Standard OpenAI SDKs ignore the extra cost field, so it doesn’t break compatibility.
Credits & billing
How your balance, top-ups, and running out of credits work.
Text models
Context windows and capabilities for every model.