A reference guide for choosing and configuring the right pricing structure for your AI product. Each model has different trade-offs for you and your customers.
Charge a fixed rate per token consumed. Simple to understand and easy to implement.
When to use: Developer API products where customers want to pay exactly for what they use.
Meter setup:
prompt_tokens and completion_tokensSUM for bothmodel_id property if you have multiple modelsPricing plan setup:
block_size: 1000)Example rates (fictional):
Trade-offs:
Customers purchase a block of tokens or a dollar amount upfront. Usage draws down the balance. This model is common for developer platforms and self-serve products.
When to use: Products where customers want spending predictability and control.
Solvimon setup:
Variant — dollar credits:
AMOUNT type meter value instead of NUMBERTrade-offs:
A flat monthly fee per user seat that includes a token budget. Usage above the included allowance is billed at an overage rate.
When to use: B2B team products where buyers prefer predictable pricing.
Solvimon setup:
PER_SEAT) for the base chargemonthly_token_budget set as an entitlement per plan tierExample plan:
Trade-offs:
Same subscription, but different per-token rates depending on which model the customer uses. Implemented using pricing rules that evaluate the model_id meter property.
When to use: Platforms that expose multiple LLMs and want to reflect the cost difference to customers.
Solvimon setup:
completion_tokens meter with a required model_id propertymodel_id = gpt-4o → $0.015/1k tokensmodel_id = gpt-4o-mini → $0.0006/1k tokensTrade-offs:
Charge per completed task rather than per token. The customer pays for a “translation”, a “code review”, a “document summary” — not for the underlying tokens consumed.
When to use: Vertical AI agents where customers think in terms of tasks, not tokens. Particularly effective when you can control and optimize the model usage on the backend.
Solvimon setup:
tasks_completedNUMBER, calculation: SUMtask_type property if you have multiple task types at different pricesUSAGE_BASED, flat rate per taskExample:
Implement using a task_type property on the meter and a pricing rule per task type.
Trade-offs:
Real products often combine these. A common pattern:
Solvimon supports multiple product items per pricing plan, so you can combine all of these on a single subscription.