Billing & Tokens
INFER uses a prepaid token-based billing system. Buyers add funds to their account and pay per token of inference used. Operators earn a share of each request processed.
How Billing Works
For Buyers
- Add funds — Deposit USDC or pay with credit card to add balance to your account
- Use the API — Each inference request deducts tokens from your balance based on usage
- Monitor usage — View detailed usage logs in the dashboard with per-request breakdowns
For Operators
- Set pricing — Configure your price per 1K tokens in the operator dashboard
- Serve requests — Process inference requests routed to your node
- Earn tokens — Receive 80% of the request cost as earnings
- Withdraw — Transfer earnings to your wallet or bank account
Pricing Model
Pricing is based on tokens processed — both input (prompt) tokens and output (completion) tokens.
Default Platform Pricing
| Model Tier | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| Small (7B) | $0.10 | $0.15 |
| Medium (13–34B) | $0.30 | $0.50 |
| Large (70B+) | $0.80 | $1.20 |
| Premium (405B) | $2.00 | $3.00 |
Operators can set custom pricing that overrides the platform defaults.
Revenue Split
| Party | Share | Description |
|---|---|---|
| Node Operator | 80% | Compensation for GPU compute |
| INFER Platform | 20% | Routing, billing, marketplace, infrastructure |
Token Counting
Tokens are counted using the model’s native tokenizer. As a rough approximation:
- 1 token ≈ 4 characters in English
- 1 token ≈ ¾ of a word
- 100 tokens ≈ 75 words
Both input and output tokens are metered. The total cost for a request is:
cost = (input_tokens × input_price) + (output_tokens × output_price)Balance Management
Adding Funds
Funds can be added through:
- Crypto — Send USDC to your INFER deposit address (Ethereum, Base, or Solana)
- Credit Card — Pay with Visa/Mastercard (processed via Stripe)
Minimum deposit: $5.00. No maximum.
Low Balance Warnings
The platform sends notifications when your balance drops below:
- $5.00 — Email warning
- $1.00 — Email + desktop app notification
- $0.00 — API requests return
402 Payment Required
Earnings Withdrawal
Operators can withdraw earnings when:
- Minimum balance of $10.00 in earnings
- Account is in good standing (no violations)
- KYC verified (for withdrawals over $500)
Usage Logs
Every API request is logged with:
- Timestamp
- Model used
- Input and output token counts
- Cost breakdown
- Node that served the request
- Response latency
- Status (success/error)
View usage logs at Dashboard → Usage Logs or via the API at GET /api/usage.
Billing FAQ
Q: What happens if my balance runs out mid-request?
The request will complete normally. Your balance may go slightly negative (up to the cost of one response). Add funds to continue making requests.
Q: Are there volume discounts?
Not currently. All users pay the same per-token rate. Enterprise pricing with volume commitments is planned for Q3 2026.
Q: How are streaming requests billed?
Streaming requests are billed identically to non-streaming requests — based on total input + output tokens. The streaming format doesn’t affect pricing.