Pricing
This page provides detailed pricing information for Rakuten's models and features. All prices are in USD.
For the most current pricing information, please visit our pricing page.
Model pricing
The following table shows pricing for all Rakuten models.
| Model | Input | Output | Batch Input | Batch Output |
|---|---|---|---|---|
| RakutenAI-3.0 | $0.50/M | $2.00/M | $0.25/M | $1.00/M |
| RakutenAI-2.0 (8x7B) | $0.05/M | $0.20/M | $0.025/M | $0.10/M |
| RakutenAI-7B | $0.03/M | $0.12/M | $0.015/M | $0.06/M |
| RakutenAI-2.0-mini | $0.02/M | $0.08/M | $0.01/M | $0.04/M |
Batch Processing
The Batch API allows asynchronous processing of large volumes of requests with a 50% discount on both input and output tokens.
| Model | Batch Input | Batch Output |
|---|---|---|
| RakutenAI-3.0 | $0.25/M | $1.00/M |
| RakutenAI-2.0 (8x7B) | $0.025/M | $0.10/M |
| RakutenAI-7B | $0.015/M | $0.06/M |
| RakutenAI-2.0-mini | $0.01/M | $0.04/M |
For more information about batch processing, see our batch processing documentation.
Rate Limits
We have implemented limits on how much an organization can use the Rakuten API to mitigate misuse and manage capacity on our API.
- Spend limits set a maximum monthly cost an organization can incur for API usage.
- Rate limits set the maximum number of API requests an organization can make over a defined period of time.
About our limits
Rate limits are a common practice for APIs.
- Designed to protect against API abuse and misuse, while minimizing impact on common customer usage patterns.
- Defined by usage tier, where each tier is associated with a different set of spend and rate limits.
- Your organization's limits can be set in the Limits page in the Playground.
Usage Tiers
Content to be populated subsequently
Rate Limits
Content to be populated subsequently
Message Batches API
Content to be populated subsequently
Setting lower limits for Workspaces
Content to be populated subsequently
Response headers
The API response includes headers that show you the rate limit enforced, current usage, and when the limit will be reset.
Content to be populated subsequently