Rakuten AI

Documentation

Pricing

This page provides detailed pricing information for Rakuten's models and features. All prices are in USD.

For the most current pricing information, please visit our pricing page.

Model pricing

The following table shows pricing for all Rakuten models.

ModelInputOutputBatch InputBatch Output
RakutenAI-3.0$0.50/M$2.00/M$0.25/M$1.00/M
RakutenAI-2.0 (8x7B)$0.05/M$0.20/M$0.025/M$0.10/M
RakutenAI-7B$0.03/M$0.12/M$0.015/M$0.06/M
RakutenAI-2.0-mini$0.02/M$0.08/M$0.01/M$0.04/M

Batch Processing

The Batch API allows asynchronous processing of large volumes of requests with a 50% discount on both input and output tokens.

ModelBatch InputBatch Output
RakutenAI-3.0$0.25/M$1.00/M
RakutenAI-2.0 (8x7B)$0.025/M$0.10/M
RakutenAI-7B$0.015/M$0.06/M
RakutenAI-2.0-mini$0.01/M$0.04/M

For more information about batch processing, see our batch processing documentation.

Rate Limits

We have implemented limits on how much an organization can use the Rakuten API to mitigate misuse and manage capacity on our API.

  • Spend limits set a maximum monthly cost an organization can incur for API usage.
  • Rate limits set the maximum number of API requests an organization can make over a defined period of time.

About our limits

Rate limits are a common practice for APIs.

  • Designed to protect against API abuse and misuse, while minimizing impact on common customer usage patterns.
  • Defined by usage tier, where each tier is associated with a different set of spend and rate limits.
  • Your organization's limits can be set in the Limits page in the Playground.

Usage Tiers

Content to be populated subsequently

Rate Limits

Content to be populated subsequently

Message Batches API

Content to be populated subsequently

Setting lower limits for Workspaces

Content to be populated subsequently

Response headers

The API response includes headers that show you the rate limit enforced, current usage, and when the limit will be reset.

Content to be populated subsequently

Built with v0