Choosing a model
Selecting the optimal Rakuten model for your application involves balancing three key criteria:
- Capabilities: What features or capabilities does the model need to meet your requirements?
- Speed: How quickly does the model need to respond in your application?
- Cost: What's your budget for both development and production usage?
This guide helps you make an informed decision based on your specific requirements.
Model selection matrix
| What you need | Suggested model | Example use cases |
|---|---|---|
| High performance with efficient resource use across diverse tasks | RakutenAI-2.0-MoE | Large-scale multitask applications (e.g. complex content generation) |
| Natural, engaging conversations | RakutenAI-7B-chat | Conversational AI (e.g. chatbots, virtual assistants) |
| Balanced performance and resource usage for precise instruction-following tasks | RakutenAI-7B-instruct | General-purpose tasks (e.g. translation, summarization) |
| Fast responses at lower cost for real-time applications | RakutenAI-2.0-mini-1.5B | Edge device deployment |
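
If the model choice is made in application code, the matrix above can be expressed as a small lookup table. The following is a minimal sketch, not an official API: the priority keys (`"multitask"`, `"conversation"`, `"instruction"`, `"realtime"`), the `ModelChoice` class, and the `suggest_model` function are hypothetical names introduced here for illustration. Only the model names and use cases come from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelChoice:
    """One row of the selection matrix."""
    name: str      # model name as listed in the matrix
    use_case: str  # example use case from the matrix


# Hypothetical priority keys mapped to the rows of the matrix above.
SELECTION_MATRIX = {
    "multitask": ModelChoice(
        "RakutenAI-2.0-MoE", "Large-scale multitask applications"
    ),
    "conversation": ModelChoice(
        "RakutenAI-7B-chat", "Conversational AI"
    ),
    "instruction": ModelChoice(
        "RakutenAI-7B-instruct", "General-purpose tasks"
    ),
    "realtime": ModelChoice(
        "RakutenAI-2.0-mini-1.5B", "Edge device deployment"
    ),
}


def suggest_model(priority: str) -> ModelChoice:
    """Return the suggested model for a priority key, or raise ValueError."""
    try:
        return SELECTION_MATRIX[priority]
    except KeyError:
        valid = ", ".join(sorted(SELECTION_MATRIX))
        raise ValueError(
            f"Unknown priority {priority!r}; expected one of: {valid}"
        ) from None


if __name__ == "__main__":
    choice = suggest_model("conversation")
    print(f"{choice.name} ({choice.use_case})")
```

Keeping the mapping in one place means that changing the suggested model for a priority is a one-line edit, and an unrecognized priority fails loudly instead of silently falling back to a default.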