Chatbot
Compare simple chatbot cost with short input, short output, and many repeated messages each month.
What matters most
For chatbots, low input price and low output price matter more than very large context. Fast, cheap turns usually win.
Base example
This page starts with 500 input tokens, 300 output tokens, 100 messages per user, and 1,000 users per month.
Low-cost chatbot models in the base example
These rows are server-rendered, so search engines can read them before any script runs.
| Model | Input | Output | Monthly cost |
|---|---|---|---|
| titan-embed-text-v2 | $0.0200 / 1M tokens | N/A | $1.00 |
| Qwen2.5-Coder-3B-Instruct | $0.0100 / 1M tokens | $0.0300 / 1M tokens | $1.40 |
| Qwen2.5-Coder-7B-Instruct | $0.0100 / 1M tokens | $0.0300 / 1M tokens | $1.40 |
| Qwen2.5-Coder-7B | $0.0100 / 1M tokens | $0.0300 / 1M tokens | $1.40 |
| llama3.2-11b-vision-instruct | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $1.50 |
| llama3.2-3b-instruct | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $1.50 |
| Llama-3.2-3B-Instruct | $0.0200 / 1M tokens | $0.0200 / 1M tokens | $1.60 |
| paddleocr-vl | $0.0200 / 1M tokens | $0.0200 / 1M tokens | $1.60 |
Compare these models
Open the compare tool with the lowest-cost chatbot candidates preselected.