Coding agent
Compare cost for longer prompts, more context, and repeated turns. Coding agents often need both context size and strong output.
What matters most
Coding agents need room for repo context, instructions, and multi-step replies. Context size and output price both matter.
Base example
This page starts with 2,000 prompt tokens, 5,000 context tokens, 1,500 output tokens, 3 iterations, and 500 tasks per month.
Compare these coding-agent models
Open the compare tool with the lowest-cost context-ready candidates preselected.
Low-cost coding-agent models in the base example
| Model | Context window | Input | Output | Monthly cost |
|---|---|---|---|---|
| Qwen2.5-Coder-7B | 32.8K | $0.0100 / 1M tokens | $0.0300 / 1M tokens | $0.17 |
| llama3.2-11b-vision-instruct | 131.1K | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $0.21 |
| llama3.2-3b-instruct | 131.1K | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $0.21 |
| Llama-3.2-3B-Instruct | 131.1K | $0.0200 / 1M tokens | $0.0200 / 1M tokens | $0.26 |
| paddleocr-vl | 16.4K | $0.0200 / 1M tokens | $0.0200 / 1M tokens | $0.26 |
| Meta-Llama-3.1-8B-Instruct-Turbo | 131.1K | $0.0200 / 1M tokens | $0.0300 / 1M tokens | $0.28 |
| Mistral-Nemo-Instruct-2407 | 131.1K | $0.0200 / 1M tokens | $0.0400 / 1M tokens | $0.30 |
| llama-3.1-8b-instruct | 16.4K | $0.0200 / 1M tokens | $0.0500 / 1M tokens | $0.32 |