Coding agent

Compare cost for longer prompts, more context, and repeated turns. Coding agents often need both context size and strong output.

What matters most

Coding agents need room for repo context, instructions, and multi-step replies. Context size and output price both matter.

This page starts with 2,000 prompt tokens, 5,000 context tokens, 1,500 output tokens, 3 iterations, and 500 tasks per month.

Open the compare tool with the lowest-cost context-ready candidates preselected.

Model	Context window	Input	Output	Monthly cost
Qwen2.5-Coder-7B	32.8K	$0.0100 / 1M tokens	$0.0300 / 1M tokens	$0.17
llama3.2-11b-vision-instruct	131.1K	$0.0150 / 1M tokens	$0.0250 / 1M tokens	$0.21
llama3.2-3b-instruct	131.1K	$0.0150 / 1M tokens	$0.0250 / 1M tokens	$0.21
Llama-3.2-3B-Instruct	131.1K	$0.0200 / 1M tokens	$0.0200 / 1M tokens	$0.26
paddleocr-vl	16.4K	$0.0200 / 1M tokens	$0.0200 / 1M tokens	$0.26
Meta-Llama-3.1-8B-Instruct-Turbo	131.1K	$0.0200 / 1M tokens	$0.0300 / 1M tokens	$0.28
Mistral-Nemo-Instruct-2407	131.1K	$0.0200 / 1M tokens	$0.0400 / 1M tokens	$0.30
gpt-oss-20b	32.8K	$0.0145 / 1M tokens	$0.0700 / 1M tokens	$0.31

Model

Prompt tokens

Context tokens

Output tokens

Iterations / task

Tasks / month

Pricing data from catalog last generated Jul 13, 2026. Verify before production decisions. Data sources.