Arcee AI API Pricing

Arcee AI builds small, efficient enterprise models — the Virtuoso, Coder, and Trinity families — using model-merging and distillation. They target strong quality-per-dollar for coding, reasoning, and on-prem deployment.

Official site: www.arcee.ai →

Cheapest

Trinity Mini

$0.045/1M in

Flagship

Coder Large

$0.500/1M in

Models

4 tracked

All tiers, latest pricing.

All Arcee AI Models

Model	Tier	Input / 1M	Output / 1M	Context
Trinity Mini	fast	$0.045	$0.150	131K
Trinity Large Thinking	reasoning	$0.250	$0.800	262K
Coder Large	flagship	$0.500	$0.800	33K
Virtuoso Large	flagship	$0.750	$1.20	131K

Model Details

Trinity Mini

$0.045 in

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token.

Trinity Large Thinking

$0.250 in

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks.

Coder Large

$0.500 in

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora.

Virtuoso Large

$0.750 in

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA.

Calculate Arcee AI API Costs

Use the TokenRate calculator to estimate exactly what Arcee AI models will cost for your workload.

Open Calculator →

Other Providers