Arcee AI API Pricing
Arcee AI builds small, efficient enterprise models — the Virtuoso, Coder, and Trinity families — using model-merging and distillation. They target strong quality-per-dollar for coding, reasoning, and on-prem deployment.
Models
4 tracked
All tiers, latest pricing.
All Arcee AI Models
| Model | Tier | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|
| Trinity Mini | fast | $0.045 | $0.150 | 131K |
| Trinity Large Thinking | reasoning | $0.250 | $0.800 | 262K |
| Coder Large | flagship | $0.500 | $0.800 | 33K |
| Virtuoso Large | flagship | $0.750 | $1.20 | 131K |
Model Details
Trinity Mini
$0.045 inTrinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token.
Trinity Large Thinking
$0.250 inTrinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks.
Coder Large
$0.500 inCoder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora.
Virtuoso Large
$0.750 inVirtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA.
Calculate Arcee AI API Costs
Use the TokenRate calculator to estimate exactly what Arcee AI models will cost for your workload.
Open Calculator →Other Providers