What we added — and why it matters
The point isn't size for its own sake. The frontier labs — Anthropic, OpenAI, Google — get the headlines, but a large and growing share of real production value sits with open-weight and specialist models that cost a fraction as much. You can't pick the cheapest model that clears your quality bar if it was never on the table. This update puts eighteen more makers on the table. Browse them all in the providers directory, or read on for what each one brings.
Every provider named below links to its own TokenRate page, where you'll find the full model lineup, current input and output prices per million tokens, context windows, and tiers. The official site for each is linked in the Primary sources block at the foot of this article.
The open-weight wave from China
MiniMax, out of Shanghai, pairs very long context windows — north of a million tokens — with aggressive pricing and solid agentic performance. Tencent brings the Hunyuan family, including Mixture-of-Experts variants that power its own products and are available open-weight for general chat, reasoning, and multilingual work. And Baidu adds ERNIE 4.5, China's longest-running large-model line, now with big MoE and vision-language variants. If you've wondered whether the open Chinese models really are as cheap as people say, you can now check the live numbers directly instead of taking anyone's word for it.
Search, agents, and computer use
Nous Research rounds out the group with Hermes — steerable, instruction-tuned fine-tunes of Llama known for strong function-calling and minimal refusals, a long-time favourite of the open-source agent community. If you're building anything that plans, calls tools, or browses, these three are worth pricing against the usual flagships before you default to a frontier model for every step.
Enterprise, RAG, and writing specialists
Writer contributes the Palmyra family, tuned for business writing and domain-specific knowledge work, with Palmyra X5 offering a 1M-token window for whole-document tasks. Arcee AI builds small, efficient models — the Virtuoso, Coder, and Trinity families — using model-merging and distillation for strong quality-per-dollar on coding and on-prem deployment. And South Korea's Upstage adds the Solar family: compact models that punch above their parameter count via depth-up-scaling, with Solar Pro strong on document understanding and multilingual text.
Hardware, multimodal, and the fully-open lab
Allen Institute for AI adds OLMo, notable for being fully open: weights, training data, and code are released together, which makes it a reference point for reproducible, transparent research. Finally, Inflection AI brings the Inflection-3 models behind Pi — tuned for empathetic, safe, conversational assistants. Different shapes, different goals, all now priced side by side with everything else in the catalogue.
How to compare all 29 providers
The bet behind all of this hasn't changed: the right model is rarely the most expensive one — it's the cheapest one that clears your quality bar. Eighteen new providers means eighteen more chances that the cheapest model good enough for your job is one you simply hadn't priced yet. Open the full providers directory to explore every lineup, or run your numbers through the calculator to see what a switch could save.