Microsoft API Pricing
Microsoft's Phi family of small language models (SLMs) is designed for edge and on-device inference. Phi-4 delivers frontier-class reasoning within a 14B parameter footprint, optimized for STEM and coding tasks.
Models: 3 tracked
All tiers, latest pricing.
All Microsoft Models
| Model | Tier | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) |
|---|---|---|---|---|
| Phi-4 | fast | $0.065 | $0.140 | 16K |
| Phi 4 Mini Instruct | fast | $0.080 | $0.350 | 131K |
| WizardLM-2 8x22B | balanced | $0.620 | $0.620 | 66K |
Model Details
Phi-4
$0.065 / 1M input tokens
Phi-4 is Microsoft's 14B-parameter small language model, which punches above its weight class on reasoning and math benchmarks. It is designed for on-device and edge inference, where quality per parameter matters.
Phi 4 Mini Instruct
$0.080 / 1M input tokens
Phi-4-mini-instruct is a lightweight open model built on synthetic data and filtered publicly available websites, with a focus on high-quality, reasoning-dense data. The model belongs to the Phi-4...
WizardLM-2 8x22B
$0.620 / 1M input tokens
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models.
Calculate Microsoft API Costs
Use the TokenRate calculator to estimate what Microsoft models will cost for your workload.
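For a quick offline check, the same arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not the TokenRate calculator itself: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, and the figures are copied from the pricing table on this page, so they may be out of date.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table
# on this page. They are a snapshot and may not match current pricing.
PRICES = {
    "Phi-4": (0.065, 0.140),
    "Phi 4 Mini Instruct": (0.080, 0.350),
    "WizardLM-2 8x22B": (0.620, 0.620),
}

TOKENS_PER_UNIT = 1_000_000  # prices are quoted per 1M tokens


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens per model.
    for name in PRICES:
        cost = estimate_cost(name, 2_000_000, 500_000)
        print(f"{name}: ${cost:.2f}")
```

With this workload, Phi-4 comes to about $0.20 (2 × $0.065 for input plus 0.5 × $0.140 for output). The sketch ignores anything beyond per-token pricing, such as context limits or provider-specific fees.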
Other Providers