Learn how to configure and use Fireworks AI’s lightning-fast inference platform with Cline. Experience up to 4x faster inference speeds with optimized models and competitive pricing.

| Parameter Count | Price per 1M Input Tokens |
| --- | --- |
| Less than 4B parameters | $0.10 |
| 4B - 16B parameters | $0.20 |
| More than 16B parameters | $0.90 |
| MoE 0B - 56B parameters | $0.50 |

| Base Model Size | Price per 1M Training Tokens |
| --- | --- |
| Up to 16B parameters | $0.50 |
| 16.1B - 80B parameters | $3.00 |
| DeepSeek R1 / V3 | $10.00 |

| GPU Type | Price per Hour |
| --- | --- |
| A100 80GB | $2.90 |
| H100 80GB | $5.80 |
| H200 141GB | $6.99 |
| B200 180GB | $11.99 |
| AMD MI300X | $4.99 |

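
To make the pricing arithmetic concrete, here is a minimal sketch that turns the listed rates into cost estimates. The prices are copied from the tables above; the dictionary keys and function names are hypothetical labels chosen for this example and are not part of any Fireworks AI or Cline API. Actual bills may differ, since the tables list only input-token and hourly rates.

```python
# Rates copied from the pricing tables above (USD).
# The tier keys below are hypothetical labels for this sketch only.
INPUT_PRICE_PER_MILLION_TOKENS = {
    "under_4b": 0.10,
    "4b_to_16b": 0.20,
    "over_16b": 0.90,
    "moe_0b_to_56b": 0.50,
}

GPU_PRICE_PER_HOUR = {
    "A100 80GB": 2.90,
    "H100 80GB": 5.80,
    "H200 141GB": 6.99,
    "B200 180GB": 11.99,
    "AMD MI300X": 4.99,
}


def estimate_input_cost(tier: str, input_tokens: int) -> float:
    """Estimate the input-token cost in USD for a model size tier."""
    return INPUT_PRICE_PER_MILLION_TOKENS[tier] * input_tokens / 1_000_000


def estimate_gpu_cost(gpu_type: str, hours: float) -> float:
    """Estimate the cost in USD of renting one GPU for a number of hours."""
    return GPU_PRICE_PER_HOUR[gpu_type] * hours


if __name__ == "__main__":
    # 2M input tokens on a model with more than 16B parameters.
    print(f"${estimate_input_cost('over_16b', 2_000_000):.2f}")
    # Ten hours on a single H100 80GB.
    print(f"${estimate_gpu_cost('H100 80GB', 10):.2f}")
```

For example, 2 million input tokens at the $0.90 tier come to 2 × $0.90, and ten H100 hours come to 10 × $5.80.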
Support for `<think>` tag processing and reasoning content extraction makes complex multi-step reasoning practical for real-time applications.
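
As an illustration of what reasoning content extraction can look like, the sketch below separates `<think>` blocks from the final answer in a raw model response. It assumes the model wraps its reasoning in paired `<think>...</think>` tags; the function name `split_reasoning` is hypothetical, and this is not Cline's or Fireworks AI's actual implementation.

```python
import re

# Matches a paired <think>...</think> block, including multi-line content.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw model response.

    Assumes reasoning is wrapped in paired <think> tags; unpaired or
    streaming-partial tags are left untouched in the answer.
    """
    # Collect the content of every <think> block, in order.
    reasoning = "\n".join(block.strip() for block in THINK_BLOCK.findall(text))
    # Remove the blocks so only the user-facing answer remains.
    answer = THINK_BLOCK.sub("", text).strip()
    return reasoning, answer


if __name__ == "__main__":
    raw = "<think>2 + 2 is 4.</think>The answer is 4."
    reasoning, answer = split_reasoning(raw)
    print("Reasoning:", reasoning)
    print("Answer:", answer)
```

A streaming client would need extra handling for tags split across chunks, which this sketch deliberately leaves out.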