14.4 · ID103 · Platform Admin · AI Inference Layer
5
Available Models
3
PTU Deployments
1
Critical PTU Alerts
$2,520
Month-to-Date Spend
gpt-4o
Context Window
128K tokens
Input Cost
$0.005/1K
Output Cost
$0.015/1K
Latency P50
1200ms
Capabilities
gpt-4o-mini
Context Window
128K tokens
Input Cost
$0.00015/1K
Output Cost
$0.0006/1K
Latency P50
450ms
Capabilities
claude-3-5-sonnet-20241022
Context Window
200K tokens
Input Cost
$0.003/1K
Output Cost
$0.015/1K
Latency P50
900ms
Capabilities
gemini-1.5-pro
Context Window
1000K tokens
Input Cost
$0.00125/1K
Output Cost
$0.005/1K
Latency P50
1100ms
Capabilities
llama-3.1-sonar-large-128k-online
Context Window
127K tokens
Input Cost
$0.001/1K
Output Cost
$0.001/1K
Latency P50
2000ms
Capabilities