TRAPI Tunnel Dashboard HEALTHY

Public: https://tx-trapi.com · Pools: redmond, gcr · Token len: 2198 · Global cap: 150 r/s / 80,000,000 TPM

Live stats (5-min sliding window)

Total
0
Success
429s
0
Now (5s)
0.0 r/s
By pool (5min)

By model (recent traffic)

No traffic in last 5 minutes.

Pool routing & merged caps

⚙ = override applied (see RATE_OVERRIDES in source). Merged r/s = sum across pools; load-balanced 50/50 round-robin when both have it.

DeploymentAvailable poolsRouteMerged r/sMerged TPM
DeepSeek-R1_1redmond, gcrload-balanced79.34,760,000
DeepSeek-V3.2_1redmond, gcrload-balanced100.023,800,000
DeepSeek-V4-Flash_2026-04-23redmond, gcrload-balanced19.81,190,000
DeepSeek-V4-Pro_2026-04-23redmond, gcrload-balanced79.34,760,000
Kimi-K2.5_1redmond, gcrload-balanced89.77,437,500
Kimi-K2.6_2026-04-20redmond, gcrload-balanced79.34,760,000
Llama-3.3-70B-Instruct_5redmond, gcrload-balanced19.81,190,000
MAI-Image-2.5_2026-06-02redmond, gcrload-balanced2.00
Mistral-Large-3_1redmond, gcrload-balanced79.34,760,000
codex-mini_2025-05-16redmond, gcrload-balanced17.91,070,999
computer-use-preview_2025-03-11redmond, gcrload-balanced79.3476,000
gpt-4-32k_0314gcr→ gcr1.05,950
gpt-4.1-mini_2025-04-14redmond, gcrload-balanced14.9892,500
gpt-4.1-nano_2025-04-14redmond, gcrload-balanced14.9892,500
gpt-4.1_2025-04-14redmond, gcrload-balanced100.075,569,760
gpt-4o-mini-tts_2025-03-20redmond→ redmond50.02,142,000
gpt-4o-mini_2024-07-18redmond, gcrload-balanced100.011,900,000
gpt-4o-transcribe-diarize_2025-10-15redmond, gcrload-balanced100.0475,998
gpt-4o_2024-11-20redmond, gcrload-balanced100.053,460,750
gpt-5-chat_2025-08-07redmond, gcrload-balanced17.31,041,250
gpt-5-chat_2025-10-03redmond, gcrload-balanced46.62,796,500
gpt-5-codex_2025-09-15redmond, gcrload-balanced17.91,070,999
gpt-5-mini_2025-08-07redmond, gcrload-balanced17.31,041,250
gpt-5-nano_2025-08-07redmond, gcrload-balanced17.31,041,250
gpt-5-pro_2025-10-06redmond, gcrload-balanced20.02,000,000
gpt-5.1-chat_2025-11-13redmond, gcrload-balanced100.01,190,000
gpt-5.1-codex-max_2025-12-04redmond, gcrload-balanced100.01,190,000
gpt-5.1-codex-mini_2025-11-13redmond, gcrload-balanced19.81,190,000
gpt-5.1-codex_2025-11-13redmond, gcrload-balanced19.81,190,000
gpt-5.2-chat_2025-12-11redmond, gcrload-balanced100.01,190,000
gpt-5.2-codex_2026-01-14redmond, gcrload-balanced100.01,190,000
gpt-5.2_2025-12-11redmond, gcrload-balanced86.3517,649
gpt-5.3-chat_2026-03-03redmond, gcrload-balanced100.08,922,619
gpt-5.3-codex_2026-02-24redmond, gcrload-balanced100.017,850,000
gpt-5.4-mini_2026-03-17redmond, gcrload-balanced100.012,000,000
gpt-5.4-nano_2026-03-17redmond, gcrload-balanced19.81,190,000
gpt-5.4-pro_2026-03-05redmond, gcrload-balanced45.12,707,250
gpt-5.4_2026-03-05redmond, gcrload-balanced51.0600,950
gpt-5.5_2026-04-24redmond, gcrload-balanced20.01,199,999
gpt-5_2025-08-07redmond, gcrload-balanced100.053,431,000
gpt-audio-1.5_2026-02-23redmond, gcrload-balanced100.035,700,000
gpt-audio-mini_2025-10-06redmond, gcrload-balanced9.9297,498
gpt-audio_2025-08-28redmond, gcrload-balanced100.029,788,675
gpt-chat-latest_2026-05-28redmond, gcrload-balanced100.09,520,000
gpt-image-1redmond, gcrload-balanced2.00
gpt-oss-120b_1redmond, gcrload-balanced100.023,800,000
gpt-realtime-1.5_2026-02-23redmond, gcrload-balanced3.4101,149
gpt-realtime-2_2026-05-06redmond, gcrload-balanced4.0118,998
gpt-realtime-mini_2025-10-06redmond, gcrload-balanced4.0118,998
gpt-realtime-translate_2026-05-06redmond, gcrload-balanced4.0118,998
gpt-realtime-whisper_2026-05-06redmond, gcrload-balanced4.0118,998
gpt-realtime_2025-08-28redmond, gcrload-balanced4.0118,998
grok-4-1-fast-non-reasoning_1redmond, gcrload-balanced79.34,760,000
grok-4-1-fast-reasoning_1redmond, gcrload-balanced79.34,760,000
grok-4-20-non-reasoning_1redmond, gcrload-balanced79.34,760,000
grok-4-20-reasoning_1redmond, gcrload-balanced79.34,760,000
grok-4.3_1redmond, gcrload-balanced79.34,760,000
grok-4_1redmond, gcrload-balanced79.34,760,000
model-router_2025-05-19redmond, gcrload-balanced5.0297,498
model-router_2025-08-07redmond, gcrload-balanced100.09,520,000
model-router_2025-11-18redmond, gcrload-balanced4.0238,000
o1_2024-12-17redmond, gcrload-balanced6.92,495,430
o3-mini_2025-01-31redmond, gcrload-balanced8.01,000,000
o3-pro_2025-06-10redmond, gcrload-balanced3.0999,999
o3_2025-04-16redmond, gcrload-balanced20.02,000,000
o4-mini_2025-04-16redmond, gcrload-balanced20.01,499,999
sora-2_2025-10-06redmond, gcrload-balanced3.40
text-embedding-3-large_1gcr→ gcr15.01,000,000
text-embedding-3-small_1gcr→ gcr15.01,000,000
text-embedding-ada-002_2redmond, gcrload-balanced11.9714,000
whisper_001redmond, gcrload-balanced2.00

Available aliases (TRAPI_MODEL_*)

AliasDeployment
gpt41gpt-4.1_2025-04-14
gpt4ogpt-4o_2024-11-20
gpt51gpt-5.1_2025-11-13
gpt52gpt-5.2_2025-12-11
gpt52_chatgpt-5.2-chat_2025-12-11
gpt54gpt-5.4_2026-03-05
gpt55gpt-5.5_2026-04-24
gpt5_chatgpt-5-chat_2025-10-03
gpt5_minigpt-5-mini_2025-08-07
o3o3_2025-04-16
o4_minio4-mini_2025-04-16

AMLT job config

AZURE_OPENAI_ENDPOINT=https://tx-trapi.com
AZURE_OPENAI_API_KEY=<append ?token=YOUR_KEY to URL to reveal>
AZURE_OPENAI_API_VERSION=2025-04-01-preview

Auto-refresh 5s · /health · /pools