Overview
Tembo Proxy is an AI gateway that provides access to tested and verified models from Anthropic, OpenAI, and Google. You connect your local coding tools (Claude Code, Codex, OpenCode, Cursor, and others) to Tembo’s infrastructure through a single API key, with no need to manage separate provider accounts. Tembo Proxy is available on all Tembo subscription plans, including the free tier. It works the same way as configuring any other model provider: set a base URL and API key, and your tool connects through Tembo.
Background
There are dozens of AI models available, but not all of them work well for coding tasks. Getting reliable results from coding agents requires models that handle long context, follow complex instructions, and produce consistent output. Tembo Proxy solves the multi-provider problem by acting as a single gateway. Instead of juggling API keys and accounts across Anthropic, OpenAI, and Google, you authenticate once with your Tembo API key and get access to all supported models. Tembo handles provider authentication, routing, and billing on your behalf.
How it Works
- Get your API key from the Tembo dashboard under Settings → API Keys
- Set environment variables or config files to point your coding tool at proxy.tembo.io
- Start coding: requests route through Tembo to the correct provider and stream back to your tool
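As an illustration of the second step, here is a minimal sketch for Claude Code. It assumes the `proxy.tembo.io/anthropic` endpoint from the Models table is served over HTTPS, and that the Tembo API key is passed in `ANTHROPIC_AUTH_TOKEN`. `ANTHROPIC_BASE_URL` and `ANTHROPIC_AUTH_TOKEN` are variables Claude Code reads for custom gateways; whether Tembo expects exactly this pairing is an assumption, and the helper name `claude_code_env` is hypothetical.

```python
import os
from typing import Optional

# Assumption: endpoint from the Models table, with an https:// scheme added.
ANTHROPIC_ENDPOINT = "https://proxy.tembo.io/anthropic"


def claude_code_env(tembo_api_key: str, base_env: Optional[dict] = None) -> dict:
    """Return a copy of the environment with Claude Code pointed at the proxy."""
    env = dict(os.environ if base_env is None else base_env)
    # Claude Code reads these two variables; using AUTH_TOKEN (rather than
    # ANTHROPIC_API_KEY) for the Tembo key is an assumption.
    env["ANTHROPIC_BASE_URL"] = ANTHROPIC_ENDPOINT
    env["ANTHROPIC_AUTH_TOKEN"] = tembo_api_key
    return env
```

Passing the result as `env=` to `subprocess.run(["claude"], env=...)` would then launch Claude Code against the proxy, assuming the `claude` CLI is installed. Other tools use their own variables or config files.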
Endpoints
Models
The following models are available through Tembo Proxy. Use the endpoint and configuration that matches your coding tool.

| Model | Model ID | Endpoint | Tools |
|---|---|---|---|
| Claude Opus 4.5 | claude-opus-4-5 | proxy.tembo.io/anthropic | Claude Code, OpenCode, Cursor, Amp |
| Claude 4.5 Sonnet | claude-4-5-sonnet | proxy.tembo.io/anthropic | Claude Code, OpenCode, Cursor, Amp |
| Claude 4.1 Opus | claude-4.1-opus | proxy.tembo.io/anthropic | Claude Code, OpenCode, Cursor, Amp |
| Claude 4.5 Haiku | claude-4-5-haiku | proxy.tembo.io/anthropic | Claude Code, OpenCode, Cursor, Amp |
| Claude 4 Sonnet | claude-4-sonnet | proxy.tembo.io/anthropic | Claude Code, OpenCode, Cursor |
| Claude 3.5 Sonnet | claude-3-5-sonnet | proxy.tembo.io/anthropic | Claude Code, OpenCode |
| Claude 3.5 Haiku | claude-3-5-haiku | proxy.tembo.io/anthropic | Claude Code, OpenCode |
| GPT-5.2 | gpt-5.2 | proxy.tembo.io/openai | Codex, OpenCode, Cursor |
| GPT-5.1 | gpt-5.1 | proxy.tembo.io/openai | Cursor |
| GPT-5.1 Codex | gpt-5.1-codex | proxy.tembo.io/proxy/openai | Codex, OpenCode, Cursor |
| GPT-5.1 Codex Max | gpt-5.1-codex-max | proxy.tembo.io/proxy/openai | Codex, OpenCode |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | proxy.tembo.io/proxy/openai | Codex, OpenCode |
| GPT-5.1 Codex High | gpt-5.1-codex-high | proxy.tembo.io/proxy/openai | Codex, Cursor |
| GPT-5.2 Codex | gpt-5.2-codex | proxy.tembo.io/proxy/openai | Codex |
| GPT-5 | gpt-5 | proxy.tembo.io/openai | OpenCode |
| Gemini 2.5 Pro | gemini-2.5-pro | proxy.tembo.io/anthropic | Claude Code, OpenCode |
| Gemini 3 Pro | gemini-3-pro | proxy.tembo.io/anthropic | Cursor |
| Gemini 3 Flash | gemini-3-flash | proxy.tembo.io/anthropic | Cursor |
| Kimi K2 (Bedrock) | bedrock-kimi-k2-thinking | proxy.tembo.io/anthropic | Claude Code, OpenCode |
| ZAI GLM-4.7 | zai-glm-4.7 | proxy.tembo.io/openai | OpenCode |
| Grok | grok | Cursor API | Cursor |
| Composer 1 | composer-1 | Cursor API | Cursor |
Specify a model as agent:model, or as agent:model:reasoningLevel for GPT-5 variants. For example: codex:gpt-5.2:high.
For full agent configuration details, see Coding Agents.
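The model string format described above can be split mechanically. The following is a minimal sketch, not Tembo's own parser: the names `ModelSpec` and `parse_model_spec` are hypothetical, and it assumes that neither agent names nor model IDs contain a colon (true of every model ID in the table).

```python
from typing import NamedTuple, Optional


class ModelSpec(NamedTuple):
    agent: str
    model: str
    reasoning_level: Optional[str]  # third field, used for GPT-5 variants


def parse_model_spec(spec: str) -> ModelSpec:
    """Split 'agent:model' or 'agent:model:reasoningLevel' into its parts."""
    parts = spec.split(":")
    # Accept exactly two or three non-empty, colon-separated fields.
    if len(parts) not in (2, 3) or not all(parts):
        raise ValueError(f"expected agent:model[:reasoningLevel], got {spec!r}")
    level = parts[2] if len(parts) == 3 else None
    return ModelSpec(parts[0], parts[1], level)
```

For example, `parse_model_spec("codex:gpt-5.2:high")` yields agent `codex`, model `gpt-5.2`, and reasoning level `high`; without the third field, the reasoning level is `None`.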
Tool Configuration
opencode.json:
Codex may store credentials in ~/.codex/auth.json that override environment variables. Remove or update that file if your environment variables aren’t taking effect.
Pricing
Proxy usage is billed through Tembo’s credit system. Every request consumes credits based on token usage, so more complex tasks use more tokens and therefore more credits.

| Plan | Monthly Cost | Credits Included | Overage |
|---|---|---|---|
| Free | $0 | 5 credits/day | None (paused until refresh) |
| Pro | $60 | 100 credits/month | Pay-as-you-go |
| Max | $200 | 400 credits/month | Pay-as-you-go |
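For a rough comparison of the paid plans, dividing the monthly cost by the included credits gives an effective price per included credit. The sketch below uses only the figures from the table. It says nothing about pay-as-you-go overage rates, which are not listed here, and the function name is illustrative.

```python
# Monthly cost (USD) and included credits, copied from the pricing table.
PLANS = {
    "Pro": (60, 100),
    "Max": (200, 400),
}


def cost_per_included_credit(plan: str) -> float:
    """Monthly cost divided by the credits that plan includes."""
    monthly_cost, credits = PLANS[plan]
    return monthly_cost / credits


for name in PLANS:
    print(f"{name}: ${cost_per_included_credit(name):.2f} per included credit")
# Pro: $0.60 per included credit
# Max: $0.50 per included credit
```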
Auto-reload
Paid plans support auto-reload to prevent interruptions. When your credit balance drops below a threshold, Tembo automatically purchases additional credits. Configure the reload threshold, target balance, and monthly limit from Billing settings.
Monthly Limits
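As a rough illustration of how the reload threshold, target balance, and monthly limit could interact, here is a minimal sketch. It is not Tembo's billing logic. It assumes that all values are in credits, that a reload tops the balance up to the target, and that a reload is trimmed (not skipped) when it would exceed what remains of the monthly limit. The function and parameter names are hypothetical.

```python
def reload_amount(
    balance: float,
    threshold: float,
    target_balance: float,
    reloaded_this_month: float,
    monthly_limit: float,
) -> float:
    """Credits to purchase now under the assumed auto-reload rule."""
    if balance >= threshold:
        return 0.0  # balance has not dropped below the threshold
    wanted = max(target_balance - balance, 0.0)
    # Assumption: the cap applies to total credits reloaded in the month.
    remaining_cap = max(monthly_limit - reloaded_this_month, 0.0)
    return min(wanted, remaining_cap)
```

With a threshold of 5, a target of 20, and a monthly limit of 50, a balance of 3 triggers a 17-credit reload. If 40 credits were already reloaded that month, only 10 would be purchased.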
You can set a maximum overage limit on paid plans to control spending. The auto-reload system respects this limit: even if your balance drops below the threshold, reloads won’t exceed your monthly cap.
For billing questions, contact support@tembo.io. See Billing for full details.
Privacy
All Tembo infrastructure is hosted in the United States, with no infrastructure in China. Tembo is SOC 2 Type 1 certified and in SOC 2 Type 2 observation. Tembo maintains zero data retention agreements with providers where available. Current retention policies by provider:

| Provider | Data Retention |
|---|---|
| Anthropic | Zero retention — code and prompts are not stored or used for training |
| | Zero retention |
| OpenAI | 30-day retention (OpenAI policy) |
For Teams
Tembo Proxy works with team accounts. Administrators manage access, monitor usage, and control spending from the dashboard.
Roles
- Admins can create and revoke API keys, invite team members, set spending limits, and view usage across the team
- Members receive their own API key and can use the proxy within the limits set by their admin
Model Access
Per-model access controls for team administrators are coming soon. Today, all team members have access to all models available on your plan.
Bring Your Own Key
For OpenAI models, team members supply their own OpenAI API key alongside the Tembo API key. Anthropic and Google models are accessed directly through Tembo, with no additional keys needed.
Bring-your-own-key support for additional providers is planned. Contact support@tembo.io for details.
Goals
- Simplify multi-provider access — One API key, one billing account, all major model providers
- Verify model quality — Only expose models that have been tested and verified for coding agent use cases
- Enable team collaboration — Shared billing, usage monitoring, and access controls for engineering teams
- Stay provider-agnostic — Support the best models regardless of provider, so you can switch without reconfiguring infrastructure