Verified inference means cryptographic proof that the exact open model you requested produced your output, not a cheaper or quantized stand-in. Run frontier open models like GLM-5.2, billed by the token.
1. Claude Code
In ~/.claude/settings.json, then run claude:
{
"env": {
"ANTHROPIC_BASE_URL": "https://api.engy.ai",
"ANTHROPIC_AUTH_TOKEN": "$ENGY_API_KEY",
"ANTHROPIC_MODEL": "glm-5.2",
"ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-5.2"
}
}
No /v1 on the URL; keep the haiku model set.
2. OpenAI API
curl https://api.engy.ai/v1/chat/completions \
-H "Authorization: Bearer $ENGY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"glm-5.2","messages":[{"role":"user","content":"hello"}]}'
from openai import OpenAI
client = OpenAI(base_url="https://api.engy.ai/v1", api_key="$ENGY_API_KEY")
client.chat.completions.create(model="glm-5.2",
messages=[{"role":"user","content":"hello"}])
3. Cursor
Cursor Settings → Models → API Keys, under OpenAI API Key:
OpenAI API Key $ENGY_API_KEY Override OpenAI Base URL https://api.engy.ai/v1 ← enable the toggle, then Verify
Under Models, + Add model → glm-5.2, enable it, and select it in
chat. Custom endpoints drive the chat/plan panel; Tab and Composer stay on Cursor's own models.
Requests are relayed by Cursor's servers, so it works from any machine.
4. Codex
In ~/.codex/config.toml, then export ENGY_API_KEY=… and run codex:
model = "glm-5.2" model_provider = "engy" [model_providers.engy] name = "engy" base_url = "https://api.engy.ai/v1" env_key = "ENGY_API_KEY" wire_api = "responses"
5. Hermes
In ~/.hermes/config.yaml, then run hermes chat:
model: provider: "custom" default: "glm-5.2" base_url: "https://api.engy.ai/v1" api_key: "$ENGY_API_KEY"