LLMWate API Documentation

Introduction

LLMWate provides a unified API gateway for accessing multiple AI models — including GPT-4o, Claude 3.5 Sonnet, Gemini, DeepSeek, Qwen, Meta Llama 3 and more — through a single, OpenAI-compatible interface.

One API key. 45+ models across 10 providers. One unified endpoint.

Base URL

https://api.llmwate.com/v1

Quick Start

Get up and running in 3 steps:

1
Get Your API KeySign up and create an API key from your dashboard.
2
Choose a ModelBrowse models at AI Playground or via GET /v1/models.
3
Make Your First RequestSend a chat completions request with your API key.

cURL

curl https://api.llmwate.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello!"}]}'

Authentication

All API requests require authentication via Bearer token:

Request Header

Authorization: Bearer YOUR_API_KEY

Manage your API keys on the API Keys Management page.

Chat Completions

Send a conversation and receive an AI-generated response. Fully compatible with the OpenAI Chat Completions API format.

POST/v1/chat/completions

Request Body

Parameter	Type	Required	Description
model	string	Yes	Model ID (e.g., gpt-4o, claude-3.5-sonnet, deepseek-chat)
messages	array	Yes	Array of message objects with role and content
temperature	float	No	Sampling temperature (0-2), default 1.0
max_tokens	integer	No	Maximum tokens to generate
stream	boolean	No	Enable server-sent events streaming, default false

cURL

curl https://api.llmwate.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "What is 2+2?"}], "temperature": 0.7, "max_tokens": 256}'

Python

from openai import OpenAI
client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.llmwate.com/v1")
response = client.chat.completions.create(model="gpt-4o", messages=[{"role": "user", "content": "What is 2+2?"}])
print(response.choices[0].message.content)

Node.js

import OpenAI from "openai";
const client = new OpenAI({api_key:"YOUR_API_KEY", baseURL:"https://api.llmwate.com/v1"});
const resp = await client.chat.completions.create({model:"gpt-4o",messages:[{"role":"user","content":"What is 2+2?"}]});
console.log(resp.choices[0].message.content);

Response

{"id":"chatcmpl-abc123","object":"chat.completion","created":1715623456,"model":"gpt-4o","choices":[{"index":0,"message":{"role":"assistant","content":"2+2 equals 4."},"finish_reason":"stop"}],"usage":{"prompt_tokens":12,"completion_tokens":8,"total_tokens":20}}

List Available Models

Get all available AI models. Supports optional category filtering.

GET/v1/models

Parameter	Type	Required	Description
category	string	No	Filter by category: general, coding, reasoning, vision, fast, cheap, chinese

cURL - All Models

curl https://api.llmwate.com/v1/models -H "Authorization: Bearer YOUR_API_KEY"

cURL - Coding Models

curl "https://api.llmwate.com/v1/models?category=coding" -H "Authorization: Bearer YOUR_API_KEY"

Response (partial)

{"models":[{"id":"gpt-4o","name":"GPT-4o","provider":"OpenAI","category":"general","context_length":128000,"pricing":{"prompt":0.0025,"completion":0.01}},{"id":"claude-3.5-sonnet","name":"Claude 3.5 Sonnet","provider":"Anthropic","category":"coding","context_length":200000,"pricing":{"prompt":0.003,"completion":0.015}}],"total":45}

General Coding Reasoning Vision Fast Cheap Chinese

45 models across 10 providers. Browse in the AI Playground.

Auto Router

Automatically select the best model for your task type. Returns recommended models ordered by preference.

GET/v1/models/auto?task=<type>

Parameter	Type	Required	Description
task	string	Yes	Task type: general, coding, reasoning, vision, fast, cheap, chinese

cURL

curl "https://api.llmwate.com/v1/models/auto?task=coding" -H "Authorization: Bearer YOUR_API_KEY"

Response

{"task":"coding","primary":{"id":"claude-3.5-sonnet","name":"Claude 3.5 Sonnet","provider":"Anthropic","reason":"Best overall coding performance"},"alternatives":[{"id":"gpt-4o","name":"GPT-4o","provider":"OpenAI"},{"id":"deepseek-chat","name":"DeepSeek Chat","provider":"DeepSeek"}]}

Task	Best For	Recommended
coding	Code generation, debugging, review	Claude 3.5 Sonnet, GPT-4o
reasoning	Complex reasoning, analysis, math	Claude 3.5 Sonnet, DeepSeek R1
vision	Image understanding, OCR	GPT-4o, Claude 3.5 Sonnet
fast	Quick responses, low latency	GPT-4o-mini, Claude 3 Haiku
cheap	Cost-effective inference	DeepSeek Chat, Qwen Turbo
chinese	Chinese language tasks	Qwen 2.5, DeepSeek Chat
general	General conversation	GPT-4o, Claude 3.5 Sonnet

Account Balance

Check your current account balance, usage, and quota.

GET/v1/balance

cURL

curl https://api.llmwate.com/v1/balance -H "Authorization: Bearer YOUR_API_KEY"

Response

{"balance":87.50,"plan":"enterprise","used_this_month":1250000,"quota_this_month":6000000,"quota_reset_at":"2026-06-01T00:00:00Z"}

Chat Status and Provider Health

Check which provider APIs are configured and their current status.

GET/v1/chat/status

cURL

curl https://api.llmwate.com/v1/chat/status -H "Authorization: Bearer YOUR_API_KEY"

Response

{"providers":{"openai":{"status":"configured"},"anthropic":{"status":"unconfigured"},"siliconflow":{"status":"configured"},"google":{"status":"unconfigured"},"deepseek":{"status":"unconfigured"},"qwen":{"status":"unconfigured"},"meta":{"status":"unconfigured"},"mistral":{"status":"unconfigured"},"cohere":{"status":"unconfigured"},"xai":{"status":"unconfigured"},"perplexity":{"status":"unconfigured"}}}

Configure additional provider API keys to enable more models.

Error Codes

Code	Meaning	Resolution
401	Invalid or missing API key	Check your API key in the dashboard
403	Model not available for your plan	Upgrade your subscription plan
422	Invalid request parameters	Check request body format and types
429	Rate limit exceeded	Wait and retry, or upgrade your plan
500	Internal server error	Retry or contact support
503	Service temporarily unavailable	Check provider status and retry

SDK & Client Libraries

Official client libraries. One API key, 420+ models from 20+ providers.

🐍 Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.llmwate.com/v1"
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

pip install openai

📜 JavaScript

const client = new OpenAI({
  apiKey: process.env.LLMWATE_API_KEY,
  baseURL: "https://api.llmwate.com/v1"
});

const response = await client.chat.completions.create({
  model: "gpt-4o",
  messages: [{ role: "user", content: "Hello!" }]
});

npm install openai

⬡ cURL

curl https://api.llmwate.com/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

No installation required

◈ Go

client := openai.NewClient("YOUR_API_KEY")
client.BaseURL = "https://api.llmwate.com/v1/"

resp, _ := client.CreateChatCompletion(ctx,
    openai.ChatCompletionRequest{
        Model: "gpt-4o",
        Messages: []openai.ChatCompletionMessage{
            {Role: "user", Content: "Hello!"},
        },
    },
)

go get github.com/sashabaranov/go-openai

Base URL: https://api.llmwate.com/v1

API Marketplace

Browse all available models with real-time pricing. Click any model to open it in the Playground.

45

Models

10

Providers

7