Skip to main content

Chat Completions API

POST /v1/chat/completions is the most commonly used endpoint for multi-turn conversations with language models.

Request​

POST https://real200.com/v1/chat/completions

Headers​

HeaderTypeRequiredDescription
AuthorizationstringYesBearer sk-real200-xxx
Content-TypestringYesapplication/json

Body Parameters​

ParameterTypeRequiredDefaultDescription
modelstringYes-Model name, e.g., gpt-4o, claude-sonnet-4
messagesarrayYes-Conversation message list
temperaturenumberNo1.0Randomness, range 0–2
top_pnumberNo1.0Nucleus sampling threshold
max_tokensintegerNoModel defaultMaximum generation tokens
streambooleanNofalseEnable streaming output

Example Request​

curl https://real200.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $REAL200_API_KEY" \
-d '{
"model": "gpt-4o",
"messages": [
{"role": "user", "content": "Introduce yourself in three sentences."}
],
"temperature": 0.7,
"max_tokens": 200
}'

Response​

Non-Streaming Response​

{
"id": "chatcmpl-abc123",
"object": "chat.completion",
"created": 1716300000,
"model": "gpt-4o",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "I am an AI assistant..."
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 12,
"completion_tokens": 45,
"total_tokens": 57
}
}

Streaming Response​

When stream: true is enabled, responses are returned in text/event-stream format.

Error Codes​

HTTP StatusError TypeDescription
400invalid_request_errorRequest parameter error
401invalid_api_keyInvalid API Key
402insufficient_quotaInsufficient balance or quota
429rate_limit_exceededRequest rate limit exceeded
500api_errorServer error

See Error Code Dictionary.