The `/api/v1/jobs/messages/batches` endpoint creates multiple jobs in a single request. It is designed for asynchronous processing: submit several jobs at once, then check their status later or receive webhook callbacks when they complete.
```
POST /api/v1/jobs/messages/batches
```
| Header | Type | Required | Description |
|---|---|---|---|
| `Content-Type` | string | Yes | Must be `application/json` |
| `x-api-key` | string | Yes | Your API key for authentication |
The request body contains an array of job creation objects:
| Field | Type | Required | Description |
|---|---|---|---|
| `model` | string | Yes | AI model to use. Options: `gpt-auto`, `gpt-5`, `gpt-5-mini`. See models for additional details and options. |
| `message_cache_ttl` | number | No | Cache TTL in seconds (180-3600). Default: 300 seconds |
| `response_format` | object | Yes | Output format specification. See details. |
| `messages` | array | Yes | Array of messages for the conversation |
| `callback` | object | No | Optional callback configuration for webhook notifications. See details. |
The `callback` parameter allows you to receive webhook notifications when jobs complete:

```json
{
  "callback_url": "https://your-domain.com/webhook",
  "callback_headers": {
    "Authorization": "Bearer your-webhook-token",
    "X-Custom-Header": "custom-value"
  },
  "callback_payload": {
    "job_reference": "batch_123",
    "user_id": "user_456"
  }
}
```

Validation rules:

- At least one message with `role: "user"` is required per job
- Maximum token size per request is based on your plan. See details in the pricing page
- Valid roles: `developer`, `user`, `assistant`
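The rules above can be checked client-side before a batch is submitted, which avoids a round trip that would reject the whole request. A minimal sketch; the `validateJob` helper and its return shape are illustrative, not part of the API:

```javascript
// Valid roles per the documentation above.
const VALID_ROLES = new Set(["developer", "user", "assistant"]);

// Returns an array of human-readable validation errors; empty means the
// job passes these local checks (the server may enforce more, e.g. token limits).
function validateJob(job) {
  const errors = [];
  if (!Array.isArray(job.messages) || job.messages.length === 0) {
    errors.push("messages must be a non-empty array");
  } else {
    if (!job.messages.some((m) => m.role === "user")) {
      errors.push('at least one message with role "user" is required');
    }
    for (const m of job.messages) {
      if (!VALID_ROLES.has(m.role)) {
        errors.push(`invalid role: ${m.role}`);
      }
    }
  }
  return errors;
}
```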
Different user tiers have different batch size limits:
| Tier | Maximum jobs per request |
|---|---|
| `user_free` | 2 jobs |
| `user_go` | 10 jobs |
| `user_pro` | 20 jobs |
| `user_max` | 30 jobs |
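If you have more jobs than your tier allows per request, you can split them into multiple batch requests. A small sketch; the `chunkJobs` helper is illustrative, and the limit values are taken from the table above:

```javascript
// Maximum jobs per batch request by tier (from the table above).
const BATCH_LIMITS = { user_free: 2, user_go: 10, user_pro: 20, user_max: 30 };

// Splits a flat list of job objects into batches that respect the tier limit.
function chunkJobs(jobs, tier) {
  const limit = BATCH_LIMITS[tier];
  if (!limit) throw new Error(`Unknown tier: ${tier}`);
  const batches = [];
  for (let i = 0; i < jobs.length; i += limit) {
    batches.push(jobs.slice(i, i + limit));
  }
  return batches; // send each batch as a separate POST request
}
```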
Example request body:

```json
[
  {
    "model": "gpt-auto",
    "response_format": {
      "format": "text"
    },
    "messages": [
      {
        "role": "user",
        "content": "Explain photosynthesis in simple terms"
      }
    ],
    "callback": {
      "callback_url": "https://your-domain.com/webhook",
      "callback_payload": {
        "job_type": "explanation",
        "topic": "photosynthesis"
      }
    }
  },
  {
    "model": "gpt-auto",
    "response_format": {
      "format": "json",
      "schema": {
        "type": "object",
        "properties": {
          "summary": {
            "type": "string",
            "description": "Brief summary of the concept"
          },
          "key_points": {
            "type": "string",
            "description": "Main key points"
          }
        },
        "required": ["summary", "key_points"],
        "additionalProperties": false
      }
    },
    "messages": [
      {
        "role": "user",
        "content": "Summarize quantum computing in JSON format"
      }
    ]
  }
]
```

| Status Code | Description |
|---|---|
| 202 | Jobs enqueued successfully |
| 400 | Bad request - invalid parameters |
| 401 | Unauthorized - invalid API key |
| 403 | Forbidden - batch size limit exceeded |
| 500 | Internal server error |
Example success response:

```json
{
  "success": true,
  "message": "Job enqueued successfully",
  "data": {
    "job_ids": ["550e8400-e29b-41d4-a716-446655440000", "550e8400-e29b-41d4-a716-446655440001"]
  }
}
```

Example error response:

```json
{
  "success": false,
  "message": "Max jobs per batch request exceeded for free user"
}
```

When you provide a callback configuration, the system will send POST requests to your specified URL when jobs complete.
Example callback payload:

```json
{
  "job_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "completed",
  "created_at": 1704067200,
  "result": {
    "response_message": "The AI response content here..."
  },
  "error": null,
  "callback_payload": {
    "job_reference": "batch_123",
    "user_id": "user_456"
  }
}
```

The callback request will include:

- Any custom headers you specified in `callback_headers`
- Standard HTTP headers
- `Content-Type: application/json`
| Status | Description |
|---|---|
| `completed` | Job finished successfully |
| `failed` | Job failed with an error |
| `pending` | Job is still processing |
| `assigned` | Job has been assigned to a worker |
After submitting a batch, you can check individual job status using the GET endpoint:
```
GET /api/v1/jobs/{job_id}
```
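A polling loop with exponential backoff might look like the sketch below. The endpoint and `x-api-key` header are from this page; the single-job response shape is assumed to match the callback payload (`status`, `result`, `error`), and `waitForJob` is an illustrative name:

```javascript
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Doubles the delay on each attempt, capped at maxMs.
const backoffDelay = (attempt, baseMs = 1000, maxMs = 30000) =>
  Math.min(baseMs * 2 ** attempt, maxMs);

async function waitForJob(jobId, { maxAttempts = 8 } = {}) {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const res = await fetch(`https://api.tokenthon.com/api/v1/jobs/${jobId}`, {
      headers: { "x-api-key": "your-api-key-here" },
    });
    const job = await res.json();
    // Terminal states per the status table above.
    if (job.status === "completed" || job.status === "failed") return job;
    await sleep(backoffDelay(attempt)); // pending/assigned: wait and retry
  }
  throw new Error(`Job ${jobId} did not finish within ${maxAttempts} polls`);
}
```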
Example (JavaScript):

```javascript
const API_CONFIG = {
  BASE_URL: "https://api.tokenthon.com",
  ENDPOINT: "/api/v1/jobs/messages/batches",
  HEADERS: {
    "Content-Type": "application/json",
    "x-api-key": "your-api-key-here",
  },
};

async function createBatchJobs() {
  const batchRequests = [
    {
      model: "gpt-auto",
      response_format: { format: "text" },
      messages: [
        {
          role: "user",
          content: "What is the capital of France?",
        },
      ],
      callback: {
        callback_url: "https://your-domain.com/webhook",
        callback_payload: { question_id: "q1" },
      },
    },
    {
      model: "gpt-auto",
      response_format: { format: "text" },
      messages: [
        {
          role: "user",
          content: "What is the capital of Germany?",
        },
      ],
      callback: {
        callback_url: "https://your-domain.com/webhook",
        callback_payload: { question_id: "q2" },
      },
    },
  ];

  try {
    const response = await fetch(`${API_CONFIG.BASE_URL}${API_CONFIG.ENDPOINT}`, {
      method: "POST",
      headers: API_CONFIG.HEADERS,
      body: JSON.stringify(batchRequests),
    });
    const result = await response.json();
    if (!response.ok) {
      throw new Error(`Error: ${result.message}`);
    }
    console.log("Batch jobs created:", result.data.job_ids);
    return result.data.job_ids;
  } catch (error) {
    console.error("Error creating batch jobs:", error);
    throw error;
  }
}

// Usage
createBatchJobs()
  .then((jobIds) => {
    console.log("Job IDs:", jobIds);
    // Store job IDs for later status checking
  })
  .catch((error) => console.error(error));
```

Example (cURL):

```shell
curl -X POST "https://api.tokenthon.com/api/v1/jobs/messages/batches" \
  -H "Content-Type: application/json" \
  -H "x-api-key: your-api-key-here" \
  -d '[
    {
      "model": "gpt-auto",
      "response_format": {
        "format": "text"
      },
      "messages": [
        {
          "role": "user",
          "content": "Explain machine learning"
        }
      ]
    },
    {
      "model": "gpt-auto",
      "response_format": {
        "format": "text"
      },
      "messages": [
        {
          "role": "user",
          "content": "Explain deep learning"
        }
      ]
    }
  ]'
```

- Use the maximum batch size allowed for your tier to optimize throughput
- Consider job complexity when determining optimal batch sizes
- Ensure your webhook endpoint can handle concurrent requests
- Implement idempotency to handle duplicate callback deliveries
- Include proper error handling and retry logic
- Monitor batch responses for partially successful operations
- Implement retry logic for failed jobs using the returned job IDs
- Log job IDs for debugging and monitoring purposes
- Implement exponential backoff when polling job status
- Use callbacks for real-time notifications instead of polling when possible
- Cache job results to avoid redundant API calls
If some jobs in a batch fail validation:
- The entire batch request will be rejected
- No jobs will be created
- Fix the validation errors and resubmit the entire batch
If jobs fail during processing:
- Use the job IDs to check individual job status
- Resubmit failed jobs with corrected parameters
- Monitor error messages for common issues
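The triage above can be sketched as follows, assuming the single-job GET returns the same fields as the callback payload; `fetchJob`, `partitionByStatus`, and `collectFailedJobs` are illustrative names, not part of the API:

```javascript
// Splits job records by their documented status values.
function partitionByStatus(jobs) {
  const done = [], failed = [], inFlight = [];
  for (const job of jobs) {
    if (job.status === "completed") done.push(job);
    else if (job.status === "failed") failed.push(job);
    else inFlight.push(job); // pending or assigned
  }
  return { done, failed, inFlight };
}

async function fetchJob(jobId) {
  const res = await fetch(`https://api.tokenthon.com/api/v1/jobs/${jobId}`, {
    headers: { "x-api-key": "your-api-key-here" },
  });
  return res.json();
}

// Gathers the failed jobs from a batch so they can be resubmitted
// with corrected parameters.
async function collectFailedJobs(jobIds) {
  const jobs = await Promise.all(jobIds.map(fetchJob));
  return partitionByStatus(jobs).failed;
}
```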
If webhook callbacks fail:
- The system will attempt delivery (retry policy may vary)
- Implement endpoint monitoring and alerting
- Consider implementing job status polling as a fallback
Related endpoints:

- `POST /api/v1/jobs/messages`: create and execute a single job
- `GET /api/v1/jobs/:jobId`: get a job result by ID