Pricing and rate limits

Pay-as-you-go

The prices listed below are exclusive of VAT.

Chat Completions API

Embeddings API

Rate limits

All endpoints have a rate limit of 2 requests per second, 2 million tokens per minute, and 200 million tokens per month. You can check your current rate limits on the platform. If you need to increase them, please contact support with your estimated consumption and use case.

We will raise the limits for embedding models in the future.

Last updated