Cloud-based services
Cloud providers have their own rate limits. For instance, Amazon SageMaker rate limits are determined by the instance that you deploy to hold the model. Amazon Bedrock has their own pricing and rate limits for AI21 model usage. See your cloud provider’s documentation for details.Foundation models
Foundation models have usage limits per second (RPS) and per minute (RPM):Foundation Model | RPS | RPM |
---|---|---|
Jamba Large | 10 | 200 |
Jamba Mini | 10 | 200 |