Private AI
Cloud Platform Deployment
Deploy AI21’s Jamba models on managed cloud services for production workloads
Overview
Deploy AI21’s Jamba models on managed cloud platforms for production-ready, scalable inference. Choose from the following cloud service options:
AWS SageMaker
Deploy Jamba models using Amazon SageMaker’s managed infrastructure.
Google Model Garden
Access Jamba models through Google Cloud’s Vertex AI Model Garden.
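As a sketch of what calling a SageMaker-deployed Jamba endpoint can look like: the `invoke_endpoint` call is the standard boto3 SageMaker Runtime API, but the endpoint name is hypothetical and the chat-style request/response schema is an assumption based on AI21's chat-completions format; check the model card for the deployment you create.

```python
import json


def build_chat_payload(prompt: str, max_tokens: int = 256) -> str:
    """Build a JSON request body for a Jamba chat endpoint.

    NOTE: this messages/max_tokens schema is an assumption based on
    AI21's chat-completions format; verify it against your model card.
    """
    return json.dumps({
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    })


def invoke_jamba(endpoint_name: str, prompt: str) -> str:
    """Invoke a deployed SageMaker endpoint (requires AWS credentials)."""
    import boto3  # deferred import so the payload helper has no AWS dependency

    client = boto3.client("sagemaker-runtime")
    response = client.invoke_endpoint(
        EndpointName=endpoint_name,  # hypothetical name of your deployment
        ContentType="application/json",
        Body=build_chat_payload(prompt),
    )
    body = json.loads(response["Body"].read())
    # Assumed OpenAI-style response shape; adjust for your endpoint.
    return body["choices"][0]["message"]["content"]
```

A Vertex AI Model Garden deployment exposes an analogous predict endpoint through the `google-cloud-aiplatform` SDK; the request body you build is the same kind of JSON payload.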
For self-managed deployments, see our Self Deployment Guide.