Create a deployment
Base model
Fine-tuned model
Parameters
| Parameter | Required | Description |
|---|---|---|
name | Yes | A name for the deployment. Must be 5–100 characters. |
description | Yes | A description of the deployment. Must be 5–1000 characters. |
model_type | Yes | DeploymentType.BASE_MODEL for a base model or DeploymentType.FINE_TUNED_RUN for a fine-tuned model. |
model_id | Yes | The model ID (base model name or fine-tuning job ID) to deploy. |
n_instances | Yes | Number of dedicated instances to provision. Must be between 1 and 50. |
Deployment status
| Status | Description |
|---|---|
Pending | Deployment requested, infrastructure provisioning in progress. |
Active | Serving inference traffic. |
Inactive | Paused, not serving requests. |
Failed | Error during startup or runtime. |