Create a Managed Inference Job (Parakeet)
Create a speech-to-text Parakeet Managed Inference Job with the CLI.
Create a Parakeet Managed Inference Job from the CLI. Answer the prompts, and CosmicAC deploys the speech-to-text model behind an OpenAI-compatible transcription endpoint.
Prerequisites
You need the following before you start:
- A running CosmicAC deployment. See Installation.
- The CosmicAC CLI installed and configured. See Install the CLI.
Steps
Create the job
Start the interactive job setup:
cosmicac jobs createSelect Managed Inference (Parakeet) as the job type, then set these fields:
- Job name — a name to identify the job.
- Tags — comma-separated labels for the job.
- Location — the region where the job runs.
- GPU type — the GPU to use. The CLI lists the GPUs available in your location.
- GPU count — the number of GPUs.
- Model — the Parakeet model to serve,
nvidia/parakeet-tdt-0.6b-v3. - Chunk duration — the audio chunk length in seconds.
- Chunk overlap — the overlap between chunks in seconds.
- Max file size (MB) — the maximum audio upload size.
- Endpoint name — a name for the endpoint, used in its URL path.
- Require Authorization header — whether callers must send an API key. See Create an API key.
The Job configuration reference describes each field.
CosmicAC creates the job and prints its ID.
Confirm the deployment
Check that your endpoint is serving:
cosmicac models healthcheckYour endpoint appears as Endpoint: <endpoint-name>.