Models - RunInfra

GET https://api.runinfra.ai/v1/models

Returns verified deployed models across all active_verified deployments in your workspace. The shape matches OpenAI’s client.models.list() response for supported model-list clients.

Request

from openai import OpenAI
client = OpenAI(base_url="https://api.runinfra.ai/v1", api_key="YOUR_RUNINFRA_API_KEY")

for model in client.models.list().data:
    print(model.id, model.owned_by)

const models = await client.models.list();
models.data.forEach((m) => console.log(m.id, m.owned_by));

curl https://api.runinfra.ai/v1/models \
  -H "Authorization: Bearer YOUR_RUNINFRA_API_KEY"

Response

{
  "object": "list",
  "data": [
    {
      "id": "llama-3.3-70b",
      "object": "model",
      "created": 1714502400,
      "owned_by": "runinfra"
    },
    {
      "id": "bge-m3",
      "object": "model",
      "created": 1714502400,
      "owned_by": "runinfra"
    }
  ]
}

Semantics

De-duplicated - the same modelId deployed in two different verified pipelines appears only once.
Live - reflects verified deployment state; stopped, failed, health-only, or unverified deployments disappear from the list within seconds.
No credits charged - model discovery is free; rate-limit budget is not consumed by this endpoint so SDK boot flows that call it eagerly don’t inflate your quota usage.
Pipeline-scoped keys see only their pipeline’s verified models; workspace-scoped keys see verified deployed models in the workspace.

Notes

The created timestamp is cosmetic - it reflects the response time, not the deployment’s actual create time. SDK clients generally only rely on id and object from this payload.

Models your pipeline uses but hasn’t deployed (e.g., listed in the Model tab but with status stopped) do NOT appear here. This endpoint reflects runtime reachability, not pipeline configuration.

Audio

Rate limits

⌘I

​Request

​Response

​Semantics

​Notes

Request

Response

Semantics

Notes