Skip to main content

Documentation Index

Fetch the complete documentation index at: https://runinfra.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

GET https://api.runinfra.ai/v1/models
Returns verified deployed models across all active_verified deployments in your workspace. The shape matches OpenAI’s client.models.list(), so any SDK that calls it gets a correct answer out of the box.

Request

from openai import OpenAI
client = OpenAI(base_url="https://api.runinfra.ai/v1", api_key="YOUR_RUNINFRA_API_KEY")

for model in client.models.list().data:
    print(model.id, model.owned_by)

Response

{
  "object": "list",
  "data": [
    {
      "id": "llama-3.3-70b",
      "object": "model",
      "created": 1714502400,
      "owned_by": "runinfra"
    },
    {
      "id": "bge-m3",
      "object": "model",
      "created": 1714502400,
      "owned_by": "runinfra"
    }
  ]
}

Semantics

  • De-duplicated - the same modelId deployed in two different verified pipelines appears only once.
  • Live - reflects verified deployment state; stopped, failed, health-only, or unverified deployments disappear from the list within seconds.
  • No credits charged - model discovery is free; rate-limit budget is not consumed by this endpoint so SDK boot flows that call it eagerly don’t inflate your quota usage.
  • Pipeline-scoped keys see only their pipeline’s verified models; workspace-scoped keys see verified deployed models in the workspace.

Notes

The created timestamp is cosmetic - it reflects the response time, not the deployment’s actual create time. SDK clients generally only rely on id and object from this payload.
Models your pipeline uses but hasn’t deployed (e.g., listed in the Model tab but with status stopped) do NOT appear here. This endpoint reflects runtime reachability, not pipeline configuration.