Container-Deployment

Configurations for SCR deployment

To create the SAS Container Runtime (SCR) publishing destination, we recommend using the SAS Viya CLI. The command detailed below creates a publishing destination that uses an Azure Container Registry:

Example deployment YAMLs are provided in the SCR-LLM-Deployment-YAML folder; they are built for Azure. The following assumptions are made:

  • A namespace called llm exists, in which the SCR containers will be deployed.
  • No resource limits are currently imposed, so we recommend a separate node pool for this workload. In non-production environments, a spot instance with plenty of CPU and RAM is recommended, e.g. Standard_D64s_v5.
  • The URL endpoint schema is host/llm/model_name, which is where the container is reached. The full address for a container is therefore host/llm/model_name/model_name.
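
The endpoint schema in the last assumption can be sketched in Python as follows. This is a minimal illustration, not part of the repository: the function name scr_endpoint, the example host, and the example model name are hypothetical, and the only thing taken from the text above is the host/llm/model_name/model_name pattern.

```python
def scr_endpoint(host: str, model_name: str, path_prefix: str = "llm") -> str:
    """Build the full address of an SCR container.

    Assumes the schema described above: the container is exposed under
    host/llm/model_name, and the model name is repeated once more in the
    full address. `path_prefix` defaults to "llm" as stated in the text.
    """
    base = host.rstrip("/")  # avoid a double slash if host ends with "/"
    return f"{base}/{path_prefix}/{model_name}/{model_name}"


# Hypothetical host and model name, used for illustration only.
print(scr_endpoint("https://viya.example.com", "my_model"))
```

If the deployment YAMLs are adapted to use a different path prefix, only the path_prefix argument would need to change in this sketch.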