update_aks_webservice: Update a deployed AKS web service

Description

Update an AKS web service with the provided properties. You can update the web service to use a new model, a new entry script, or new dependencies that can be specified in an inference configuration.

Values left as NULL will remain unchanged in the web service.

Usage

update_aks_webservice(
  webservice,
  autoscale_enabled = NULL,
  autoscale_min_replicas = NULL,
  autoscale_max_replicas = NULL,
  autoscale_refresh_seconds = NULL,
  autoscale_target_utilization = NULL,
  auth_enabled = NULL,
  cpu_cores = NULL,
  memory_gb = NULL,
  enable_app_insights = NULL,
  scoring_timeout_ms = NULL,
  replica_max_concurrent_requests = NULL,
  max_request_wait_time = NULL,
  num_replicas = NULL,
  tags = NULL,
  properties = NULL,
  description = NULL,
  models = NULL,
  inference_config = NULL,
  gpu_cores = NULL,
  period_seconds = NULL,
  initial_delay_seconds = NULL,
  timeout_seconds = NULL,
  success_threshold = NULL,
  failure_threshold = NULL,
  namespace = NULL,
  token_auth_enabled = NULL
)

Arguments

webservice

The AksWebservice object.

autoscale_enabled

If TRUE enable autoscaling for the web service.

autoscale_min_replicas

An int of the minimum number of containers to use when autoscaling the web service.

autoscale_max_replicas

An int of the maximum number of containers to use when autoscaling the web service.

autoscale_refresh_seconds

An int of how often in seconds the autoscaler should attempt to scale the web service.

autoscale_target_utilization

An int of the target utilization (in percent out of 100) the autoscaler should attempt to maintain for the web service.

auth_enabled

If TRUE enable key-based authentication for the web service. Defaults to TRUE.

cpu_cores

The number of cpu cores to allocate for the web service. Can be a decimal. Defaults to 0.1.

memory_gb

The amount of memory (in GB) to allocate for the web service. Can be a decimal. Defaults to 0.5.

enable_app_insights

If TRUE enable AppInsights for the web service. Defaults to FALSE.

scoring_timeout_ms

An int of the timeout (in milliseconds) to enforce for scoring calls to the web service.

replica_max_concurrent_requests

An int of the number of maximum concurrent requests per node to allow for the web service.

max_request_wait_time

An int of the maximum amount of time a request will stay in the queue (in milliseconds) before returning a 503 error.

num_replicas

An int of the number of containers to allocate for the web service. If this parameter is not set then the autoscaler is enabled by default.

Value

None