-
Notifications
You must be signed in to change notification settings - Fork 16
Description
I'm currently evaluating Coder and so far its great! Definitely beats manually provisioning workspaces.
I had a few questions and some minor issues
Environment
- Provider:
aws-eks
- K8s Version:
1.21
- Coder Helm Version:
1.29.1
In our cluster, we use ASGs, and specifically for GPUs, we separate them by the instance-type
size as well as the GPU type.
Example
ASG 1: T4-XL
- g4dn.xlarge
- Node Labels: compute-role:gpu, compute-size:xlarge, gpu-type:t4
ASG 2: A10G-XL
- g5.xlarge
- Node Labels: compute-role:gpu, compute-size:xlarge, gpu-type:a10g
ASG 3: Mixed-XL
- g4dn.xlarge
- g5.xlarge
- Node Labels: compute-role:gpu, compute-size:xlarge, gpu-type:mixed
Questions:
-
Are there any future plans to allow the admin to specify node-selectors/taints based on images? For CUDA enabled images, we would pre-select the node-selectors and taints to ensure that the image gets properly provisioned with a GPU node, rather than a CPU node.
-
Follow-on, would it be possible to allow users to specify the node-selectors/taints when creating workspaces without using a template? (if option is enabled by admin)
-
Is there a way to adjust/specify the session-timeout for OIDC? Currently it seems like the limit is 60 mins before refresh kicks in and requires reauth.
Issues:
I was trying to have the node-selector modified by using a template that did specify compute-type:gpu, compute-role:coder
, but within the provider settings, only compute-role:coder
is defined.
However, after testing the template, and subsequently deleting it, several workspaces that were provisioned afterwards retained the nodeSelectors that were defined only in the template itself, rather than sticking strictly with the provider specified one.
In Template Policy
, I do have write enabled for node-selector
so I wonder if that's what's causing the issue.
Thanks!