K3s not detecting GPUs for nodes #8575
Comments
sarahwooders changed the title from "K3 not detecting GPUs for nodes" to "K3s not detecting GPUs for nodes" on Oct 9, 2023
How was it resolved?
I have a similar issue. I can see an available AMD GPU on my node:
The pod's container requests a GPU:
but I still see errors from the scheduler:
k3s service logs:
How was this resolved? We're having a similar issue.
For me, the solution was here: #10534 (comment)
Environmental Info:
K3s Version: v1.27.6+k3s1 (bd04941)
go version go1.20.8
Node OS: Ubuntu 20.04.6 LTS
Node(s) CPU architecture:
GPUs:
Cluster Configuration:
1 server (running on GCP e2-standard-2 instance), 2 agents (running on GCP g2-standard-4 instances with L4 GPUs)
Describe the bug:
I am trying to run pods on GPU nodes, but K3s does not seem to detect the GPUs.
Steps To Reproduce:
nvidia-container-toolkit
https://cloud.google.com/compute/docs/gpus/install-drivers-gpu#installation_scripts
Expected behavior:
I expect pods to be able to be scheduled on GPU nodes.
Actual behavior:
When I try to run a pod on the cluster, the pod is stuck in the "Pending" status.
When I run kubectl get events, I see the following error:
When I run kubectl describe nodes, the "Allocated Resource" section does not show any GPUs:
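One way to check whether the GPUs are visible to Kubernetes at all is to look at each node's status.allocatable in the output of kubectl get nodes -o json. The sketch below is only an illustration, not something from the original report: it assumes the GPUs would be advertised under the extended resource names nvidia.com/gpu or amd.com/gpu (the names the NVIDIA and AMD device plugins register), and the helper names gpu_allocatable and fetch_nodes are hypothetical.

```python
import json
import subprocess

# Assumption: GPUs are advertised under one of these extended resource names,
# which is what the NVIDIA and AMD device plugins register with the kubelet.
GPU_RESOURCE_NAMES = ("nvidia.com/gpu", "amd.com/gpu")


def gpu_allocatable(node_list):
    """Map node name -> {resource name: count} for GPU entries in status.allocatable.

    `node_list` is the parsed JSON produced by `kubectl get nodes -o json`.
    A node that advertises no GPU resource maps to an empty dict.
    """
    result = {}
    for node in node_list.get("items", []):
        name = node["metadata"]["name"]
        allocatable = node.get("status", {}).get("allocatable", {})
        result[name] = {
            key: int(value)
            for key, value in allocatable.items()
            if key in GPU_RESOURCE_NAMES
        }
    return result


def fetch_nodes():
    """Run `kubectl get nodes -o json` and return the parsed node list (hypothetical helper)."""
    completed = subprocess.run(
        ["kubectl", "get", "nodes", "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(completed.stdout)
```

Calling gpu_allocatable(fetch_nodes()) from a machine with kubectl access returns one entry per node. An empty dict for a GPU agent would suggest that no device plugin has registered the GPUs with the kubelet, which would be consistent with pods that request a GPU staying unscheduled.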