Add NVIDIA tools for GPU support in containers #1900

Merged
merged 2 commits into from
Jan 13, 2022

arnaldo2792
Contributor

Issue number:
N / A

Description of changes:

Related to #1799. That PR is quite big, so I'm splitting some of its commits into individual PRs. The packages in this PR were built in #1799, but this PR also updates them to their latest versions.

* bb4a5a30 packages: add nvidia-container-toolkit
* 24b3812a packages: add libnvidia-container

Testing done:

I included both commits in a local branch based on #1799 and launched a container with k8s. I was able to execute nvidia-smi from the container:

~ ❯ kubectl exec cuda-vectoradd -- nvidia-smi
Wed Jan 12 02:42:10 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.82.01    Driver Version: 470.82.01    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla V100-SXM2...  Off  | 00000000:00:1E.0 Off |                    0 |
| N/A   32C    P0    25W / 300W |      0MiB / 16160MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
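
For anyone who wants to script this check rather than read the table by eye, here is a minimal sketch that pulls the driver and CUDA versions out of the header line shown above. The function name `parse_smi_header` is hypothetical and not part of this PR; the sketch assumes the header keeps the `Driver Version: ... CUDA Version: ...` layout seen in this output.

```python
import re

# Header line copied verbatim from the nvidia-smi output above.
HEADER = "| NVIDIA-SMI 470.82.01    Driver Version: 470.82.01    CUDA Version: 11.2     |"


def parse_smi_header(line: str) -> dict:
    """Extract driver and CUDA versions from an nvidia-smi header line.

    Hypothetical helper for illustration; assumes the header layout above.
    """
    match = re.search(
        r"Driver Version:\s*([\d.]+)\s+CUDA Version:\s*([\d.]+)", line
    )
    if match is None:
        raise ValueError("not an nvidia-smi header line")
    return {"driver": match.group(1), "cuda": match.group(2)}


versions = parse_smi_header(HEADER)
print(versions)
```

In practice the input line would come from the captured output of the `kubectl exec` command above instead of a hard-coded string.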

Terms of contribution:

By submitting this pull request, I agree that this contribution is dual-licensed under the terms of both the Apache License, version 2.0, and the MIT license.

@arnaldo2792
Contributor Author

Forced push includes:

  • Use %make_build
  • Fix go ld flags
  • Fix symlink to shimpei

@arnaldo2792 arnaldo2792 requested a review from bcressey January 13, 2022 00:48
@arnaldo2792 arnaldo2792 requested a review from webern January 13, 2022 22:55
libnvidia-container provides a CLI to configure NVIDIA GPUs for
orchestrated containers

Signed-off-by: Arnaldo Garcia Rincon <agarrcia@amazon.com>
This package is required to support NVIDIA GPUs within orchestrated
containers

Signed-off-by: Arnaldo Garcia Rincon <agarrcia@amazon.com>
@arnaldo2792
Contributor Author

Forced push includes releases-url in both packages

@arnaldo2792 arnaldo2792 merged commit ca61ab5 into bottlerocket-os:develop Jan 13, 2022
@arnaldo2792 arnaldo2792 deleted the nvidia-packages branch January 14, 2022 20:23
4 participants