-
Notifications
You must be signed in to change notification settings - Fork 525
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for NVIDIA MIG #4418
Conversation
86ad6e0
to
9a70605
Compare
08cee62
to
0d8e9ec
Compare
Signed-off-by: Piyush Jena <piyushjena1996@gmail.com>
0d8e9ec
to
2ac59cd
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM if testing comes back green.
Is there a test to make sure that once |
@arnaldo2792 I have added the test towards the end. |
CHANGELOG.md
Outdated
* Add support for NVIDIA Multi-Instance GPU (MIG) ([#4418]) | ||
|
||
## Build Changes | ||
* Update bottlerocket-core-kit to 6.0.2 ([#4416]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd like to have links to the changelogs to help one confirm what the actual delta is here. For example, I have to check the previous entry to see we went from core kit 6.0.1 -> 6.0.2 and it would be great if there was a link on this line giving me the diff for that repo.
07dc803
to
e581904
Compare
e581904
to
e7f43c3
Compare
Issue number:
Fixes: #4406, #4252
Description of changes:
Testing done:
Migration Testing
v1.32.0
Upgrade to v1.34.0
Downgrade to v1.32.0
Feature Testing
Settings
kubectl describe node
outputnvidia-k8s-device-plugin
statusWorkload test
MIG + Time-slicing
kubectl describe node
output (gpus = 8, partitions = 4, replicas = 4)P5 Testing
NVLink status when MIG is enabled
NVLink status when MIG is disabled
Fabric Statuses
MIG enabled - P5
MIG disabled - P5
MIG disabled - P4
MIG enabled - P4
Terms of contribution:
By submitting this pull request, I agree that this contribution is dual-licensed under the terms of both the Apache License, version 2.0, and the MIT license.