Issues
is:issue state:open
Search results
[Feature]: Driver upgrades cannot free GPU consumers that lack an nvidia.com/gpu request or run as DaemonSets
#2570 in NVIDIA/gpu-operator. Status: Open. Labels: feature, needs-triage.

[Bug]: Driver pod fails to recreate after GPU hot-detach/re-attach: /lib/firmware/nvidia ENOENT and stale firmware_class search path
#2559 in NVIDIA/gpu-operator. Status: Open. Labels: bug, needs-triage.

[Feature]: Support running nv-fabricmanager in Shared NVSwitch (fabric partition) mode with a configurable command socket in the driver daemonset
#2552 in NVIDIA/gpu-operator. Status: Open. Labels: feature, needs-triage.

[Bug]: nvidia-operator-validator fails on nodes with nvidia.com/gpu.deploy.device-plugin=false
#2550 in NVIDIA/gpu-operator. Status: Open. Labels: bug, needs-triage.

[Bug]: Gpu-Operator Upgrade Controller Incorrectly Transitions from upgrade-failed to upgrade-done
#2549 in NVIDIA/gpu-operator. Status: Open. Labels: bug, needs-triage.

[Feature]: Provide good securityContext by default
#2533 in NVIDIA/gpu-operator. Status: Open. Labels: feature, needs-triage.

[Feature]: Publish the gpu-operator Helm chart as an OCI artifact
#2520 in NVIDIA/gpu-operator. Status: Open. Labels: feature.

Update Symlink evaluation for NVIDIA-SMI
#2506 in NVIDIA/gpu-operator. Status: Open. Labels: feature, needs-triage.

Decouple host client shutdown from host driver setting
Status: Open. Labels: enhancement.

[Bug]: dcgm-exporter appears to stall after "Initializing system entities of type 'CPU Core'" on H200 with dense MIG topology
Status: Open. Labels: needs-triage.

[Enhancement]: Enhance RHEL Driver Image Selection to Use Major-Version Tags for RHEL 8 and RHEL 9
Status: Open. Labels: enhancement, needs-triage.

Label descriptions
feature: issue/PR that proposes a new feature or functionality
bug: Issue/PR to expose/discuss/fix a bug
enhancement: Improvements to existing features, performance, or usability (not bug fixes or new features).
needs-triage: issue or PR has not been assigned a priority-px label