feat: Integrate NVIDIA/k8s-device-plugin into Project-HAMi/HAMi #855
…ject-HAMi#323) Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: wawa0210 <[email protected]>
Bumps [docker/login-action](https://github.com/docker/login-action) from 3.1.0 to 3.2.0. - [Release notes](https://github.com/docker/login-action/releases) - [Commits](docker/login-action@v3.1.0...v3.2.0) --- updated-dependencies: - dependency-name: docker/login-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: wawa0210 <[email protected]>
Bumps ubuntu from 20.04 to 24.04. --- updated-dependencies: - dependency-name: ubuntu dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: wawa0210 <[email protected]>
…evice-plugin crashes. (Project-HAMi#328) Signed-off-by: chaunceyjiang <[email protected]>
* 💡 Add Benchmark build script and deployment files Signed-off-by: haitwang-cloud <[email protected]> * address latest comment Signed-off-by: haitwang-cloud <[email protected]> * Update Dockerfile to use the correct repository for cloning the ai-benchmark project Signed-off-by: haitwang-cloud <[email protected]> --------- Signed-off-by: haitwang-cloud <[email protected]>
Signed-off-by: coderth <[email protected]>
Signed-off-by: rongfu.leng <[email protected]>
…roject-HAMi#350) * fix: typo in use_exclusive_card.yaml and add new exclusive example Signed-off-by: wenqing.he <[email protected]> * fix: typo in scheduler Signed-off-by: wenqing.he <[email protected]> --------- Signed-off-by: wenqing.he <[email protected]>
Signed-off-by: rongfu.leng <[email protected]>
Bumps [golang.org/x/net](https://github.com/golang/net) from 0.22.0 to 0.26.0. - [Commits](golang/net@v0.22.0...v0.26.0) --- updated-dependencies: - dependency-name: golang.org/x/net dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Bumps [github.com/spf13/cobra](https://github.com/spf13/cobra) from 1.7.0 to 1.8.1. - [Release notes](https://github.com/spf13/cobra/releases) - [Commits](spf13/cobra@v1.7.0...v1.8.1) --- updated-dependencies: - dependency-name: github.com/spf13/cobra dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…amespaces (Project-HAMi#364) The code changes in `webhook.yaml` modify the namespace whitelist for the webhook. The `kube-system` and `istio-system` namespaces are added to the whitelist. This change aims to allow the webhook to function properly in these namespaces. Signed-off-by: haitwang-cloud <[email protected]> Co-authored-by: Tim Wang <[email protected]>
Fix HostGPUMemoryUsage wrong unit on grafana dashboard Signed-off-by: joy717 <[email protected]>
Signed-off-by: rongfu.leng <[email protected]>
* feat: add liveness probe on scheduler extender Signed-off-by: wenqing.he <[email protected]> * feat: also add liveness probe on kube-scheduler Signed-off-by: wenqing.he <[email protected]> --------- Signed-off-by: wenqing.he <[email protected]>
Signed-off-by: 王然 <[email protected]>
07cfdd2 to 05bd32a
…tpl (Project-HAMi#845) * Update condition to include regexReplaceAll for outputting proper numbers from minor versions Signed-off-by: HJJ256 <[email protected]> * Update condition Signed-off-by: HJJ256 <[email protected]> * Update definition of strippedKubeVersion to handle variety of version numbering systems Signed-off-by: HJJ256 <[email protected]> * Update job-createSecret.yaml Signed-off-by: HJJ256 <[email protected]> * Update job-patchWebhook.yaml Signed-off-by: HJJ256 <[email protected]> --------- Signed-off-by: HJJ256 <[email protected]>
* update libvgpu Signed-off-by: limengxuan <[email protected]>
Signed-off-by: rockpanda <[email protected]>
Due to potential pod startup issues in certain environments, set passDeviceSpecsEnabled to false by default. This configuration can still be enabled via helm values for environments that need it to handle runtime GPU access issues. Signed-off-by: Nimbus318 <[email protected]>
4e86da3 to 2bda58a
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: haitwang-cloud
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
2bda58a to b054212
b893505 to d8ec2cd
…rfaces Signed-off-by: haitwang-cloud <[email protected]>
d8ec2cd to 4521121
Thanks for your pull request. Before we can look at it, you'll need to add a 'DCO signoff' to your commits.
📝 Please follow instructions in the contributing guide to update your commits with the DCO.
Full details of the Developer Certificate of Origin can be found at developercertificate.org.
The list of commits missing DCO signoff:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.
a56b1b0 to 4521121
Add NVIDIA k8s device plugin as a submodule and implement initial device plugin structure
What type of PR is this?
/kind feature
What this PR does / why we need it:
This pull request integrates NVIDIA/k8s-device-plugin into Project-HAMi/HAMi. It adds the NVIDIA/k8s-device-plugin repository as a submodule tracking the release-0.17 branch, and adds a script that updates the submodule and copies the necessary files into the appropriate directories of Project-HAMi/HAMi.
Changes Made
Add NVIDIA k8s-device-plugin as a Submodule:
Added the NVIDIA/k8s-device-plugin repository as a submodule in the third_party/nvidia/k8s-device-plugin directory.
The submodule points to the release-0.17 branch.
Create Update Script:
Created a script named update_nvidia_plugin.sh to update the submodule to the latest version of the specified branch and copy the necessary files to the project's directories.
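For reviewers who want a concrete picture of the two changes above, here is a minimal sketch of how such a workflow could look. It is not the contents of update_nvidia_plugin.sh from this PR. The submodule path and branch are taken from the description; the repository URL and all three function names are assumptions made for illustration, and the PR does not say which files are copied or where they go.

```shell
#!/bin/sh
# Sketch only: not the actual update_nvidia_plugin.sh from this PR.
set -eu

# Path and branch come from the PR description; the URL is an assumption.
PLUGIN_REPO_URL="https://github.com/NVIDIA/k8s-device-plugin.git"
PLUGIN_PATH="third_party/nvidia/k8s-device-plugin"
PLUGIN_BRANCH="release-0.17"

# One-time setup: register the submodule and record the tracked branch
# in .gitmodules (hypothetical helper name).
add_plugin_submodule() {
  git submodule add -b "$PLUGIN_BRANCH" "$PLUGIN_REPO_URL" "$PLUGIN_PATH"
}

# Move the submodule checkout to the current tip of the branch recorded
# in .gitmodules (hypothetical helper name).
update_plugin_submodule() {
  git submodule update --init --remote -- "$PLUGIN_PATH"
}

# Copy the contents of directory $1 into directory $2, creating $2 if needed
# (hypothetical helper name; source and destination are chosen by the caller).
copy_dir() {
  mkdir -p "$2"
  cp -R "$1"/. "$2"/
}
```

A run would call update_plugin_submodule from the repository root and then copy_dir once per directory to be vendored, with the source under third_party/nvidia/k8s-device-plugin and a destination chosen by the project. Keeping the copy step separate from the git step means the copy can be exercised without network access.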
Which issue(s) this PR fixes:
Fixes #
Special notes for your reviewer:
Does this PR introduce a user-facing change?: