Skip to content

Commit d1e8350

Browse files
tkatilauniemimu
authored andcommitted
gpu: add new nfd + monitoring + shared-dev deployment option
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
1 parent 973f1a3 commit d1e8350

File tree

4 files changed

+43
-0
lines changed

4 files changed

+43
-0
lines changed

cmd/gpu_plugin/README.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,7 @@ Table of Contents
1515
* [Pre-built Images](#pre-built-images)
1616
* [Install to all nodes](#install-to-all-nodes)
1717
* [Install to nodes with Intel GPUs with NFD](#install-to-nodes-with-intel-gpus-with-nfd)
18+
* [Install to nodes with NFD, Monitoring and Shared-dev](#install-to-nodes-with-nfd-monitoring-and-shared-dev)
1819
* [Install to nodes with Intel GPUs with Fractional resources](#install-to-nodes-with-intel-gpus-with-fractional-resources)
1920
* [Fractional resources details](#fractional-resources-details)
2021
* [Verify Plugin Registration](#verify-plugin-registration)
@@ -167,6 +168,21 @@ $ kubectl apply -k 'https://github.com/intel/intel-device-plugins-for-kubernetes
167168
$ kubectl apply -k 'https://github.com/intel/intel-device-plugins-for-kubernetes/deployments/gpu_plugin/overlays/nfd_labeled_nodes?ref=<RELEASE_VERSION>'
168169
```
169170

171+
#### Install to nodes with NFD, Monitoring and Shared-dev
172+
173+
Same as above, but configures GPU plugin with logging, [monitoring and shared-dev](#modes-and-configuration-options) features enabled. This option is useful when there is a desire to retrieve GPU metrics from nodes. For example with [XPU-Manager](https://github.com/intel/xpumanager/) or [collectd](https://github.com/collectd/collectd/tree/collectd-6.0).
174+
175+
```bash
176+
# Start NFD - if your cluster doesn't have NFD installed yet
177+
$ kubectl apply -k 'https://github.com/intel/intel-device-plugins-for-kubernetes/deployments/nfd?ref=<RELEASE_VERSION>'
178+
179+
# Create NodeFeatureRules for detecting GPUs on nodes
180+
$ kubectl apply -k 'https://github.com/intel/intel-device-plugins-for-kubernetes/deployments/nfd/overlays/node-feature-rules?ref=<RELEASE_VERSION>'
181+
182+
# Create GPU plugin daemonset
183+
$ kubectl apply -k 'https://github.com/intel/intel-device-plugins-for-kubernetes/deployments/gpu_plugin/overlays/monitoring_shared-dev_nfd/?ref=<RELEASE_VERSION>'
184+
```
185+
170186
#### Install to nodes with Intel GPUs with Fractional resources
171187

172188
With the experimental fractional resource feature you can use additional kubernetes extended
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
apiVersion: apps/v1
2+
kind: DaemonSet
3+
metadata:
4+
name: intel-gpu-plugin
5+
spec:
6+
template:
7+
spec:
8+
containers:
9+
- name: intel-gpu-plugin
10+
args:
11+
- "-shared-dev-num=30"
12+
- "-enable-monitoring"
13+
- "-v=2"
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
apiVersion: apps/v1
2+
kind: DaemonSet
3+
metadata:
4+
name: intel-gpu-plugin
5+
spec:
6+
template:
7+
spec:
8+
nodeSelector:
9+
intel.feature.node.kubernetes.io/gpu: "true"
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
bases:
2+
- ../../base
3+
patches:
4+
- path: add-args.yaml
5+
- path: add-nodeselector-intel-gpu.yaml

0 commit comments

Comments
 (0)