Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

inventory operator has too long of GPU labels #277

Closed
Zblocker64 opened this issue Jan 28, 2025 · 3 comments
Closed

inventory operator has too long of GPU labels #277

Zblocker64 opened this issue Jan 28, 2025 · 3 comments

Comments

@Zblocker64
Copy link

Zblocker64 commented Jan 28, 2025

When using a rtx 4080 super, the label is too long for k3s/k8s to handle

ERROR	nodes.node.monitor	couldn't apply patches for node "uzinode"	{"error": "Node \"uzinode\" is invalid: metadata.labels: Invalid value: \"akash.network/capabilities.gpu.vendor.nvidia.model.rtx4080super.interface.pcie\": name part must be no more than 63 characters"}

Assumption is we need to rework how the labeling is done.

Maybe akash.network/gpu.nvidia.rtx4080super.interface.pcie

Other possibility is to name super cards rtx4080s s for super

However this would still cause issues for a card like the rtx4000sffada

@uzi-akash
Copy link

uzi-akash commented Jan 28, 2025

Thank You 🥇 Akash Team
This was moved to: #233

@cloud-j-luna
Copy link
Member

When using a rtx 4080 super, the label is too long for k3s/k8s to handle

ERROR	nodes.node.monitor	couldn't apply patches for node "uzinode"	{"error": "Node \"uzinode\" is invalid: metadata.labels: Invalid value: \"akash.network/capabilities.gpu.vendor.nvidia.model.rtx4080super.interface.pcie\": name part must be no more than 63 characters"}

Assumption is we need to rework how the labeling is done.

Maybe akash.network/gpu.nvidia.rtx4080super.interface.pcie

Other possibility is to name super cards rtx4080s s for super

However this would still cause issues for a card like the rtx4000sffada

I wouldn't close this issue.
GPU codes could be part of the value in the label, but then it would require a list of GPU codes for the "GPU capability", there is a limit of 63 characters in the value as well which I believe was the reason for the current labelling.
Is there any historical context available as to why the choice was to use labels instead of annotations? Annotations are ideal to store metadata which seems to be the case in these specific "capability labels".

@uzi-akash
Copy link

This was moved to: #233

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants