Open
Description
When creating HyperPod clusters with 2 ml.g5.8xlarge instances, we are seeing errors trying to run containers with Pyxis + Enroot.
srun: unrecognized option '--container-image'
Cloudwatch does not show an error with the execution of the install_enroot_lifeycle.sh lifecycle script.
Reinstalling the enroot + pyxis on all the nodes solves this
Metadata
Metadata
Assignees
Labels
No labels