最近更新时间:2023-07-31 17:01:21
当所选GPU实例搭载的GPU卡支持NvLink&NvSwitch,需额外安装与驱动版本对应的nvidia-fabricmanager服务使GPU卡间能够互联。
从nvidia官网选择对应镜像、系统架构、驱动版本的安装包进行下载安装即可,包含rpm和deb格式。
下载地址:https://developer.download.nvidia.cn/compute/cuda/repos/
比如rpm的nvidia-fabric-manager-xxxx,nvidia-fabric-manager-devel-xxxx
version=470.103.01 #已经安装的驱动版本
yum -y install yum-utils
yum-config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel7/x86_64/cuda-rhel7.repo
yum install -y nvidia-fabric-manager-${version}-1
version=470.103.01
main_version=$(echo $version | awk -F '.' '{print $1}')
apt-get update
apt-get -y install nvidia-fabricmanager-${main_version}=${version}-*
systemctl start nvidia-fabricmanager
systemctl status nvidia-fabricmanager
systemctl enable nvidia-fabricmanager
纯净模式