pytorch解决两个GPU同时训练问题。

2022-10-22 14:45:43

解决两个GPU同时训练问题。

使用场景

我有两个GPU卡。我希望我两个GPU能并行运行两个网络模型。

代码

错误代码1：

#对于0号GPU
os.environ['CUDA_VISIBLE_DEVICES']='0,1'
device= torch.device("cuda:0"if torch.cuda.is_available()else"cpu")#对于1号GPU
os.environ['CUDA_VISIBLE_DEVICES']='0,1'
device= torch.device("cuda:1"if torch.cuda.is_available()else"cpu")

0号GPU不报错，1号GPU报错。错误如下
RuntimeError: Expected tensor for argument #1 ‘input’ to have the same device as tensor for argument #2 ‘weight’; but device 0 does not equal 1 (while checking arguments for cudnn_convolution)

错误代码2：

#对于0号GPU
os.environ['CUDA_VISIBLE_DEVICES']='0'
device= torch.device("cuda:0"if torch.cuda.is_available()else"cpu")#对于1号GPU
os.environ['CUDA_VISIBLE_DEVICES']='1'
device= torch.device("cuda:1"if torch.cuda.is_available()else"cpu")

0号GPU不报错，1号GPU报错。错误如下
CUDA: invalid device ordinal
正确代码如下：

#对于0号GPU
os.environ['CUDA_VISIBLE_DEVICES']='0'
device= torch.device("cuda:0"if torch.cuda.is_available()else"cpu")#对于1号GPU
os.environ['CUDA_VISIBLE_DEVICES']='1'
device= torch.device("cuda:0"if torch.cuda.is_available()else"cpu")

作者：werdery
原文链接：https://blog.csdn.net/werdery/article/details/106154294
更新时间：2022-10-22 14:45:43

相关文章

SpringBoot项目配置ssl证书，实现https协议
一、阿里云购买cas证书、配置证书、下载证书2.下载证书3.证书二、项目打包方式一： jar包1.下载的证书放
2022-07-18

css 绝对定位和相对定位
绝对定位绝对定位指的是通过规定HTML元素在水平和垂直方向上的位置来固定元素，基于绝对定位的元素不占据空间。绝
2022-07-18

spring boot的多线程
spring boot默认是单线程的，当有多个定时需要跑的时候，他会等到上一个定时跑完再跑下一个定时，而下一个
2022-07-18

MySQL远程连接的设置
与SQL Server类似，MySQL在需要远程操纵其他电脑时，也需要对其做远程连接的相应设置，具体操作如下。
2022-07-18

随机文章

Spring案例数据源对象管理及加载properties文件
spring中DruidDataSource和ComboPooledDataSource的资源配置管理，及加载
2022-06-20

springboot系类代码：cxf-spring-boot-starter-jax
Apache CXF = Celtix + XFire，开始叫 Apache CeltiXfire，后来更名为
2022-06-20

linux系统下安装jupyterlab及.py格式转换
linux系统下安装有python3,直接用pip可以安装jupyterlab.sudo apt instal
2022-06-20

判断两个有序数组中是否存在相同的数字（Python）
判断两个数组中是否存在相同的数字，两个已经排好序的数组，判断这两个数组中是否存在相同的数字？要求时间复杂度越低
2022-06-20

文章导航