Blas xGEMMBatched launch failed的出现原因 - 代码天地

Blas xGEMMBatched launch failed的出现原因

其他 2021-02-09 08:23:47 阅读次数: 0

如果你的cudatoolkit是9.x版本的，在执行两个很大的batch做matmal的时候，可能会报一个很奇怪的错误：

但是实际上你的显存是够的。为什么会报这样的错误呢？

这个问题困扰了我好几天。从网上查阅了很多资料，才发现是cublas的内部的一个保护机制。当你对两个batch做matmul的时候，如果batch的大小大于172800(大概是这么一个数)，就会报错。不太确定cudatoolkit10.x还有没有类似的问题，但是至少cudatoolkit9.x都会遇到这个问题，所以只能想办法把batch改小一点。

注意这里说的batch大小是说矩阵相乘的前面的维度的综合。比如你要做的操作是:

tf.matmul(tf.ones([512, 1024, 4, 2]), tf.ones([512, 1024, 2, 1]))

也会报错的。虽然后面真实相乘的矩阵很小，但是512*1024>172800了，所以会报错。

不信的话，你可以用下面的程序测试一下：

import tensorflow as tf
import numpy as np

config = tf.ConfigProto()
config.gpu_options.allow_growth=True
tf.Session(config=config).close()

def calc():
    N = 15 # works for N <= 14
    a = 64
    b = 16
    X = np.random.rand(N, 11520, b, 1).astype(np.float32)
    print(X.nbytes*1e-6, "MB")
    W = np.random.rand(N, 11520, a, b).astype(np.float32)
    print(W.nbytes*1e-6, "MB")
    X_ = tf.constant(X, name="X-constant", dtype=tf.float32)
    W_ = tf.constant(W, name="W-constant", dtype=tf.float32)

    # tf.matmul(W_, X_, name="mymatmul")
    return W_ @ X_

tf.reset_default_graph()
a = calc()
sess = tf.Session()
sess.run(tf.global_variables_initializer())
b = sess.run(a)
sess.close()
print(b.shape)

猜你喜欢

转载自blog.csdn.net/bonjourdeutsch/article/details/103334810

Blas xGEMMBatched launch failed的出现原因

Blas SGEMM launch failed

nternalError: Blas GEMM launch failed

TensorFlow: InternalError: Blas SGEMM launch failed

InternalError (see above for traceback): Blas GEMM launch failed

keras 或 tensorflow 调用GPU报错：Blas GEMM launch failed

TensorFlow2.0或2.1出现如下错误:Blas GEMM launch failed

180509 tensorflow-gpu显存分配与InternalError (see above for traceback): Blas SGEMM launch failed

tensorflow报错:tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed :

【已解决】“tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed“

InternalError: Blas GEMM launch failed : a.shape=(100, 784), b.shape=(784, 10), m=100, n=10...问题解决办法

报错：InternalError: Blas GEMM launch failed : a.shape=(32, 8), b.shape=(8, 30), m=32, n=30, k=8

failed to launch IBCocoaTouchImageCatalogTool

深坑cudnn PoolFoward launch failed

奇怪的cudnn PoolForward launch failed

ubuntu安装python库scipy出现的问题：no lapack/blas resources等

Launch

Xcode Internal launch error: process launch failed: Unspecified

运行GPU出现CUDA_ERROR_LAUNCH_FAILED

Apple - BLAS

BLAS 接口

Failed to install *.apk on device timeout Launch canceled

linux launch failed.binary not found Linux

Failed to execute child process “dbus-launch“

eclipse运行出现unable to launch 错误

ROS运行launch文件出现问题

iOS真机调试时“process launch failed: timed out waiting for app to launch”问题

failed to launch process in the docker container on mac m2, and return message “could not launch pro

在Eclipse中运行C++程序出现"Launch failed. Binary not foud"

spark集群安装出现master: failed to launch: nice -n 0错误

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)