Closed yeyupiaoling closed 1 year ago
dim_a.width_:1024 != dim_b.height_:2048.
打印下模型结构吧,估计是某个参数错误,导致lstm后是concat还是add之类的问题。
------------------------------------------------------------------------------------------------------------------
Layer (type) Input Shape Output Shape Param #
==================================================================================================================
Conv2D-1 [[1, 1, 900, 161]] [1, 32, 449, 80] 320
GELU-1 [[1, 32, 449, 80]] [1, 32, 449, 80] 0
ConvBn-1 [[1, 1, 900, 161], [1]] [[1, 32, 449, 80], [1]] 0
Conv2D-2 [[1, 32, 449, 80]] [1, 32, 224, 39] 9,248
GELU-2 [[1, 32, 224, 39]] [1, 32, 224, 39] 0
ConvBn-2 [[1, 32, 449, 80], [1]] [[1, 32, 224, 39], [1]] 0
ConvStack-1 [[1, 161, 900], [1]] [[1, 224, 1248], [1]] 0
LSTM-1 [[1, 224, 1248], None, [1]] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 27,017,216
LayerNorm-1 [[1, 224, 2048]] [1, 224, 2048] 4,096
RNNForward-1 [[1, 224, 1248], [1], None] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 0
LSTM-2 [[1, 224, 2048], None, [1]] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 33,570,816
LayerNorm-2 [[1, 224, 2048]] [1, 224, 2048] 4,096
RNNForward-2 [[1, 224, 2048], [1], None] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 0
LSTM-3 [[1, 224, 2048], None, [1]] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 33,570,816
LayerNorm-3 [[1, 224, 2048]] [1, 224, 2048] 4,096
RNNForward-3 [[1, 224, 2048], [1], None] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 0
LSTM-4 [[1, 224, 2048], None, [1]] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 33,570,816
LayerNorm-4 [[1, 224, 2048]] [1, 224, 2048] 4,096
RNNForward-4 [[1, 224, 2048], [1], None] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 0
LSTM-5 [[1, 224, 2048], None, [1]] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 33,570,816
LayerNorm-5 [[1, 224, 2048]] [1, 224, 2048] 4,096
RNNForward-5 [[1, 224, 2048], [1], None] [[1, 224, 2048], [[1, 1, 2048], [1, 1, 2048]]] 0
RNNStack-1 [[1, 224, 1248], [1], None, None] [[1, 224, 2048], [5, 1, 2048], [5, 1, 2048]] 0
Linear-1 [[1, 224, 2048]] [1, 224, 6436] 13,187,364
==================================================================================================================
Total params: 174,517,892
Trainable params: 174,517,892
Non-trainable params: 0
------------------------------------------------------------------------------------------------------------------
Input size (MB): 0.55
Forward/backward pass size (MB): 102.31
Params size (MB): 665.73
Estimated Total Size (MB): 768.59
------------------------------------------------------------------------------------------------------------------
模型是GRU的。关注下代码lstm/gru后面的调用是否符合预期。可能是配置文件问题,或者模型不匹配。
刚才打印是错的,重新打印了,用的是LSTM
是我输入错误了paddle.nn.LSTM()
中的initial_states
维度输入不正确。修改好之后,GPU和CPU都能用了
bug描述 Describe the Bug
使用GPU推理是正常的,但是使用CPU推理就报下次错误了。
代码片段
其他补充信息 Additional Supplementary Information
No response