Open xuesu opened 1 month ago
paddle单线程编译make -j1一直都有问题,编译不过
我们本地也在复现
非常感谢!贵司是我看到的回复最即时的类似库!其实-j50也是报一样的错误
有点不好意思,但是TensorArgDef OutputAt(size_t idx) { return args_def().input_defs()[idx]; }
这是刻意这么写的么?我看其他到的头文件没有这么写呀。。。https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/phi/capi/include/wrapper_base.h ln553
出问题的kernel是
PD_REGISTER_KERNEL(eigvalsh, // cuda_only
GPU,
ALL_LAYOUT,
phi::EigvalshKernel,
float,
double,
phi::dtype::complex<float>,
phi::dtype::complex<double>) {
kernel->InputAt(1).SetDataType(phi::dtype::ToReal(kernel_key.dtype()));
这里input只有1个,但是却要求第1个(也就是第二个)的input datatype为REAL。那么这里到底是
forward : eigvalsh (Tensor x, str uplo = "L", bool is_test = false) -> Tensor(eigenvalues), Tensor(eigenvectors)
kernel->InputAt(1).SetDataType(phi::dtype::ToReal(kernel_key.dtype()));
的>python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu123/
>python -m unittest test_eigvalsh_op.py
Illegal instruction (core dumped)
我关掉了-DWITH_TESTING,错误不变。另外我无法用-DWITH_TESTING来编译该库。 我使用了kernel->InputAt(0).SetDataType(phi::dtype::ToReal(kernel_key.dtype()));编译成功
问题描述 Issue Description
🔎 Search before asking
🐛 Bug (问题描述)
I got the following error:
I added some print logs at :
I got:
so the idx(1) is equal to size()(1).
I added traceback at:
I got:
I wonder if this is because all source files under the folder
paddle/fluid/eager/api/generated/fluid_generated/forwards/
(e.g.:dygraph_forward_functions3.cc
), are empty, but this function(https://github.com/jiaoxuewu/PaddleBox/blob/7552ba29f6b729f3192b4747283770b254433c8b/paddle/fluid/eager/auto_code_generator/generate_file_structures.py#L98) suggests that those files should be empty: GenerateFileStructureForIntermediateDygraph....Sorry for writing in English...
🌰 Minimal Reproducible Example (最小可复现问题的Demo)
or
版本&环境信息 Version & Environment Information
🏃♂️ Environment (运行环境)
OS: ubuntu 22.04 GCC version: (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0 Clang version: 17.0.6 (https://github.com/llvm/llvm-project.git 6009708b4367171ccdbf4b5905cb6a803753fe18) CMake version: version 3.22.1 Libc version: glibc 2.35 Python version: 3.10.15
CUDA version: 12.4.131 Build cuda_12.4.r12.4/compiler.34097967_0 cuDNN version: 9.4.0 Nvidia driver version: 560.35.03 Nvidia driver List: GPU 0: NVIDIA GeForce RTX 4090 GCC: gcc 11 Clang: 17.0.6 (tried both GCC and Clang) Memory: 64GB Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Address sizes: 39 bits physical, 48 bits virtual Byte Order: Little Endian CPU(s): 32 On-line CPU(s) list: 0-31 Vendor ID: GenuineIntel Model name: Intel(R) Core(TM) i9-14900KF CPU family: 6 Model: 183 Thread(s) per core: 2 Core(s) per socket: 24 Socket(s): 1 Stepping: 1 CPU max MHz: 6000.0000 CPU min MHz: 800.0000 BogoMIPS: 6374.40