AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
4.51k
stars
359
forks
source link
error during inferencing: Error: Constant embeddings_token_embedding_weight was not set! Set the value with set_constant. #968
Hi team,
Thank you for your nice work! I met the error during inference. Stable-diffusion 1.5 in sample file.
[11:48:42] model_container.cu:87: Init AITemplate Runtime with 1 concurrency
[11:48:42] model_container.cu:69: Device Runtime Version: 12010; Driver Version: 12000
[11:48:42] model_container.cu:83: Hardware accelerator device properties:
Device:
ASCII string identifying device: NVIDIA GeForce RTX 4090
Major compute capability: 8
Minor compute capability: 9
UUID: GPU-c8774ba0-19af-65d7-012f-30748af968cb
Unique identifier for a group of devices on the same multi-GPU board: 0
PCI bus ID of the device: 164
PCI device ID of the device: 0
PCI domain ID of the device: 0
Memory limits:
Constant memory available on device in bytes: 65536
Global memory available on device in bytes: 25393692672
Size of L2 cache in bytes: 75497472
Shared memory available per block in bytes: 49152
Shared memory available per multiprocessor in bytes: 102400
[11:48:42] model_container.cu:87: Init AITemplate Runtime with 1 concurrency
[11:48:43] model_interface.cu:200: Error: Constant embeddings_token_embedding_weight was not set! Set the value with set_constant.
Traceback (most recent call last):
python3 scripts/demo.py --prompt "Mountain Rainier in van Gogh's world"
Hi team, Thank you for your nice work! I met the error during inference. Stable-diffusion 1.5 in sample file.