facebookincubator / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
4.51k stars 359 forks source link

error during inferencing: Error: Constant embeddings_token_embedding_weight was not set! Set the value with set_constant. #968

Open mengbingrock opened 9 months ago

mengbingrock commented 9 months ago

Hi team, Thank you for your nice work! I met the error during inference. Stable-diffusion 1.5 in sample file.

[11:48:42] model_container.cu:87: Init AITemplate Runtime with 1 concurrency
[11:48:42] model_container.cu:69: Device Runtime Version: 12010; Driver Version: 12000
[11:48:42] model_container.cu:83: Hardware accelerator device properties:
  Device:
     ASCII string identifying device: NVIDIA GeForce RTX 4090
     Major compute capability: 8
     Minor compute capability: 9
     UUID: GPU-c8774ba0-19af-65d7-012f-30748af968cb
     Unique identifier for a group of devices on the same multi-GPU board: 0
     PCI bus ID of the device: 164
     PCI device ID of the device: 0
     PCI domain ID of the device: 0
  Memory limits:
     Constant memory available on device in bytes: 65536
     Global memory available on device in bytes: 25393692672
     Size of L2 cache in bytes: 75497472
     Shared memory available per block in bytes: 49152
     Shared memory available per multiprocessor in bytes: 102400
[11:48:42] model_container.cu:87: Init AITemplate Runtime with 1 concurrency
[11:48:43] model_interface.cu:200: Error: Constant embeddings_token_embedding_weight was not set! Set the value with set_constant.
Traceback (most recent call last):
python3 scripts/demo.py --prompt "Mountain Rainier in van Gogh's world"
kadeng commented 9 months ago

Thanks for your report, we will look into it.

Maxwell777777 commented 5 months ago

Hi, I encountered the same error with Stable-diffusion 1.5 and A10, has it been resolved now? @kadeng