Closed by nikitaromanoov 2 months ago
Hi nikitaromanoov,
Thanks for trying out n300 and reaching out.
The tt-buda-demos are primarily intended for, and extensively tested on, e75, e150 and n150 configurations. Architectural differences between the e75, e150, n150 and n300 cards cause different compile behaviour, which is what exposes this specific issue.
That said, although n300 is not an architecture that we extensively test at the moment, from the BUDA compiler stack's perspective there are no architectural limitations that would prevent models running on n150 from also running on n300 cards.
Given this, if you are just trying out the model demos on n300, please keep in mind that, at the moment, the demos target the n150 architecture and that n300 will be fully supported in a future release. If you are preparing this specific model for production use, we can definitely assist by providing a workaround and by giving the long-term compiler upgrade higher priority.
Since there have been no follow-ups after my update, I'll go ahead and close this issue. Please feel free to reach out if further assistance is needed.
Hello! We get the following error when running this code:
python tt-buda-demos/model_demos/nlp_demos/gpt2/gpt2_text_generation.py
Devices: n300
Also, is there any way to look at how much memory the model uses during startup?