tinygrad / 7900xtx

385 stars 26 forks source link

[PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature #4

Open warrenelrod opened 8 months ago

warrenelrod commented 8 months ago

https://lore.kernel.org/all/20240322180021.391680-1-shaoyun.liu@amd.com/T/

warrenelrod commented 8 months ago

https://lore.kernel.org/all/9871b5a1-8838-42a0-99c5-18bbb7f90c86@amd.com/T "Due to a CP interrupt bug, bad packet garbage exception codes are raised. Do a range check so that the debugger and runtime do not receive garbage codes. Update the user api to guard exception code type checking as well."

https://lore.kernel.org/all/e23a0e94-f31e-4711-ad3b-93caca180286@amd.com/T/ "Here I think we should cancel work items that are still in the work queue, not flush workqueue node->ih_wq. In this case the kfd functions have been terminated, there is no way to handle the left work items. That would make work queue flush never finish. I think it is the reason there are orphan kernel tasks.

After cancel left work items we can call destroy_workqueue."