Open amazingyyc opened 4 months ago
This issue has been labeled inactive-30d
due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d
if there is no activity in the next 60 days.
This issue has been labeled inactive-90d
due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.
I want todo a function like
D = func(A x B) x C
,Want do 3 operator: a matrix multiply follow a function than do another matrix multiply in one kernel. I have 2 idea.
For 1 it's easy to understand. Does it cost because write func(T) into shared mem and read again when T x C? For 2 how can I make sure func(T) in register is need by T x C?