Open sleepwalker2017 opened 5 months ago
Yeah, I'm running into very similar issues. I haven't been able to get any CUTLASS code to build. I think we are getting something simple wrong; I am rusty with the C++ build process.
I think the code in quickstart.md is out of date. DefaultEpilogue expects another template argument, EpilogueSchedule_, but when I add it as cutlass::epilogue::collective::EpilogueScheduleAuto and include its header file, cutlass/epilogue/collective/collective_builder.hpp, the build fails with a different error:
mixed_gemm.cu(110): error: no instance of overloaded function "cutlass::gemm::device::GemmUniversalAdapter<GemmKernel_, std::enable_if_t<cutlass::gemm::detail::IsCutlass3GemmKernel<GemmKernel_, void>::value, void>>::operator() [with GemmKernel_=cutlass::gemm::kernel::GemmUniversal<cute::tuple<int32_t, int32_t, int32_t>, cutlass::gemm::collective::CollectiveMma<cutlass::gemm::MainloopSm90TmaGmmaWarpSpecialized<7, cute::tuple<cute::_1, cute::_2, cute::_1>, cutlass::gemm::KernelTmaWarpSpecializedCooperative>, cute::tuple<cute::_128, cute::_128, cute::_64>, cutlass::half_t, cute::tuple<int64_t, cute::C<1>, int64_t>, cutlass::half_t, cute::tuple<int64_t, cute::C<1>, int64_t>, cute::TiledMMA<cute::MMA_Atom<cute::SM90::GMMA::MMA_64x128x16_F32F16F16_SS<cute::SM90::GMMA::Major::K, cute::SM90::GMMA::Major::K, cute::SM90::GMMA::ScaleIn::One, cute::SM90::GMMA::ScaleIn::One>>, cute::Layout<cute::tuple<cute::_2, cute::_1, cute::_1>, cute::tuple<cute::_1, cute::_0, cute::_0>>, cute::tuple<cute::Underscore, cute::Underscore, cute::Underscore>>, cute::SM90_TMA_LOAD_MULTICAST, cute::ComposedLayout<cute::Swizzle<3, 4, 3>, cute::smem_ptr_flag_bits<16>, cute::Layout<cute::tuple<cute::_8, cute::_64>, cute::tuple<cute::_64, cute::_1>>>, void, cute::identity, cute::SM90_TMA_LOAD, cute::ComposedLayout<cute::Swizzle<3, 4, 3>, cute::smem_ptr_flag_bits<16>, cute::Layout<cute::tuple<cute::_8, cute::_64>, cute::tuple<cute::_64, cute::_1>>>, void, cute::identity>, cutlass::epilogue::collective::DefaultEpilogue<cute::tuple<cute::C<1>, int64_t, int64_t>, cute::tuple<cute::C<1>, int64_t, int64_t>, cutlass::epilogue::thread::LinearCombination<cutlass::half_t, 1, float, float, cutlass::epilogue::thread::ScaleType::Default, cutlass::FloatRoundStyle::round_to_nearest, cutlass::half_t>, cutlass::epilogue::collective::EpilogueScheduleAuto>, void, void>]" matches the argument list
argument types are: ({...})
object type is: Gemm
status = gemm_op({
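One thing the error text above does show is that the instantiated kernel is built from Hopper-specific types (MainloopSm90TmaGmmaWarpSpecialized, SM90_TMA_LOAD, SM90::GMMA), while the build command reported in this thread targets sm_80. This is separate from the operator() argument-list mismatch and does not explain it, but it may be worth checking before debugging further. Below is a minimal sketch of that check. The struct and function names are hypothetical, not part of CUTLASS or the CUDA toolkit, and the rule that SM90 kernels need a compute capability 9.0 device is stated here as an assumption.

```cpp
// Hypothetical helper types; nothing here comes from CUTLASS or the CUDA toolkit.
struct ComputeCapability {
  int cc_major;
  int cc_minor;
};

// Numeric suffix of the matching nvcc target, e.g. {8, 0} -> 80 for -arch=sm_80.
constexpr int sm_number(ComputeCapability cc) {
  return cc.cc_major * 10 + cc.cc_minor;
}

// Assumption: kernels whose types mention Sm90 / SM90_TMA_LOAD / SM90::GMMA
// can only run on a compute capability 9.0 device.
constexpr bool can_run_sm90_kernels(ComputeCapability cc) {
  return sm_number(cc) == 90;
}

constexpr ComputeCapability kA30{8, 0};   // A30 is an Ampere part (compute capability 8.0)
constexpr ComputeCapability kH100{9, 0};  // H100 is a Hopper part (compute capability 9.0)

// Compile-time checks: the A30 from this thread maps to sm_80 and fails the SM90 test.
static_assert(sm_number(kA30) == 80, "A30 should map to sm_80");
static_assert(!can_run_sm90_kernels(kA30), "A30 is not an SM90 device");
static_assert(can_run_sm90_kernels(kH100), "H100 is an SM90 device");
```

In a real program the two numbers would come from the major and minor fields that the CUDA runtime reports for the device. They are hard-coded here so the sketch compiles with a plain C++11 compiler.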
I ran the example from the quick start guide.
My GPU is an A30, and the command is
nvcc 01_gemm_3.0.cu -arch=sm_80
The compiler reports errors. Here is my code: