Open llvm-beanz opened 11 months ago
Marking re-estimate this. Since this was originally refined, we have a different approach to implementing intrinsics and the way we've been estimating intrinsics has changed quite a bit.
I'll be working on this.
AC:
The intrinsic is implemented
Tests validate that it is lowered to LLVM-IR
Tests validate that control forms are not hoisted out of control flow ** Maps to the correct construct
[ ] Implement
GroupMemoryBarrierWithGroupSync
clang builtin,[ ] Link
GroupMemoryBarrierWithGroupSync
clang builtin withhlsl_intrinsics.h
[ ] Add sema checks for
GroupMemoryBarrierWithGroupSync
toCheckHLSLBuiltinFunctionCall
inSemaChecking.cpp
[ ] Add codegen for
GroupMemoryBarrierWithGroupSync
toEmitHLSLBuiltinExpr
inCGBuiltin.cpp
[ ] Add codegen tests to
clang/test/CodeGenHLSL/builtins/GroupMemoryBarrierWithGroupSync.hlsl
[ ] Add sema tests to
clang/test/SemaHLSL/BuiltIns/GroupMemoryBarrierWithGroupSync-errors.hlsl
[ ] Create the
int_dx_GroupMemoryBarrierWithGroupSync
intrinsic inIntrinsicsDirectX.td
[ ] Create the
DXILOpMapping
ofint_dx_GroupMemoryBarrierWithGroupSync
to80
inDXIL.td
[ ] Create the
GroupMemoryBarrierWithGroupSync.ll
andGroupMemoryBarrierWithGroupSync_errors.ll
tests inllvm/test/CodeGen/DirectX/
[ ] Create the
int_spv_GroupMemoryBarrierWithGroupSync
intrinsic inIntrinsicsSPIRV.td
[ ] In SPIRVInstructionSelector.cpp create the
GroupMemoryBarrierWithGroupSync
lowering and map it toint_spv_GroupMemoryBarrierWithGroupSync
inSPIRVInstructionSelector::selectIntrinsic
.[ ] Create SPIR-V backend test case in
llvm/test/CodeGen/SPIRV/hlsl-intrinsics/GroupMemoryBarrierWithGroupSync.ll
DirectX
SPIR-V
OpControlBarrier:
Description:
Wait for all active invocations within the specified Scope to reach the current point of execution.
All active invocations within Execution scope reach this point of execution before any invocation proceeds beyond it.
When Execution is Workgroup or larger, behavior is undefined unless all invocations within Execution execute the same dynamic instance of this instruction.
If Semantics is not None, this instruction also serves as an OpMemoryBarrier instruction, and also performs and adheres to the description and semantics of an OpMemoryBarrier instruction with the same Memory and Semantics operands. This allows atomically specifying both a control barrier and a memory barrier (that is, without needing two instructions). If Semantics is None, Memory is ignored.
Before version 1.3, it is only valid to use this instruction with TessellationControl, GLCompute, or Kernel execution models. There is no such restriction starting with version 1.3.
If used with the TessellationControl execution model, it also implicitly synchronizes the Output Storage Class: Writes to Output variables performed by any invocation executed prior to a OpControlBarrier are visible to any other invocation proceeding beyond that OpControlBarrier.
4
224
Scope <id>
Execution
Scope <id>
Memory
Memory Semantics <id>
Semantics
Test Case(s)
Example 1
HLSL:
Blocks execution of all threads in a group until all group shared accesses have been completed and all threads in the group have reached this call.
Syntax
Parameters
This function has no parameters.
Return value
This function does not return a value.
Remarks
The behavior of calls to this function that are within diverging branches of a thread group are undefined.
Minimum Shader Model
This function is supported in the following shader models.
This function is supported in the following types of shaders:
See also