FlagOpen / FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.
Apache License 2.0
296 stars, 27 forks

Dev sort[SiliconFlow] #222

Open MARD1NO opened 3 weeks ago

MARD1NO commented 3 weeks ago

PR Category

Operator

Type of Change

New Feature

Description

Add sort kernel

Issue

Progress

Performance

image

When the number of elements to sort is too large, the bitonic merge causes register spilling.
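For reviewers unfamiliar with the pattern, here is a minimal plain-Python sketch of a bitonic sorting network. It is not the Triton kernel from this PR, and the function name `bitonic_sort` is hypothetical. Every compare-exchange stage touches the whole block being sorted, so a GPU kernel that keeps that block in registers will presumably see register pressure rise as the block grows, which matches the spilling noted above.

```python
def bitonic_sort(values):
    """Sort a power-of-two-length sequence in ascending order
    with a bitonic sorting network (CPU reference sketch only)."""
    n = len(values)
    if n == 0:
        return []
    if n & (n - 1) != 0:
        raise ValueError("length must be a power of two")

    data = list(values)
    k = 2  # size of the bitonic sequences being merged in this phase
    while k <= n:
        j = k // 2  # distance between compared elements in this stage
        while j >= 1:
            # One stage: every element is paired with exactly one partner.
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    # Blocks of size k alternate between ascending and
                    # descending order so neighbours form bitonic sequences.
                    ascending = (i & k) == 0
                    if (data[i] > data[partner]) == ascending:
                        data[i], data[partner] = data[partner], data[i]
            j //= 2
        k *= 2
    return data


if __name__ == "__main__":
    print(bitonic_sort([5, 3, 8, 1, 9, 2, 7, 4]))
```

On a GPU the inner `for i in range(n)` loop would run as one vectorized compare-exchange over the block. A common way to limit register use is to cap the per-program block size and merge larger inputs through shared or global memory; whether that suits this kernel is left to the author.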