Open FlotingDream opened 4 months ago
VRWKV6 fp16? Mixed Precision Training?
Hi, currently we cast all data to fp32 in VRWKV, like this: link. You can use the same simple approach for VRWKV6 as well.
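For illustration, here is a minimal sketch of that cast-to-fp32 idea, assuming PyTorch. The names `run_in_fp32` and `toy_wkv` are hypothetical and are not taken from the VRWKV code; the linked implementation may well differ. The wrapper upcasts half-precision inputs, runs the op in fp32, and casts the result back so the rest of a mixed-precision model still sees fp16.

```python
import torch


def run_in_fp32(op, *tensors):
    """Upcast floating-point inputs to fp32, run `op`, cast the result back."""
    out_dtype = tensors[0].dtype  # remember the caller's dtype (e.g. fp16)
    upcast = [t.float() if t.is_floating_point() else t for t in tensors]
    out = op(*upcast)             # the op itself only ever sees fp32
    return out.to(out_dtype)


def toy_wkv(r, k, v):
    # Hypothetical stand-in for the real WKV kernel, NOT the actual operator:
    # a running sum over the sequence dimension, gated by r.
    assert r.dtype == torch.float32
    return r * torch.cumsum(k * v, dim=1)


# fp16 inputs of shape (batch, seq_len, channels), as under mixed precision
r, k, v = (torch.randn(2, 8, 4).half() for _ in range(3))
out = run_in_fp32(toy_wkv, r, k, v)
```

The trade-off is extra memory and bandwidth for the fp32 copies, in exchange for avoiding fp16 overflow or precision loss inside the recurrence.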