Open DxxtchGamer opened 9 months ago
Yeah my bad I forgot to change that part. You can edit the notebook in colab and change the quantization name from awq
to gptq
. I'll fix this later so we can have an option for selecting GPTQ/AWQ/SqueezeLLM.
Yeah my bad I forgot to change that part. You can edit the notebook in colab and change the quantization name from
awq
togptq
. I'll fix this later so we can have an option for selecting GPTQ/AWQ/SqueezeLLM.
Yeah, I tried that, but the output isn't what it should be, it's also giving me a lot of warnings, and the output isn't what it used to be. But I can wait until later so you're done fixing it, I just thought I should let you know.
These are the repeating warnings I get (I'm not sure what they mean):
WARNING 12-13 18:04:10 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:11 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:12 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:13 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:14 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:15 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:16 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:17 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:18 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:19 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:20 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:21 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:22 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:23 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:24 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:25 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:26 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:27 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:28 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:29 linear.py:306] Loading a weight without `output_dim` attribute in MergedColumnParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:29 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:29 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
WARNING 12-13 18:04:29 linear.py:429] Loading a weight without `output_dim` attribute in QKVParallelLinear, assume the weight is the same for all partitions.
Yeah that's normal for now, it'll work fine. I'm updating the notebook now. I'll also add a toggle to switch between Kobold and OpenAI API.
I've updated the notebook.
Hey, I've noticed that whenever I try to use GPTQ quantized models, it doesn't work. I've noticed it sees it as AWQ and not GPTQ, I don't know why that is because it worked fine yesterday. I tried to use "TheBloke/Toppy-M-7B-GPTQ", but it gives me this before it shuts down the notebook.