-
Hello, I have encountered an issue where the sum of the topic-word distribution also does not sum to 1. I am running version 0.12.1, with hyperparameters `tw=TermWeight.PMI,` `gamma=1,` `alpha=0.1,` …
-
https://codemirror.net/6/ – This is important for me because the Harlowe mode has always been a bit sluggish for big passages, and while I'm doing some optimisation on that front right now, I think CM…
-
### What happened?
I've already quantized a 2b variant of this model, and one of its instruct fine tune, on a subset of the same data (the first 1000 samples are the same in the same order -- the e…
-
Hi, I'm confused code details in alpha update.
for g, ig in zip(dalpha, implicit_grads):
g.data.sub_(eta, ig.data)
for v, g in zip(self.model.arch_parameters(), dalpha):
if…
-
```
What steps will reproduce the problem?
1. Use the following urls:
>> target ='http://www.usgodae.org:8081/thredds/dodsC/testAll/2004050300_eta_211.nc ';
>> target = 'http://192.168.31.13/thred…
-
```
What steps will reproduce the problem?
1. Use the following urls:
>> target ='http://www.usgodae.org:8081/thredds/dodsC/testAll/2004050300_eta_211.nc ';
>> target = 'http://192.168.31.13/thred…
-
Hi, I finetuned the model using `[finetune_realesrgan_x4plus_pairdata.yml]` for about 5000 iterations and some other settings like:
```num_worker_per_gpu: 2
batch_size_per_gpu: 1
```
setting lea…
-
**The backbone is r2p1d, and the results on the test list you provided after training are as follows (decor=0.4.0, with added TEST SCALE=112, the pretrain checkpoint you provided):**
[09/08 15:44:5…
-
Hi !
When I try to run run_structsearch with tmdd, I get this error message. I tried with others models / dataset and I get almost the same message (ofvs.index). Can you tell me what that means plea…
-
使用exo+mlx多台mac运行llama-3.1-70b,返现量化时报错
报错的位置:
quantized.py文件
代码:
def call(self, x):
s = x.shape
x = x.flatten()
out = mx.dequantize(
self["weight"][x],
scales=self["scales"][x],
biases=self["…