-
I'm running this code (only relevant part) :
```python
xlnet_config = xlnet.XLNetConfig(json_path=FLAGS.model_config_path)
run_config = xlnet.create_run_config(is_training, True, FLAGS)
pr…
-
In OPUS 4 gibt es die Möglichkeit zwischen verschiedenen Layouts umzuschalten. Standardmäßig, wird das **opus4**-Layout verwendet. Die Layouts **default** und **plain** werden nicht verwendet und wurd…
-
Hi nifto2dicom
Thanks for your work and effort on this convertor, it is great.
As I tried to convert the MR nifti files to dicom, it failed, as giving back the error of Unknown modality.
I added t…
-
Hey
I am about to use the [SupConLoss](https://github.com/HobbitLong/SupContrast/blob/331aab5921c1def3395918c05b214320bef54815/losses.py#L11) for my specific application:
As I am embedding some gr…
-
Right now the prediction script is doing with bsz=1
https://github.com/saikrishnarallabandi/falkon/blob/master/tasks/speech/antispoofing/baseline/local/get_predictions.py
When parallelized, file…
-
I was trying to port https://github.com/iovisor/bcc/blob/master/examples/networking/http_filter/http-parse-simple.c using libbpf
I thought the following code should capture TCP packet, but it didn'…
-
在看文档时说训练sft模型时 需要将该 token 指定为< eom >,但是在哪里改呢?
![图片](https://github.com/OpenLMLab/MOSS/assets/6648329/7b0ec05c-cc79-457d-b443-f976c1a8bad3)
训练
num_machines=4
num_processes=$((num_machines * 8))
…
-
https://blb.ibs-bw.de/aDISWeb/app?service=direct/0/Home/$DirectLink&sp=S127.0.0.1:23002
https://bsz.ibs-bw.de/aDISWeb/app?service=direct/0/Home/$DirectLink&sp=S127.0.0.1:23212
-
I have been experimenting with RWKV v4 and v4neo but somehow it is using much more memory (about 2x) than my LM that uses Flash Attention. Not sure what I am doing wrong. Is this expected?
-
When I sample the model with command:
python visualize.py -seq_length 1000 -cuda -load_model mlstm_ns.pt -temperature 0.4 -neuron 2388 -init "I couldn't figure out"
and the process returned an e…