-
I am getting this error:
```
llama.cpp: loading model from /Documents/Proj/delta/llama-2-7b-chat/ggml-model-q5_1.bin
error loading model: unrecognized tensor type 14
llama_init_from_file: failed…
-
PyTorch :1.5.0
model : resnet50
Background:
I currently have two test environments, one is a physical machine with 8 gpus, and the other is a 4 gpus Docker on this physical machine.
Run the 4 gp…
-
Hi there, thanks for the excellent lib. I have used a lot of the great features of the library and it's pretty cool.
However, I need to have item based recommendation ("like similar items"). In gener…
-
Hi FlexFlow team,
I used the methods mentioned in #1099 to test the latency(GPU: RTX-4090), but i get a confused result:
1)LLaMA-7B + 1个SSM(llama-160M), latency: 25.1 s
2)LLaMA-7B(without ssms), la…
-
在window机器上,paddle==1.5,在\PaddleCV\metric_learning,执行python eval.py --model=ResNet50 --batch_size=16 --pretrained_model=output,对模型进行eval,报错信息如下:
![image](https://user-images.githubusercontent.com/3785…
-
Hi,
I use this code and train with spanbert large config, but I only get 78.37 F1 with test dataset.
Do you train with epoch 20 and get F1 80%?
-
Hello,
I'm having an issue with the active learning script. I'm running:
`CUDA_VISIBLE_DEVICES=7 python active_learning_script.py --seed 1 --model resnet18 -sn -mod --al-type gmm`
and I have …
-
I used your code with AMP FP16 from pytorch 1.6. I achieved a good accuracy on validation set but showing the training accuracy is wrong. Do you have any suggestion to fix it? @xsacha @cavalleria . Th…
-
### Bug description
After the refactory moving loss and metrics to model.compile(), the loss and eval metrics are worse for a YouTubeDNN retrieval models. For the LastFM dataset for example (using th…
-
**model config***
encoder layers: 6
decoder layers: 6
hidden size: 1024
inner size: 4096
head number: 16
dim per head: 64
src vocab size: 40480
trg vocab size: 42720
is_post_ln: 0
no_scale_e…