-
I get the tensor size error at the end. The command I am running is this:
`
python eval_retrieval.py --bert_model bert-base-uncased --from_pretrained save/RetrievalFlickr30k_bert_base_6layer_6co…
-
I've been bouncing around various StableDiffusion optimisations over the last couple of weeks, and figured I would link out to some of the ones I remember in hopes that they can be explored/added into the …
-
(python3-venv) aarch64_sh ~> cm run script --tags=run-mlperf,inference,_find-performance,_full,_r4.1 --model=dlrm_v2-99 --implementation=reference --framework=pytorch --category=datacenter…
-
Hi there,
I was running `cuBERT_benchmark.py` and noticed that CuBERT does not utilize all threads when using multiple CPUs (even when setting MKL_NUM_THREADS and OMP_NUM_THREADS). It seems that o…
-
(mlperf) susie.sun@yizhu-R5300-G5:~$ cmr "run mlperf inference generate-run-cmds _submission" --quiet --submitter="MLCommons" --hw_name=default --model=resnet50 --implementation=reference --backend=tf…
-
### Feature request
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory efficient inference and training: https://github.com/Dao-AILab/flash-attentio…
-
@kaushaltrivedi I am getting a "Cannot allocate memory" error.
**Error Logs:**
`06/23/2020 03:00:43 - INFO - root - Num examples = 1000
06/23/2020 03:00:43 - INFO - root - Num Epochs = 6
06/23/2020 03:0…
-
This is not a recent regression, and perhaps it won't be fixed for that reason, but I thought I'd file it anyway.
I maintain Go bindings for this library, and by sheer luck I had benchmarks when I …
-
Hi, when trying to run this on my machine (MacBook Pro M2), everything works fine. However, when trying to run inside Docker I get a seg fault when calling `extract_keywords`:
```
>>> from keybert…
-
Hi Louis,
Thanks for your contribution.
Can you tell me how to modify the code to fix this error?
Thank you so much!
![image](https://user-images.githubusercontent.com/21167278/112…