-
I'm training on mask2former_beit_adapter_large_896_80k_ms model with custom dataset and I have 2 issues that I'm having hard time to figure them out.
1. I tried running train.py but after each epoc…
-
I'm trying to reproduce the reported latency numbers, but I get the following error when running benchmark_inference.py:
```
(base) ltewari@ipp1-3101:~/scratch/foundation-model-stack$ CUDA_VISIBLE…
-
**Description:**
I have implemented a custom model (`PyTorchMLPRegressor`) in my project( Multi Target Regression Model ), and I'm facing an issue with the `predict` function. It seems that when I ru…
-
https://arxiv.org/abs/1812.00073
https://github.com/tensorflow/ranking
-
### 🐛 Describe the bug
Basically I have 2 T4 GPU's and I am using kaggle when i try and load it in It shows some outputs that were put there for debugging but nothing else
## Input
```
import to…
-
I had a strange problem using Flux RNN, my training data contains myX:one-hot vector, and myY:a number. The training data shown below worked very well using feedforward network(epoch=20,R2=0.9), but v…
-
`embed.py` script calculates embeddings of inputs in `sample-metadata.json` and stores them in output file `output.jsonl`
```
python scripts/embed.py \
--ids data/sample.ids --metadata data/samp…
-
I got this run-time error `RuntimeError: mat1 and mat2 shapes cannot be multiplied (288x1600 and 6400x1600)` while trying to execute WISE method on gpt2-xl model. Could you give me some suggestions to…
-
## どんなもの
### 論文
[Attention Is All You Need](https://arxiv.org/pdf/1706.03762.pdf)
### 著者・所属機関
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kais…
-
### 🐛 Describe the bug
So I've been trying to compile some auto-regressive transformer models and they consist of for loops within the model architecture. I'm posting a simplified version of the arch…