-
1.In the clustering stage, are unknown classes only divided into one class? if so, for different kinds of unknown classes(some of which may have different features), how to reduce the distance between…
-
Here you will find a long list of the articles thats need to be coded. They are divided into sections, one for each coder (TR = Timo, MR = Melanie, JC = Joseph, AB = Agata, LK = Liam). Each item in th…
-
Self-Supervised Learning of Pretext-Invariant Representations PIRL
https://arxiv.org/abs/1912.01991
-
I've implemented an LSTM encoder decoder architecture to generate paraphrases. In the decoder output, I'm using a `Dense(vocab_size)` layer with softmax activation and my loss is categorical cross_en…
-
## Environment info
- `transformers` version: 4.5.0.dev0
- Platform: Linux-5.4.0-54-generic-x86_64-with-glibc2.29
- Python version: 3.8.5
- PyTorch version (GPU?): 1.8.0+cu111
- Tensorflow vers…
-
Hi, thanks for your sharing.
What is the meaning of the batch?
Thanks
-
Hi,
I'm thinking of performing a pretraining with KITTI, as I have limited computing resources, and using Waymo is out of my reach.
Do you have any direction you can point me at, as to where in…
-
I'm dealing with a weird scenario where I have two different audio denoising models and I want to train them in two specific procedures. I like how the System() class has a `common_step` function, so …
-
Hi, the default setting in the code is different from the paper, lr for example. So which setting should I apply? I tried the default setting in the code. And I get the fine-tune-best_prec1 result aro…
-
Dear Community, I am using DALI for my postdoctoral project.
I am running on multiple GPUs with MPI.
I have 24 nodes with 8 GPUs each but for the moment I am just using one node since I want to …