-
In the original paper, the author said he used a randomly sampled one interaction for each user as the validation data, but in your code, you just used test data as the validation data, so I feel conf…
ghost updated
5 years ago
-
Heya!
Right now, ClearML automagically logs the configuration for models it recognizes, but not all parameters of interest are logged.
For example, we occasionally use a linear regression model in…
-
Hi, it's me again. I am trying to generate data for the combined equation (E3) and I am struggling to obtain trajectories without aliasing artifacts / divergence for parameters approaching the KdV equ…
-
Would really appreciate it if the successful hyper-parameters were provided, i.e., how the learning rate should be initialized and decayed. I re-implemented ResNet-50 following the example. But I am t…
-
Hi,
The optimization didn't work. So I just added the line :
"metrics_callback.on_validation_end( trainer)" after line 209 .
I also modified the class :
class MetricsCallback(Callback…
-
@WongKinYiu I have been trying to set right hyper parameters for yolov3-spp but for complete open-images dataset after 300-400 iterations server restarts I have previously trained with 3 of the class…
-
### Question
I want to train conformer model from scratch but after tweaking some hyperparameters (to make the model smaller) i.e. reducing the convolutional layers etc.
In the ASR finetuning tuto…
-
Hi,
What are the hyperparameters used for training the SOTA model on train+val? I tried the SOTA hyperparams in the repository with cv=3 instead of 0, but I'm getting poor results on the testset (~84…
-
**Describe the Issue**
llama.cpp exposes the options `--grp-attn-n` and `--grp-attn-w` for the _Group size_ and _Neighbor window size_ hyper parameters from the SelfExtend [paper](https://arxiv.org/…
jojje updated
1 month ago
-
This is a great work. However, when I try to reproduce results on the ImageNet dataset, there is a 1% accuracy gap between mine and that in your paper.
Would you mind providing the hyper-parameters…