-
What are the hyperparameters for replicating SeaLLMs/SeaLLM-7B-v1 and SeaLLMs/SeaLLM-7B-v2 ?
-
Hello,
I am trying to train the diffusion model on cifar-10 dataset
but as mentioned on the readme, i need to setup the hyperparameters, where can i found the? or how to set them ?
-
Dear Dr. Weikang Wan and Team,
I recently came across your fascinating work on the LOTUS algorithm, as detailed in your paper "LOTUS: Continual Imitation Learning for Robot Manipulation Through Un…
-
Hello @hengruizhang98,
I am getting large errors for column wise density estimation when evaluating TabDDPM on some datasets (Beijing and Magic), did you use different hyperparameters for this model …
-
Really nice work! Could the author also provide the hyperparameters used for training the phi-3 mini backbone (i.e., pretrain.sh and finetune_lora.sh)?
In addition, during my training process, I n…
-
**
If your issue is a usage question, submit it here instead:
- The imbalanced learn gitter: https://gitter.im/scikit-learn-contrib/imbalanced-learn
**
If we want to see if oversampling has an eff…
-
Hello, I wonder how you set hyperparameters, such as learning rate, batch size, and the number of epochs. We just obtained a 68 score on the Crame-D dataset using the command in the readme.
python…
-
Thanks for the great work. One thing I'm curious about is that does it actually work well on SFT for LLMs? It is not covered in the paper, as well. I tried the following parameters on a 2B-sized model…
-
Trying to debug larger width environments (7 currently).
Things to try:
1. Different metric (Average Q-value from 2015 paper https://arxiv.org/pdf/1312.5602.pdf).
```
5.1 Training and Sta…
-
https://github.com/pluskal-lab/MassSpecGym/blob/2d16eb959a023a15cd0e485888415465789df20e/massspecgym/models/base.py#L12-L14
Can we just use `self.save_hyperparameters()` then reference these variab…