gradient-penalty Search Results

1000+ results
for gradient-penalty


lucidrains/gigagan-pytorch #46

Gradient Penalty is very high in the start

Hi! I was running a few experiments and noticed that GP is extremely high in the first few hundred steps: GP > 60000, then gradually going down to around GP = 20. Is this normal behaviour? In my previo…

inspirit updated 1 year ago
10
isantesteban/snug #11

Training pipeline?

Hi, regarding training: I have a deformed and skinned garment from the snug model, and I fixed collisions as post-processing on the predicted garment. Then I am using loss functions to improve learning. So, I want …

opendeeple updated 9 months ago
19
neural-structured-additive-learning/safareg #3

Random intercepts with `fac_processor`

Hi! When comparing the estimations of random intercepts in a simple simulation between `mgcv` and `deepregression`, I obtain different results. The results with `deepregression` tend to be more sh…

vhmedina updated 11 months ago
3
primepake/wav2lip_288x288 #29

Percep: 0.0 | Fake: 100.0, Real: 0.0

@primepake Hello, thanks for your nice work. I recently encountered some difficulties training on my own dataset (**following your data preparation suggestions**) using your shared code. Whi…

aishoot updated 8 months ago
23
microsoft/LightGBM #4074

Dask tests randomly fail with socket error code 104

``` [LightGBM] [Fatal] Socket send error, code: 104 distributed.worker - WARNING - Compute Failed ``` Full logs: ``` 2021-03-15T22:41:00.2549100Z ============================= test session st…

StrikerRUS updated 3 years ago
15
modelscope/ms-swift #177

chatglm3-6b: the acc metric during sft.sh differs greatly from the acc after infer.sh; what could be the cause?

### Scenario: relevance recognition ### Sample example {"instruction": "query：苹果手机\n title：iphone 6s，99新 \n请判断上述query和title是否相关？", "input": "", "output": "是"} ### sft.sh parameters nproc_per_node=1 base_path="xxx" train_data="xxx" v…

kaer1990 updated 10 months ago
1
arcee-ai/mergekit #198

Idea: Downscaling the K and/or Q matrices for repeated layer…

Has anyone tried downscaling the K and/or Q matrices for repeated layers in franken-merges? This should act like changing the temperature of the softmax and effectively smooth the distribution: **H…

jukofyork updated 2 months ago
63
vitalwarley/research #50

KFC: Kinship Verification with Fair Contrastive Loss and Mul…

- Code: https://github.com/garynlfd/kfc - Paper: https://arxiv.org/abs/2309.10641 I found it while looking for code for #49.

vitalwarley updated 9 months ago
6
hiyouga/LLaMA-Factory #2949

Expected is_sm80 || is_sm90 to be true, but got false

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction CUDA_VISIBLE_DEVICES=0,1 python src/train_bash.py can run successfully. deepspeed --num_gpus 2…

sparkfax updated 7 months ago
2
microsoft/LightGBM #5917

Forwarding values to custom loss function for semiparametric…

## Summary Forward columns of your dataset directly to a custom objective function. ## Motivation This is useful for semi-parametric models like poisson process regression where y | x, t ~ …

meh2135 updated 4 months ago
4
