-
What is the difference between your svd-logits and the aux_logits of the Inception model? For example, with Admix, using aux_logits raises the average black-box success rate by about 5% compared with not using it.
-
For some reason, `_ConvolutionVariational` uses a [boolean flag](https://github.com/tensorflow/probability/blob/master/tensorflow_probability/python/layers/conv_variational.py#L236) to avoid calling `…
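The flagged call in the original is truncated, so the following only sketches the general pattern being described: a boolean guard that makes a one-time setup step idempotent, so repeated calls do not redo it. All names here are illustrative, not from the TFP source.

```python
# Sketch of a boolean "already done" flag guarding a one-time setup step,
# the general pattern _ConvolutionVariational is described as using.
class Layer:
    def __init__(self):
        self._built = False
        self.build_count = 0    # only for demonstration

    def _build(self):
        self.build_count += 1   # expensive one-time setup would go here

    def __call__(self, x):
        if not self._built:     # the boolean flag skips repeated builds
            self._build()
            self._built = True
        return x

layer = Layer()
layer(1)
layer(2)                        # second call does not rebuild
```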
-
I found that in papers, the formula for MLP attention is usually described as below:
![image](https://user-images.githubusercontent.com/16586180/39976766-fd23c30e-5767-11e8-9a16-9d0238512c82.png)
…
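The screenshot is not reproduced here, but MLP (additive, Bahdanau-style) attention is usually written as score(s, h_i) = vᵀ tanh(W_s s + W_h h_i), followed by a softmax over source positions. A minimal numpy sketch under that assumption (all shapes and names are illustrative):

```python
import numpy as np

# MLP / additive attention sketch: score(s, h_i) = v^T tanh(W_s s + W_h h_i)
rng = np.random.default_rng(0)

d_h, d_s, d_a, T = 4, 4, 8, 5           # encoder dim, decoder dim, attention dim, source length
W_h = rng.standard_normal((d_a, d_h))   # projects encoder states
W_s = rng.standard_normal((d_a, d_s))   # projects the decoder query
v = rng.standard_normal(d_a)            # scoring vector

H = rng.standard_normal((T, d_h))       # encoder states h_1..h_T
s = rng.standard_normal(d_s)            # current decoder state

scores = np.tanh(H @ W_h.T + W_s @ s) @ v    # (T,) one score per source position
weights = np.exp(scores - scores.max())
weights /= weights.sum()                     # softmax over source positions
context = weights @ H                        # (d_h,) attention-weighted sum
```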
-
This is mostly just a checklist of the more important models we might want to support with fancy math. In theory, these are all supported automatically with broom (though we might want to have a gener…
-
I have run into a problem using the GLM function with the Binomial family. I used the code below to create an instance of the GLM:
logit_instance = sm.GLM(default_array, predictors_matrix, family=sm.fa…
-
When all inputs to entmax are -inf, it fails with
```
RuntimeError                              Traceback (most recent call last)
in
      1 from entmax import entmax15
      2 logits = torch…
```
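The same degeneracy can be seen without entmax at all: for any softmax-like mapping, a row of all -inf logits has a zero normalizer, so there is no valid distribution to return. A plain numpy illustration (entmax itself is not used here):

```python
import numpy as np

# Why all -inf logits are degenerate: after the usual max-shift,
# -inf - (-inf) is already NaN, and the normalizer is 0/NaN.
logits = np.full(4, -np.inf)
with np.errstate(invalid="ignore"):
    shifted = logits - logits.max()              # -inf - (-inf) -> nan
    probs = np.exp(shifted) / np.exp(shifted).sum()
# every entry of probs is NaN; there is no well-defined output distribution
```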
-
When I used your algorithm and parameters to train on both the WTH dataset and my own dataset, I found that the loss was very low in the first epoch, but increased sharply in the second epoch, and sub…
-
Mistral has a new finetuner repository where you can assign *weights* to specific messages, and those will be taken into account when the loss is calculated. I wanted to implement something similar fo…
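One plausible reading of "weights on specific messages" is that every token inherits the weight of the message containing it, and the cross-entropy becomes a weighted mean over tokens. A numpy sketch under that assumption (all names and shapes are illustrative, not Mistral's actual implementation):

```python
import numpy as np

# Per-message loss weighting sketch: each token's NLL is scaled by the
# weight of the message it belongs to; weight 0 masks a message entirely.
rng = np.random.default_rng(0)

V, T = 10, 6                               # vocab size, sequence length
logits = rng.standard_normal((T, V))
targets = rng.integers(0, V, size=T)
msg_ids = np.array([0, 0, 1, 1, 1, 2])     # which message each token is in
msg_weights = np.array([0.0, 1.0, 2.0])    # e.g. mask out message 0 entirely

log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
nll = -log_probs[np.arange(T), targets]    # per-token negative log-likelihood
w = msg_weights[msg_ids]                   # per-token weight from its message
loss = (w * nll).sum() / w.sum()           # weighted mean; w=0 tokens drop out
```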
-
After completing a batch inference, I discovered a bug in the attention weight computation. The attention mask was being added to the attention weights with an unsqueeze operation that was using the …
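The exact unsqueeze bug above is truncated, so the following only sketches the broadcast that has to hold: a padding mask of shape (B, T_k) must be expanded to (B, 1, 1, T_k) so it broadcasts against attention weights of shape (B, H, T_q, T_k); expanding on the wrong axis masks the wrong dimension or raises a shape error. All shapes here are illustrative.

```python
import numpy as np

# Additive attention-mask broadcast sketch: (B, T_k) -> (B, 1, 1, T_k)
B, H, T_q, T_k = 2, 4, 3, 5
weights = np.zeros((B, H, T_q, T_k))        # raw attention scores
pad = np.array([[0, 0, 0, 1, 1],
                [0, 0, 1, 1, 1]])           # 1 marks a padding position
mask = np.where(pad[:, None, None, :] == 1, -1e9, 0.0)   # (B, 1, 1, T_k)
masked = weights + mask                     # broadcasts to (B, H, T_q, T_k)
```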
-
In `categorical_crossentropy`, I suspect that this normalization line is not useful and that it leads to two unexpected behaviors:
https://github.com/keras-team/keras/blob/b80dd12da9c0bc3f569eca3455e77762cf2ee8e…
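Assuming the line in question is the usual "divide each row by its sum" step before taking -sum(target * log(output)), a small numpy sketch shows the kind of silent behavior change at stake: when predictions do not already sum to 1, the normalized and raw losses disagree.

```python
import numpy as np

# Sketch of categorical cross-entropy with and without row normalization.
def cce(target, output, normalize):
    if normalize:
        output = output / output.sum(axis=-1, keepdims=True)  # the suspect line
    output = np.clip(output, 1e-7, 1 - 1e-7)
    return -(target * np.log(output)).sum(axis=-1)

target = np.array([[0.0, 1.0, 0.0]])
preds = np.array([[0.2, 0.5, 0.1]])   # rows sum to 0.8, not 1

raw = cce(target, preds, normalize=False)    # -log(0.5)
norm = cce(target, preds, normalize=True)    # -log(0.5 / 0.8) = -log(0.625)
```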
https://github.com/keras-team/keras/blob/b80dd12da9c0bc3f569eca3455e77762cf2ee8e…