-
Hi, I'm trying to get the SHAP Values from the following neural network:
```
model_ser = Sequential()
model_ser.add(Embedding(input_dim=vocabulary_size, output_dim=embedding_size, input_length=…
-
**System information**
- Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 20.04
…
-
Hi! Sorry for another question!
When executing the batch script, all iterations other than the first one only contains the last stimulus:
```
Stimulus name: stim_plain
[stim] configuration:
[…
-
Nice work in this paper, I want to know that:
the paper mentioned that all linear ops are quantized into int4, what about **mat-multiply ops in the attention module?** Is the activation gradient in …
-
Mask is being applied to both output activation and target label in cost functions. Applying mask to the resulting gradient (https://github.com/seung-lab/znn-release/blob/master/python/train.py#L102) …
-
### What is the issue?
I am new to Ollama and have noticed that when I ask a query using Ollama, the model's responses are quite poor. However, if I ask the same query using https://www.llama2.ai/, I…
-
As we start onboarding more dtypes we ideally want them to work in as many different situations as possible so opening this tracker and will update the table as things change. If I should be adding mo…
-
#### Issue Description
Hi there!
I use a Neural Network for Deep Q Learning. After training it gives me the same outputs for every input.
My Input is an array with a size of 72 in which are eit…
-
您好,使用原始代码在2张A100 80G上面微调qwen,显存占用两张卡上都只有919M,但是在数据加载过程中?内存占用一直在增加,直到180多G后内存爆了,程序终止。请问这个问题怎么解?
训练log:
![image](https://github.com/TideDra/VL-RLHF/assets/36758049/09277b55-ea0a-4cfd-875b-792f457441a2…
-
Please make sure that this is a bug. As per our
[GitHub Policy](https://github.com/tensorflow/tensorflow/blob/master/ISSUES.md),
we only address code/doc bugs, performance issues, feature requests a…