-
I have been seeing very weird behavior when training and running Mistral or Mixtral with samples being exactly the length of `max_position_embeddings`. The strange behavior manifested itself with comp…
-
### 🐛 Describe the bug
When attempting to export the UDOP model to ONNX from the transformers library, the torch.onnx.export() command fails with a RuntimeError. Below is a minimal example to repro…
-
#### 问题描述 / Problem Description
Traceback (most recent call last):
File "tools/train.py", line 208, in
main(config, device, logger, vdl_writer)
File "tools/train.py", line 180, in main
…
-
```
from datasets import load_dataset
from random import randint
# Load our test dataset
eval_dataset = load_dataset("json", data_files="test_dataset.json", split="train")
rand_idx = ran…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x ] I am using the latest TensorFlow Model Garden release and TensorFlow 2.
Latest version clon…
-
The CIViC documentation lacks clear definitions of what the drug interaction types (`combination`, `sequential`, and `substitutes`) mean and when it is appropriate to use one over another.
Starting…
-
### Environment details
If you are already running SDV, please indicate the following details about the environment in
which you are running it:
* SDV version: 1.14.0
* Python version: 3.10.12…
-
(@si-npg, @rhao: Split off from #10)
Look at [NPGObjProvenance_2.xlsx](https://github.com/american-art/npg/files/475057/NPGObjProvenance_2.xlsx).
- @workergnome do you plan to try your provenance par…
-
[The format of the issue]
Paper name/title:
Paper link:
Code link:
amusi updated
2 months ago
-
I have gotten Rope scaling working for old GPTQ since it is now in transformers. In AutoGPTQ there is no way to set the transformers config as a parameter and it would have to be added.
I can try t…