-
### Feature request
We propose adding a new, widely adopted scheduler strategy for language model pretraining to the Transformers repository. Upon reviewing the current schedulers …
-
## ❓ How to fine-tune DETR using a small dataset (3k examples)
Hi everyone,
I'm using DETR in my master's thesis, which concerns the development of a door recognizer.
This is my first experience wit…
-
I am trying to run the following Python script, which uses the medium model:
```
import torchaudio
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write
model =…
-
Currently, emissions in MVS are defined by an `emission_factor` and the flow of the asset. Life cycle emissions will probably be defined by an `emission_factor` and the installed capacity.
These emis…
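
To make the two definitions concrete, here is a minimal sketch. It assumes each emission figure is simply the product of an emission factor and the relevant quantity (flow or installed capacity). The function names and the units in the comments are hypothetical and are not taken from the MVS code base.

```python
def operational_emissions(emission_factor: float, flow: float) -> float:
    """Emissions from operating an asset.

    Assumption: emission_factor is in kgCO2eq/kWh and flow is in kWh.
    """
    return emission_factor * flow


def life_cycle_emissions(emission_factor: float, installed_capacity: float) -> float:
    """Emissions tied to the installed capacity of an asset.

    Assumption: emission_factor is in kgCO2eq/kW and installed_capacity is in kW.
    """
    return emission_factor * installed_capacity


# Illustrative values only, not real data.
operating = operational_emissions(emission_factor=0.5, flow=1000.0)
embodied = life_cycle_emissions(emission_factor=20.0, installed_capacity=50.0)
print(operating, embodied)
```

Keeping the two calculations separate reflects that the factors would carry different units, so they should not share one parameter value.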
-
### script
```
numactl -C 11-15 python ./benchmark_docker_throughput.py \
--backend vllm \
--dataset /data/ShareGPT_V3_unfiltered_cleaned_split.json \
--model Qwen2-7B-Instruct \
--tru…
-
Command: `pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./`
Error log:
"DEPRECATION: --build-option and --global-option are depr…
-
Exception in thread Thread-6 (generate):
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/threading.py", line 1016, in _bootstrap_inner
self.run()
File "/opt/conda/lib/py…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
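I am using the /chatGLM-6B-int4 model. Inference on CPU works fine. On GPU, no error is raised when history is empty, but when history is not empty a CUDA error is raised immediately. The specific error is as follows: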
…
-
## Context: the new `global_random_seed` fixture
#22749 introduces a new `global_random_seed` fixture to make it possible to run the same test with any seed between 0 and 99 inclusive. By default, w…
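
The idea of running one check under every seed from 0 to 99 can be illustrated in plain Python. This is only a sketch of the concept, not the `global_random_seed` fixture itself: the helper `check_sample_mean`, its sample size and its tolerance are hypothetical, and it uses the standard-library `random` module rather than pytest.

```python
import random
import statistics


def check_sample_mean(seed: int, n: int = 1000, tol: float = 0.1) -> bool:
    """Return True if the mean of n uniform draws is within tol of 0.5.

    Hypothetical stand-in for a test body that takes a seed as input.
    """
    rng = random.Random(seed)  # seed-specific generator, no global state
    sample = [rng.random() for _ in range(n)]
    return abs(statistics.fmean(sample) - 0.5) < tol


# Run the same check for every seed in the range 0..99 inclusive
# and collect the seeds for which it fails.
failing_seeds = [seed for seed in range(100) if not check_sample_mean(seed)]
print(failing_seeds)
```

A check that passes for only some seeds would show up as a non-empty list, which is the kind of seed sensitivity that running over the full range is meant to reveal.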
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and did not find a match.
### Who can help?
_No response_
### What are you working on?
I fine-tuned a T5 model…