-
I'm trying to finetune **Mistral-Nemo-Base-2407** with a `text` dataset of long inputs. Usually, the SFTrainer will truncate it to fit the specified context size.
However, I get an error when using…
-
When I replicated the experiment on A40, the indicator did not reach the effect in the author's paper, and the map50 was only about 0.75, did the author use the pre-training weight of yolov5-l when ru…
-
**Describe the bug**
After setting the code, I always get an error with the following description:
**AttributeError: 'list' object has no attribute 'find'**
I tried to run the code on my local…
-
I want to load tokenizers from tiktoken to use for model training. Right now, this page https://huggingface.co/docs/transformers/main/en/tiktoken is only info I could find on tiktoken integration.
…
-
I'm glad the torch.compile is speeding up very quickly. On A5000 it can speed up 60%, but there's no acceleration at l4. I want to know why is it happen?
Here is my code, you can set --compile when r…
-
### Bug Description
Question 1:
Use a little txt file: The Milvus call the function ‘_create_hybrid_index()’ but the collection is not call 'self._conllection.load()',then the collection cant re…
-
inspired by this four-stage framework, I've implemented Transformer-STR:
https://github.com/opconty/Transformer_STR
thanks
-
### Feature request
We want to standardize the logic flow through Processor classes. Since processors can have different kwargs depending on the model and modality, we are adding a `TypedDict` fo…
-
### System Info
```Shell
Copy-and-paste the text below in your GitHub issue
- `Accelerate` version: 1.0.0
- Platform: Linux-6.10.11-amd64-x86_64-with-glibc2.40
- `accelerate` bash location: /dis…
-
https://github.com/ElvisClaros/GOT-OCR2.0/tree/main
!git clone https://github.com/Ucas-HaoranWei/GOT-OCR2.0.git
%cd /content/GOT-OCR2.0/GOT-OCR-2.0-master/GOT
%cd /content/GOT-OCR2.0/GOT-…