-
Running this code with the dataset DocRED, I got the result:
| epoch 0 | step 100 | min/b 0.11 | lr [5.45950864422202e-07, 1.8198362147406733e-06, 5.459508644222019e-06] | train loss 5290.696
| e…
-
Hey, interesting project - thanks for sharing.
In order to use it over here in the EU we would need this to run on 868MHz, which I assume would not be a big deal. How about LBT - did you consider imp…
-
Hello, I've seen Peertube transcode video after upload (really good idea, for security, standardization…) but, here I am: Peertube use MP4 & AAC, the first is under `patent encumbed` the second is a p…
-
This issue collects some findings obtained during the fine-tuning process for the german model for the classification and NER task.
- Flair is not able to fully realize the capacity of the GPU (eve…
-
### Is there an existing Discovery issue on this topic?
- [X] I have searched the existing issues
### Objective
Your task is to dive deeper into the world of APIs and Chrome DevTools. This is a dai…
-
Firstly, thank you for the awesome project. I'm new to LLMs so I hope this suggestion makes sense.
LoRA is a technique used to reduce the number of parameters during finetuning, that is really hitt…
-
I am currently trying to optimize an AL pipeline using small-text therefore I am looking into the source code and came across something slightly bizarre looking, namely in the `\integrations\transform…
-
Currently, the tokenizer_updates branch allows the tokenizer to support new languages by either adding characters or trained tokens. However, the model then needs to be trained/finetuned with these to…
-
**Describe the bug**
When I am training my model, everything goes well in the first batch, but an error occurs in the second batch:
`RuntimeError: Trying to backward through the graph a second time …
-
The Joint_trajectory_controller seems to generate some extra waypoints instead of just executing a defined trajectory. This seems as redundant functionally which overlaps with solutions like MoveIt wh…