-
Very great job! May I ask if the fine-tuning on this work is instruction fine-tuning?
-
### System Info
```shell
Platform:
- Platform: Linux-5.15.0-1056-aws-x86_64-with-glibc2.29
- Python version: 3.8.10
Python packages:
- `optimum-neuron` version: 0.0.23
- `neuron-sdk` …
-
### SFT data
1. Started the SFT stage with publicly available instruction tuning data ([Chung et al., 2022](https://arxiv.org/pdf/2210.11416))
2. Fewer but high quality > Millions of data but low …
-
Hello! Thank you for publishing Dhara.
I have been playing with it and wanted to share some performance numbers. I am using a NAND Flash chip on an embedded system. When I use my raw nand driver, I…
-
**Problem:** Currently, if we want to print rapidly changing values to the terminal (such as the angular heading of the robot during a test turn), it will quickly pollute the terminal with heading val…
-
Hello, I noticed that the fine-tuning of depth-anything-v1 was performed using the KITTI dataset (with sparse depth maps as ground truth). However, depth-anything-v2 was fine-tuned using Virtual KITTI…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and f…
-
I get this error message if I set the max_len to 300 or any higher than 100 for that matter whenever I'm training to train with FP8. I'm using cuda-12.4.0-2 and the nightly cuda 12.4 pytorch builds an…
-
I can't load `last.ckpt` of my fine-tuned model:
```bash
---------------------------------------------------------------------------
KeyError Traceback (most rece…
domef updated
1 month ago
-
## User story
1. As a ML engineer
2. I want / need to utilise the two Hetzner machines provided by our industry partner
3. So that we can use them for the fine-tuning of our LLM.
## Acceptance crite…