-
Hi @dirkgr! Here is a feature that would be very much desirable for decontamination, but I'm not sure how difficult it would be to implement into BFF:
The essential part of the feature would be to …
-
# URL
- https://arxiv.org/abs/2310.13127
# Affiliations
- Zhihan Zhang, N/A
- Shuohang Wang, N/A
- Wenhao Yu, N/A
- Yichong Xu, N/A
- Dan Iter, N/A
- Qingkai Zeng, N/A
- Yang Liu, N/A
…
-
请问大佬,qwen2-vl 的pretrain是否有计划支持呢
-
### The `Auto` API: Enhancing Developer and LLM Experience with Weaviate
User-friendliness and intuitiveness of interaction are becoming as crucial as a system's technical capabilities. Recognizing…
-
I'm trying to run prompt training with an LLMasJudge float loss alike G-Eval: 0-0.2-0.4-0.6-0.8-1 values. And the Trainer crashes since it expects the eval values to be 0 or 1
```
ValueError: acc_sc…
-
### Problem
We want to add support for this new model that unlike the previous ones also supports vision. The readme for the model is described below:
---
language:
- en
- de
- fr
- it
- pt…
-
### Description
LLM training in GPU cluster constantly run into NCCL / bad host issues. Ray can help to make running NCCL test in a cluster much easier.
We should be able to:
- Make it easy t…
-
Hello,
I am fairly new with LLM in general (only started to study 2 weeks ago). So if I say/ask something silly, please excuse me.
And I stumble upon this blog post from HuggingFace
https://hug…
-
## Description:
In AutoGluon's multimodal framework, Distributed Data Parallel (DDP) is the primary strategy employed for leveraging multiple GPUs across most problem types. A known limitation of D…
-
Can we log fine tuned llama models using mlflow ?