-
Is it possible to add to https://nvidia.github.io/TensorRT-LLM/ the code copy widget that you already have on https://nvidia.github.io/TensorRT-Model-Optimizer/?
For example if you go to https://nvidi…
-
# Establish XML Standard for Chain of Thought within JSONL for LLM Training
## Objective
Create an XML standard for structuring Chain of Thought (CoT) data within JSONL files for our open-source A…
-
Thanks for sharing your work. Can you provide the VRAM requirements for each type of training (full, vision only, LLM only)?
-
### Feature request
Hi,
I have a misunderstanding regarding training LLMs. When we train the model, we calculate the loss by having the model predict the next word and then compute the difference …
-
trying to run training (CosyVoice/examples/libritts/cosyvoice/run.sh), while doing the inference step i get this error:
'CosyVoiceModel' object has no attribute 'inference'
when looking inside infer…
-
Hi! Thank you for your outstanding work!
I have been working on improving the LangBridge approach, and I noticed your paper referenced it. As you discussed, LangBridge uses soft prompts generated b…
-
### Prerequisites
- [X] I have read the [documentation](https://hf.co/docs/autotrain).
- [X] I have checked other issues for similar problems.
### Backend
Local
### Interface Used
CLI
### CLI Co…
-
We could seemingly train the dataset on an image:vertexes pairing to get what would be essentially the equivalent of a depth-map to mesh language model?
I'd be very interested in the training code …
-
Hello Deep Search Team!
Thank you for this contribution to open source!
We are considering using your library to parse PDF files for LLM training, so we will potentially need to scale things up.…
-
You will see the problem in the text below, this is with using gpt-4o and version 0.5 of agent zero, but have similar issues with other models
User message ('e' to leave):
> Write a college level …