llm-training Search Results

1000+ results for llm-training

NVIDIA/TensorRT-LLM #2288

need a copy code widget to be able to copy code snippets

Is it possible to add to https://nvidia.github.io/TensorRT-LLM/ the code copy widget that you already have on https://nvidia.github.io/TensorRT-Model-Optimizer/? For example if you go to https://nvidi…

stas00 updated 1 week ago
3
daveshap/Raspberry #63

Establish XML Standard for Chain of Thought within JSONL for…

# Establish XML Standard for Chain of Thought within JSONL for LLM Training ## Objective Create an XML standard for structuring Chain of Thought (CoT) data within JSONL files for our open-source A…

daveshap updated 1 month ago
4
2U1/Molmo-Finetune #5

VRAM requirements for training

Thanks for sharing your work. Can you provide the VRAM requirements for each type of training (full, vision only, LLM only)?

thanhnguyentung95 updated 1 month ago
2
huggingface/transformers #31125

Understanding loss in Training LLM

### Feature request Hi, I have a misunderstanding regarding training LLMs. When we train the model, we calculate the loss by having the model predict the next word and then compute the difference …

mostafamdy updated 5 months ago
2
FunAudioLLM/CosyVoice #488

'CosyVoiceModel' object has no attribute 'inference'

trying to run training (CosyVoice/examples/libritts/cosyvoice/run.sh), while doing the inference step i get this error: 'CosyVoiceModel' object has no attribute 'inference' when looking inside infer…

doryashar updated 6 days ago
8
CONE-MT/MindMerger #7

Need for Theoretical Background on MindMerger Approach

Hi! Thank you for your outstanding work! I have been working on improving the LangBridge approach, and I noticed your paper referenced it. As you discussed, LangBridge uses soft prompts generated b…

Kosei1227 updated 1 week ago
1
huggingface/autotrain-advanced #795

[BUG] Any updates to errors due to Gradient Accumulation?

### Prerequisites - [X] I have read the [documentation](https://hf.co/docs/autotrain). - [X] I have checked other issues for similar problems. ### Backend Local ### Interface Used CLI ### CLI Co…

jackswl updated 5 days ago
1
nv-tlabs/LLaMA-Mesh #12

[Curious] Image to Mesh

We could seemingly train the dataset on an image:vertexes pairing to get what would be essentially the equivalent of a depth-map to mesh language model? I'd be very interested in the training code …

matbee-eth updated 23 hours ago
1
DS4SD/docling #133

State of GPU support

Hello Deep Search Team! Thank you for this contribution to open source! We are considering using your library to parse PDF files for LLM training, so we will potentially need to scale things up.…

ViktorooReps updated 1 week ago
4
frdel/agent-zero #97

Getting loops of creating agents which do the same thing end…

You will see the problem in the text below, this is with using gpt-4o and version 0.5 of agent zero, but have similar issues with other models User message ('e' to leave): > Write a college level …

devnull75 updated 2 months ago
4
