-
Dear community,
I'm excited to share Colossal-AI, a deep learning framework for training and inference of large language models (LLMs).
Colossal-AI stands out for its exceptional speed, su…
-
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid, so there are no tagged versions as…
-
2024-02-14 21:01 INFO 2048692:root - Downloaded https://dl.fbaipublicfiles.com/laser/CCMatrix/v1.0.0/2020-10_0278.tsv.gz [200] took 8s (5766.4kB/s)
2024-02-14 21:01 INFO 2048692:root - Starting downl…
-
**Is your feature request related to a problem? Please describe.**
LLM training is expensive; sample packing is a more efficient way to train.
**Describe the use case**
I am trying to…
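For context, sample packing concatenates tokenized samples into fixed-length blocks so a batch carries no wasted padding tokens. A minimal illustrative sketch in plain Python (the function name `pack_samples` and the layout below are assumptions for illustration, not any framework's API):

```python
# Hypothetical sketch of sample packing: concatenate tokenized samples
# (separated by an EOS token) into a single stream, then split the stream
# into fixed-length blocks so every training sequence is completely full.
def pack_samples(tokenized_samples, block_size, eos_id=0):
    stream = []
    for sample in tokenized_samples:
        stream.extend(sample)
        stream.append(eos_id)  # mark the sample boundary
    # Drop the ragged tail so every block is exactly block_size long.
    n_blocks = len(stream) // block_size
    return [stream[i * block_size:(i + 1) * block_size] for i in range(n_blocks)]

samples = [[5, 6, 7], [8, 9], [10, 11, 12, 13]]
blocks = pack_samples(samples, block_size=4)
# -> [[5, 6, 7, 0], [8, 9, 0, 10], [11, 12, 13, 0]]
```

Real implementations additionally adjust the attention mask (or position IDs) so tokens cannot attend across the packed sample boundaries; this sketch only shows the packing itself.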
-
Dear @flowersteam,
I am trying to reproduce your results for coursework and have run into a number of issues running the code. Here is a list of what I have found so far.
## Importing
Several files…
-
#### Description
I am retraining a LLaMA-3 model. Because my dataset is small, I attempted to use `freeze_updates` as referenced in the [NVIDIA NeMo documentation](https://docs.nvidia.com/…
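For anyone hitting the same situation: the generic mechanism that options like this wrap is simply disabling gradients on the frozen parameters. A hedged PyTorch sketch (this is not NeMo's `freeze_updates` API; the toy model and layer choices are assumptions for illustration):

```python
# Illustrative sketch of freezing most of a pretrained model when the
# fine-tuning dataset is small. NOT NeMo's `freeze_updates` -- just the
# underlying requires_grad mechanism such config options typically wrap.
import torch.nn as nn

model = nn.Sequential(
    nn.Embedding(100, 16),  # stand-in for the pretrained backbone
    nn.Linear(16, 16),
    nn.Linear(16, 100),     # stand-in for the head we still want to train
)

# Freeze everything except the final layer.
for param in model[:-1].parameters():
    param.requires_grad = False

# Only the head's parameters remain trainable.
trainable = [name for name, p in model.named_parameters() if p.requires_grad]
```

An optimizer built afterwards should be given only the trainable parameters, e.g. `filter(lambda p: p.requires_grad, model.parameters())`.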
-
Thank you for your great work!
In the stage-1 training mentioned in the paper, is the input to the LLM images and text? I ask because the description ‘After the pretraining stage, the model is capable of genera…
-
### Description
Hi, I am using the latest version of LLamaSharp, and my model is a Llama-3 70B GGUF. When GpuLayerCount is between 0 and 5, I get the answer, although it is not very fast, b…
-
### Contact Details
github
### What happened?
I came here to report an issue/bug (or possibly my own mistake) around the error: `llama_model_load: error loading model: done_getting_tensors: wrong numbe…