-
> If agentic translations can generate better results than traditional architectures (such as an end-to-end transformer that takes a text as input and directly outputs a translation) -- which are often faster…
-
Training time ≈ (8 × training tokens × model parameters) / (GPU count × GPU peak FLOPS × GPU utilization)
Unfortunately, the actual time spent does not match this. Does anyone have the co…
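As a sanity check, here is a minimal sketch of the estimate above; the token count, model size, cluster size, and utilization below are made-up example values, not measurements:

```python
# Estimate training time from the compute approximation above.
# All inputs are hypothetical example values for illustration.

tokens = 1e12            # training tokens
params = 7e9             # model parameters
n_gpus = 64              # number of GPUs
peak_flops = 312e12      # peak FLOPS per GPU (e.g. ~312 TFLOPS BF16 on an A100)
utilization = 0.4        # realized utilization is often 30-50%, far below 100%

seconds = (8 * tokens * params) / (n_gpus * peak_flops * utilization)
print(f"~{seconds / 86400:.1f} days")  # ~81.1 days for these numbers
```

If measured wall-clock time is much higher than the estimate, the utilization term is usually the culprit: real utilization includes data loading, communication, and recomputation overhead.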
-
### Description
There seems to be a problem with the layout of the `block` in the Introduction section.
![image](https://github.com/arXiv/html_feedback/assets/83172530/287b9310-69b7-4a89-be41-071ee9…
-
![js](https://github.com/TheProdigyLeague/Voyix/assets/30985576/c6fdf2b5-db86-4855-b7ab-13ea871ee27b)
# Prefix
There is a heavy workload from the machine learning components in Kubernetes. Consumer usage a…
-
Hello, is there a way to evaluate an LLM reranker after I finetune it on my own training dataset? Also, how should the test set be structured? The same as the training data (e.g., toy_finetune_data.jsonl)? Th…
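One common approach, as a sketch rather than an official eval API: hold out query/positive/negative triples in the same jsonl layout as the training file, score every candidate with the finetuned model, and compute a ranking metric such as MRR. Here `score_fn` is a hypothetical placeholder for however your finetuned reranker scores a (query, passage) pair:

```python
import json
from typing import Callable, List

def mrr_at_k(eval_path: str, score_fn: Callable[[str, str], float], k: int = 10) -> float:
    """Mean Reciprocal Rank over a held-out jsonl file using the same
    {"query": ..., "pos": [...], "neg": [...]} layout as the training data."""
    reciprocal_ranks: List[float] = []
    with open(eval_path) as f:
        for line in f:
            ex = json.loads(line)
            query = ex["query"]
            # Score positives and negatives together, then rank by score.
            candidates = [(p, True) for p in ex["pos"]] + [(n, False) for n in ex["neg"]]
            ranked = sorted(candidates, key=lambda c: score_fn(query, c[0]), reverse=True)
            # Rank of the first relevant passage within the top k, if any.
            rank = next((i + 1 for i, (_, is_pos) in enumerate(ranked[:k]) if is_pos), None)
            reciprocal_ranks.append(1.0 / rank if rank else 0.0)
    return sum(reciprocal_ranks) / len(reciprocal_ranks)
```

The structural point is that the eval file should mirror the training format but contain queries held out from training, so the metric reflects generalization rather than memorization.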
-
probability tensor contains either `inf`, `nan` or element < 0
Training epoch 0: 1%|█▋ …
-
FP8 is very useful for both training and inference of LLMs. Does FlashAttention support FP8?
Thank you~
-
Some resources to consider including:
https://applied-llms.org/
DragonAI training
Ontogpt training
-
Hi :) I'm really interested in this topic and looking forward to the documentation.
Thanks for sharing! 🙏
-
When I try to run the following finetuning command on a GPU:
`nohup ../build/bin/finetune --model-base llama-3b-Q5_0.gguf --train-data "shakespeare.txt" --save-every 1 --adam-iter 2 --batch 4 --…`