-
### Feature request
Hi, I'm the author of [zhuzilin/ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention).
I wonder if you are interested in integrating context parallel with [zh…
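For readers unfamiliar with the idea, the core of ring-style context parallelism is that each rank keeps its own query block while key/value blocks rotate around the ring, and attention is accumulated blockwise with an online (running-max) softmax. Below is a minimal single-process NumPy sketch of that accumulation; the loop over `k_blocks`/`v_blocks` stands in for the ring rotation, and the function name is illustrative, not the library's actual API.

```python
import numpy as np

def ring_attention(q, k_blocks, v_blocks):
    """Blockwise attention with online softmax, as used in ring/flash attention.

    q: (n, d) queries held by one rank.
    k_blocks, v_blocks: the KV shards that would arrive one hop at a time.
    """
    n, d = q.shape
    acc = np.zeros((n, d))          # unnormalized output accumulator
    denom = np.zeros(n)             # running softmax normalizer
    m = np.full(n, -np.inf)         # running row-wise max for stability
    for k, v in zip(k_blocks, v_blocks):   # one iteration per ring hop
        s = q @ k.T / np.sqrt(d)           # scores against this KV block
        m_new = np.maximum(m, s.max(axis=1))
        scale = np.exp(m - m_new)          # rescale previous partial sums
        p = np.exp(s - m_new[:, None])
        acc = acc * scale[:, None] + p @ v
        denom = denom * scale + p.sum(axis=1)
        m = m_new
    return acc / denom[:, None]
```

Because the accumulation is exact (not an approximation), the result matches full attention over the concatenated sequence, which is what makes the ring formulation attractive for long contexts.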
-
TensorRT-LLM has great potential for allowing people to run larger models efficiently with limited hardware resources. Unfortunately, the current quantization workflow requires significant computation…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports.

###…
-
- [ ] async
- [x] less wasteful LLM calls
I'm cooking on the Database stuff right now, and it's clear that there are a few things we can do to make the daily run much more efficient.
The searches…
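As a rough illustration of the "async + less wasteful LLM calls" direction, here is a minimal asyncio sketch that bounds concurrency with a semaphore and dedupes identical prompts by caching the in-flight task, so repeated searches don't trigger repeated calls. `fake_llm`, `Runner`, and the concurrency limit are illustrative stand-ins, not the project's actual code.

```python
import asyncio

calls = {"n": 0}  # count real LLM invocations (for demonstration)

async def fake_llm(prompt: str) -> str:
    # Stand-in for the real awaitable LLM call.
    calls["n"] += 1
    await asyncio.sleep(0.01)
    return f"answer:{prompt}"

class Runner:
    def __init__(self, concurrency: int = 8):
        self.sem = asyncio.Semaphore(concurrency)  # cap parallel calls
        self.cache = {}  # prompt -> task; dedupes in-flight and finished calls

    async def ask(self, prompt: str) -> str:
        if prompt in self.cache:
            return await self.cache[prompt]  # reuse existing result/task
        task = asyncio.ensure_future(self._call(prompt))
        self.cache[prompt] = task
        return await task

    async def _call(self, prompt: str) -> str:
        async with self.sem:
            return await fake_llm(prompt)

async def main():
    r = Runner()
    # "a" appears twice but only one real call is made for it.
    return await asyncio.gather(*(r.ask(p) for p in ["a", "b", "a"]))
```

Caching the task (rather than the finished result) is what collapses concurrent duplicate requests into a single call, which is usually where the biggest savings are in a daily batch run.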
-
If a GPU is available on the user's machine, using it instead of the CPU to process the GIF files would be a much more efficient and effective solution in terms of processing time.
…
-
Thank you so much for this project and your efforts to make GraphRAG accessible for the masses!
**Is your feature request related to a problem? Please describe.**
Systems with an appropriate GPU…
-
Certainly! Let's dive into a comprehensive brainstorm on how your code and project can evolve to achieve your goals. We'll explore various ideas, metrics, and improvements that could help you optimize…
-
### 🚀 The feature, motivation and pitch
There's a new DP sharding strategy that is more flexible and general; see more details at https://arxiv.org/abs/2311.00257 AMSP: Reducing Communication Overhead o…
-
### Feature request
Extract the spiking nature of the LLM and port that set of features over for training/inference.
https://github.com/ridgerchu/SpikeGPT
### Motivation
the benefits would r…