inference-acceleration Search Results

1000+ results
for inference-acceleration

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/LightGBM #2791

[Discussion] efficiency improvements

This is to call the efficiency improvement for LightGBM, include but not limited to: - Tree learning algorithm acceleration - I/O related, like dataset loading and model saving - Inference speed im…

guolinke updated 8 months ago
22
intel/intel-npu-acceleration-library #89

Enable graph mode for LLM inference

Hi, I have read the "examples\NPU compilation tutorial.ipynb" about graph mode and eager mode, which helped me a lot. I was wondering if I could use graph mode in LLM inference to reduce the weights…

xduzhangjiayu updated 3 months ago
9
hpcaitech/ColossalAI #3665

[DOC]: What is the hardware used in th Energon-AI inference …

### 📚 The doc issue In the Inference (Energon-AI) [Demo](https://github.com/hpcaitech/ColossalAI#GPT-3-Inference), what is the hardware used in th Energon-AI inference acceleration ? Can you show…

lz02k updated 1 year ago
1
exo-explore/exo #238

[BOUNTY - $1000] Compile tinygrad to swift

- I want to keep exo 100% python if possible - Would like to compile swift or objc inference code in tinygrad - The deliverable here is a merged PR in tinygrad and a small demonstration in exo of ho…

AlexCheema updated 5 hours ago
18
ultralytics/hub #839

When I loaded my dataset

### Search before asking - [X] I have searched the HUB [issues](https://github.com/ultralytics/hub/issues) and found no similar bug report. ### HUB Component Datasets ### Bug I got time out erro…

alimbetov updated 1 month ago
1
hughperkins/coriander #51

Compatibility with TensorFlow XLA/NVIDIA TensorRT?

Hi This is general question about deep learning inference acceleration with coriander. TF XLA good idea for inference optimization but limited available CUDA. And NVIDIA also release TensorRT as in…

c00lrain updated 4 years ago
3
AIFSH/ComfyUI-Hallo #44

no sd model found?

![Screenshot_20240808_204712](https://github.com/user-attachments/assets/a0546fd7-2e52-44ad-9d65-9d8e3c385dd0) how can i set the sd model? It is null? without i get this error: ``` find mo…

hexxter updated 2 months ago
1
intel-analytics/ipex-llm #4828

Nano: Add step by step tutorials for PyTorch

- [x] PyTorch Train tutorial - [x] PyTorch Inference - [x] runtime acceleration with openvino - [x] runtime acceleration with onnx-runtime - [x] quantization with INC - [x] quant…

yangw1234 updated 2 years ago
2
facebookresearch/maskrcnn-benchmark #946

libtorch trace supported

Since there many ops were using C++ implemented as well as CUDA. Would it be useful to make it trace and inference in libtorch? I believe it would make much more speed acceleration using libtorch a…

lucasjinreal updated 5 years ago
2
withinmiaov/A-Survey-on-Mixture-of-Experts #3

How about add MeteoRA to your survey?

We propose [MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models](https://arxiv.org/pdf/2405.13053). Our proposed MeteoRA (Multiple-Tasks embedded LoRA) is a scalable and efficient framewor…

ParagonLight updated 2 months ago
2

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for inference-acceleration

1000+ results
for inference-acceleration