efficient-inference Search Results

1000+ results
for efficient-inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

tensorflow/flutter-tflite #171

[Feature-Request] Incorporation of `tflite_flutter_helper` f…

As many of you might already be familiar, `tflite_flutter_helper` is a popular helper library, specifically for Image processing while dealing with tflite. This was earlier developer by tensorflow tea…

saurabhkumar8112 updated 3 months ago
17
opea-project/GenAIComps #831

[RFC] OPEA Inference Microservices Integration for LangChain…

# OPEA Inference Microservices Integration for LangChain This RFC proposes the integration of OPEA inference microservices (from GenAIComps) into LangChain [extensible to other frameworks], enabli…

avinashkarani updated 3 weeks ago
2
tracel-ai/models #49

Feature chemical models of prediction and generation support…

Notice tracel-ai from burn framework, this software must substitute to high performance predictions, like robotics, predict from data lake. Some molecular pretrained models use RoBERTa as base model, …

linjing-lab updated 1 week ago
2
rhymes-ai/Allegro #25

Windows - RuntimeError: No available kernel. Aborting execut…

Trying to get this working under Windows. I clone the repository, create a new venv and try and install requirements.txt. xformers fails with ``` Collecting xformers==0.0.28.post1 Downloadi…

SoftologyPro updated 1 month ago
10
dingo-actual/om #4

Create PyPi Package

dingo-actual updated 5 days ago
1
ultralytics/ultralytics #13879

How to Optimize YOLOv8 Preprocessing and Postprocessing Time…

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

auner2456889 updated 3 weeks ago
6
huggingface/optimum-habana #1451

Meta-Llama-3 model text-generation example output is unexpec…

### System Info ```shell deepspeed 0.14.4+hpu.synapse.v1.18.0 optimum-habana 1.14.0 docker image: vault.habana.ai/gaudi-docker/1.18.0/ubuntu22.04/habanalabs/pytorch-ins…

aslanxie updated 6 days ago
5
kubeedge/ianvs #126

Cloud-edge collaborative speculative decoding for LLM based …

- Description: - The autoregressive decoding mode of LLM determines that LLM can only be decoded serially, which limits its inference speed. Speculative decoding technique can be used to decode L…

hsj576 updated 3 months ago
15
ggerganov/llama.cpp #10295

Feature Request: shared tokens in batches with `logits = tru…

### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.…

Lyrcaxis updated 2 weeks ago
9
AkihikoWatanabe/paper_notes #601

Efficiently Scaling Transformer Inference, Reiner Pope+, N/A…

# URL - https://arxiv.org/abs/2211.05102 # Affiliations - Reiner Pope, N/A - Sholto Douglas, N/A - Aakanksha Chowdhery, N/A - Jacob Devlin, N/A - James Bradbury, N/A - Anselm Levskaya, N/A…

AkihikoWatanabe updated 1 year ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for efficient-inference

1000+ results
for efficient-inference