-
### System Info
```shell
system: inf2.48xlarge instance
OS: Amazon Linux 2023
build was done with v0.0.23, but mainline has the same issue
```
### Who can help?
@dacorvo
### Informatio…
-
```
$ ./run.sh $(./autotag text-generation-inference)
Namespace(packages=['text-generation-inference'], prefer=['local', 'registry', 'build'], disable=[''], user='dustynv', output='/tmp/aut…
-
### 🚀 The feature, motivation and pitch
[Writer](https://www.writer.com) has introduced the ["Writing in the Margins" algorithm (WiM)](https://arxiv.org/html/2410.05258v1) that
boosts results for long contex…
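As a rough sketch of the inference pattern (placeholders only: `generate` stands for whatever backend call produces text, and the prompt wording below is invented for illustration, not taken from the paper), WiM prefills the long context segment by segment, asks the model to write an intermediate "margin" note after each segment, keeps the notes it judges relevant, and conditions the final answer on them:

```python
def writing_in_the_margins(generate, segments, query):
    """Hedged sketch of the WiM pattern; `generate(prompt) -> str` is a hypothetical backend call."""
    margins = []
    for i in range(len(segments)):
        # Prefill up to the current segment, then ask for a short extractive
        # "margin" note about information relevant to the query.
        context_so_far = "".join(segments[: i + 1])
        margin = generate(
            f"{context_so_far}\n\nExtract any information relevant to: {query}"
        )
        # Keep only margins the model itself classifies as relevant.
        verdict = generate(
            f"Is the following note relevant to '{query}'?\nNote: {margin}\nAnswer yes or no."
        )
        if verdict.strip().lower().startswith("yes"):
            margins.append(margin)
    # Final answer conditions on the full context plus the collected margins.
    notes = "\n".join(margins)
    return generate(
        f"{''.join(segments)}\n\nNotes:\n{notes}\n\nQuestion: {query}"
    )
```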
-
### Feature request
Hello! It would be awesome to have LLaVA support (upload an image to the API and have it embedded via CLIP, etc.)
https://github.com/haotian-liu/LLaVA
text-generation-webui alre…
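For reference, the core of the LLaVA approach is a CLIP vision encoder whose patch features are mapped into the LLM's embedding space by a learned projection (a linear layer in the original LLaVA). Below is a minimal sketch with Hugging Face `transformers`; the checkpoint name, the 4096 hidden size, and the randomly initialized projector are assumptions for illustration, since the real projection weights ship with the LLaVA checkpoints:

```python
import torch
from PIL import Image
from transformers import CLIPImageProcessor, CLIPVisionModel

vision_tower = CLIPVisionModel.from_pretrained("openai/clip-vit-large-patch14")
processor = CLIPImageProcessor.from_pretrained("openai/clip-vit-large-patch14")

image = Image.open("example.jpg")  # placeholder image path
pixel_values = processor(images=image, return_tensors="pt").pixel_values

with torch.no_grad():
    # Shape (1, 257, 1024): one CLS token plus 256 patch embeddings.
    patch_features = vision_tower(pixel_values).last_hidden_state

llm_hidden_size = 4096  # assumption: a 7B LLaMA-class language model
projector = torch.nn.Linear(patch_features.shape[-1], llm_hidden_size)  # random init here
image_embeds = projector(patch_features[:, 1:, :])  # drop CLS, keep patch tokens

# `image_embeds` would then be spliced into the prompt's token embeddings at the
# position of an <image> placeholder before running the language model.
```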
-
### Checklist
- [X] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused b…
-
The most serious one is the function `forward_step` in `megatron/text_generation_utils.py`.
During LLM inference, the model either has to take all tokens preceding the current token as input, or cache the K and V of those tokens. But …
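To make the two options concrete, here is a minimal, generic sketch of single-head attention decoding with and without a K/V cache (illustration only, not the Megatron implementation; no masking, batching, or multi-head handling):

```python
import torch

def attend(q, k, v):
    # q: (1, d), k/v: (t, d) -> (1, d)
    scores = q @ k.T / (q.shape[-1] ** 0.5)
    return torch.softmax(scores, dim=-1) @ v

def decode_without_cache(wq, wk, wv, token_embs):
    # Every step re-feeds the whole prefix and recomputes its K and V.
    outs = []
    for t in range(1, token_embs.shape[0] + 1):
        prefix = token_embs[:t]
        q = prefix[-1:] @ wq
        k, v = prefix @ wk, prefix @ wv
        outs.append(attend(q, k, v))
    return torch.cat(outs)

def decode_with_cache(wq, wk, wv, token_embs):
    # Each step only projects the newest token; K/V of earlier tokens are reused.
    k_cache, v_cache, outs = [], [], []
    for t in range(token_embs.shape[0]):
        x = token_embs[t : t + 1]
        k_cache.append(x @ wk)
        v_cache.append(x @ wv)
        outs.append(attend(x @ wq, torch.cat(k_cache), torch.cat(v_cache)))
    return torch.cat(outs)
```

Both functions produce identical outputs; the cached version just avoids re-projecting the whole prefix at every step.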
-
### System Info
```bash
gpu=0
num_gpus=1
model=meta-llama/Meta-Llama-3.1-8B-Instruct
docker run -d \
--gpus "\"device=$gpu\"" \
--shm-size 16g \
-e HUGGING_FACE_HUB_TOKEN=$token \
-p 8082:80 …
-
Hello, thanks for your great work! I have encountered several problems during the reproduction process and would like to ask for advice:
1. I tried to generate actions using my own audio and used M…
-
### System Info
```shell
text-generation-launcher 2.1.0
```
### Information
- [X] Docker
- [X] The CLI directly
### Tasks
- [ ] An officially supported command
- [ ] My own modifications
### Reprod…
-
# Proposed Feature
Add an efficient interface for computing generation probabilities of fixed prompt-and-completion pairs. For example:
```python
# ... load LLM or engine
prompt_completion_pairs = [
…
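# --- Hedged sketch, separate from the truncated example above ---
# One way such an interface could compute completion log-probabilities today,
# shown with plain Hugging Face transformers rather than any particular engine
# API; the "gpt2" checkpoint is only a placeholder for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

def completion_logprob(prompt: str, completion: str) -> float:
    prompt_ids = tok(prompt, return_tensors="pt").input_ids
    completion_ids = tok(completion, add_special_tokens=False, return_tensors="pt").input_ids
    input_ids = torch.cat([prompt_ids, completion_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    # Log-probability of every token given its prefix, then keep the completion part.
    logprobs = torch.log_softmax(logits[:, :-1], dim=-1)
    targets = input_ids[:, 1:]
    token_logprobs = logprobs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    return token_logprobs[:, prompt_ids.shape[1] - 1:].sum().item()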