-
It would be useful to be able to stream tokens to a client as they are generated, as other text-generation interfaces do. From what I've observed, this feature does not seem to be…
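For context, a minimal sketch of what client-side token streaming could look like. The generator and token source here are hypothetical stand-ins, not the project's actual API:

```python
import time

def generate_tokens(prompt):
    # Hypothetical stand-in for a model's incremental decode loop;
    # a real backend would yield tokens as the model produces them.
    for word in ("Streaming", "lets", "clients", "render", "output", "early"):
        time.sleep(0.01)  # simulate per-token latency
        yield word + " "

def stream_to_client(prompt):
    # Forward each token as soon as it arrives instead of waiting
    # for the full completion to finish.
    for token in generate_tokens(prompt):
        print(token, end="", flush=True)
    print()
```

The point of the pattern is that the caller can start rendering after the first token rather than after the last one.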
-
## Feature request
My playlist contains many internet radio stations. All stations send MP3 streams. In MPD, all the streams are noisy; a test with VLC and the same sources was successful. What ca…
-
Code:
```python
def download_pdf(url, paperId):
"""Download specified PDFs if not in repository"""
filename = "pdfs/" + paperId + ".pdf"
# Check if the file already exists
    if os.pa…
```
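The snippet is cut off; a self-contained sketch of the behavior the docstring describes (skip the download when the PDF is already in the repository) might look like this. `urllib` and the `pdf_path` helper are assumptions for illustration; the original code may use a different HTTP client:

```python
import os
import urllib.request

def pdf_path(paper_id, repo_dir="pdfs"):
    # Build the repository path for a paper's PDF
    return os.path.join(repo_dir, paper_id + ".pdf")

def download_pdf(url, paper_id, repo_dir="pdfs"):
    """Download the specified PDF if it is not already in the repository."""
    filename = pdf_path(paper_id, repo_dir)
    if os.path.exists(filename):
        return filename  # already downloaded; skip
    os.makedirs(repo_dir, exist_ok=True)
    urllib.request.urlretrieve(url, filename)
    return filename
```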
-
Placeholder for prepping for the Zotero 7 release.
- [x] install.rdf → manifest.json
- [x] update.rdf → updates.json
- [x] XUL Overlays → bootstrap.js
- [x] chrome.manifest → runtime chrome regi…
-
### System Info
Docker Image: ghcr.io/huggingface/text-generation-inference:sha-1734540
Instance: AWS A10G via Hugging Face Inference Endpoint
### Information
- [X] Docker
- [ ] The CLI directly
…
-
# Weekly GitHub Trending! (2024/04/22 ~ 2024/04/29)
## Python trending: 11 repos
### [meta-llama](https://github.com/meta-llama) / [llama3](https://github.com/meta-llama/llama3)
Official Meta Llama 3 GitHub …
-
If streaming is enabled, generation slows down significantly.
The following script:
```python
Runs = 4
def request(stream:bool):
    client = openai.Client(api_key="foobar", base_url=EN…
```
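The script is truncated, so to illustrate the kind of measurement involved, here is a self-contained timing harness with a stubbed token source standing in for the real endpoint. Everything below (names, delays, token count) is a hypothetical stand-in, not the issue author's actual code:

```python
import time

def stub_tokens(n=20, per_token=0.001):
    # Stand-in for the server: emits tokens with a fixed per-token delay
    for i in range(n):
        time.sleep(per_token)
        yield f"tok{i} "

def timed_request(stream: bool, n=20):
    """Return (text, time_to_first_token, total_time) for one request."""
    start = time.perf_counter()
    if stream:
        parts = []
        first = None
        for tok in stub_tokens(n):
            if first is None:
                first = time.perf_counter() - start  # latency to first token
            parts.append(tok)
        text = "".join(parts)
    else:
        text = "".join(stub_tokens(n))  # text only available once complete
        first = time.perf_counter() - start
    return text, first, time.perf_counter() - start
```

Comparing the total times of the two modes over several runs (as the original `Runs = 4` loop appears to do) is what would expose a streaming slowdown.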
-
The preferred way to run models is to stand up an inference server locally (e.g., Triton + TensorRT, vLLM, or TGI) and then hit it from HELM as an API. This way, HELM can benefit from all the crazy …
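As a concrete shape for that API call, such servers typically expose an OpenAI-compatible chat-completions schema (vLLM's OpenAI-compatible server and TGI's Messages API both do). A sketch of the payload only; the model name and defaults are illustrative, not HELM's actual configuration:

```python
import json

def build_chat_request(model: str, prompt: str, max_tokens: int = 128):
    # OpenAI-compatible /v1/chat/completions request body (sketch);
    # the client POSTs this JSON to the locally hosted server.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

payload = json.dumps(build_chat_request("my-local-model", "Hello"))
```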
-
### Feature request
I am trying to run TGI on an HPC cluster. I tried pulling the Docker images with Singularity, but in that case the custom kernels do not work and CUDA complains…
-
### System Info
docker image 1.3.0
public runpod template: https://runpod.io/gsc?template=3uvdgyo0yy&ref=jmfkcdio
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An offic…