-
### System Info
```bash
# GPU index to expose to the container and number of GPUs in use
gpu=0
num_gpus=1
model=meta-llama/Meta-Llama-3.1-8B-Instruct
# $token must hold a valid Hugging Face Hub token
docker run -d \
  --gpus "\"device=$gpu\"" \
  --shm-size 16g \
  -e HUGGING_FACE_HUB_TOKEN=$token \
  -p 8082:80 …
-
**Task -** Flash Attention installation from source. [Completed]
**Run -** TGI 2.3.1 with models that have Flash Attention enabled. [Issue does not occur in TGI 2.2.0]
**Error -**
2024-1…
-
### 🚀 The feature, motivation and pitch
Request for dynamic download of LoRA adapters from S3 or the HF Hub, based on the adapter id passed in the request's `model` field.
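For illustration, a minimal sketch of how such a request might look from the client side, assuming the adapter id rides along in the `model` field of TGI's OpenAI-compatible endpoint; the port, adapter id, and resolution behavior below are illustrative assumptions, not a confirmed API:
```python
import requests

# Hypothetical sketch of the requested behavior: the server would resolve the
# adapter named in the `model` field and download it from S3 or the HF Hub on
# first use. The adapter id below is illustrative, not a real repository.
resp = requests.post(
    "http://localhost:8082/v1/chat/completions",
    json={
        "model": "some-org/my-lora-adapter",
        "messages": [{"role": "user", "content": "What is Deep Learning?"}],
    },
)
print(resp.json())
```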
### Alternatives
No alternatives …
-
It is a great plugin and I love it, but I found an error:
```
[LLM] http error: error sending request for url (http://localhost:11434/api/generate): connection closed before message completed
…
-
The speed difference compared to https://huggingface.co/chat/ is astounding when running llama2-70b-chat.
I wonder what I am doing wrong. I have A100 GPUs, but the maximum on a single node is 4, …
-
Please add Qwen2 support
```
EETQ_CAUSAL_LM_MODEL_MAP = {
"llama": LlamaEETQForCausalLM,
"baichuan": BaichuanEETQForCausalLM,
"gemma": GemmaEETQForCausalLM
}
```
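For illustration, a hedged sketch of what the requested change might look like; `Qwen2EETQForCausalLM` does not exist in the map today, so the class name is a hypothetical by analogy with the existing wrappers:
```python
# Hypothetical sketch only: Qwen2EETQForCausalLM is assumed by analogy with
# the existing wrapper classes and is not a confirmed class in the codebase.
EETQ_CAUSAL_LM_MODEL_MAP = {
    "llama": LlamaEETQForCausalLM,
    "baichuan": BaichuanEETQForCausalLM,
    "gemma": GemmaEETQForCausalLM,
    "qwen2": Qwen2EETQForCausalLM,  # requested addition (class name assumed)
}
```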
-
When running the comps/llms/summarization/tgi/langchain docker container and passing the `"streaming": false` parameter in the curl request:
`curl http://${your_ip}:9000/v1/chat/docsum -X POST -d '{"q…
-
Got a work-in-progress brewing for a Plus/4 port of PlatoTerm, though it is contingent on a couple of bugfixes in VICE and CC65.
* PlatoTerm code is in the [port/plus4 branch](https://github.com/rh…
-
**Snakemake version**
7.10.0
**Describe the bug**
When gurobipy is installed in the conda environment, snakemake will use pulp to check whether a license is available before running your snakema…
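As a quick way to see this in action, the sketch below lists the solvers PuLP detects in the active environment; with gurobipy installed, GUROBI appears in the result, which is when the license probe can fire. This is a diagnostic sketch, not Snakemake's internal code:
```python
import pulp

# List the solvers PuLP considers available in this environment. With
# gurobipy installed, "GUROBI" shows up here, and availability checking
# involves contacting the license.
print(pulp.listSolvers(onlyAvailable=True))
```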