-
### Feature request
Starlette supports `BackgroundTask`:
https://www.starlette.io/background/
~~~python
# starlette example
from starlette.applications import Starlette
from starlette.response…
~~~
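For reference, a complete minimal sketch of the Starlette `BackgroundTask` pattern from the linked docs (the handler and task names here are illustrative):

~~~python
from starlette.applications import Starlette
from starlette.background import BackgroundTask
from starlette.responses import JSONResponse
from starlette.routing import Route


async def send_welcome_email(to_address: str) -> None:
    # placeholder for work that should run after the response is sent
    ...


async def signup(request):
    data = await request.json()
    # the task is executed only after the response has been returned to the client
    task = BackgroundTask(send_welcome_email, to_address=data["email"])
    return JSONResponse({"status": "Signup successful"}, background=task)


app = Starlette(routes=[Route("/signup", signup, methods=["POST"])])
~~~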
-
**Describe the bug**
When I add the `--production` flag to the `bentoml serve` command, model serving becomes extremely slow compared to without the flag. The `--production` flag seems to make many…
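For context, a minimal sketch of the two invocations being compared (the `service:svc` target here is a placeholder for my actual service):

```bash
# development mode: responsive
bentoml serve service:svc

# production mode: extremely slow in my case
bentoml serve service:svc --production
```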
-
### Describe the bug
I was trying to use `gptq` inference as shown in the examples on the homepage:
```
openllm start falcon --model-id TheBloke/falcon-40b-instruct-GPTQ --quantize gptq --device …
```
-
Command: `openllm start opt`
I want to start the server on a port other than the default 3000. Can I change this port?
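A sketch of what I'd like to be able to do (assuming `openllm start` forwards a `--port` option to the underlying server; I have not verified that this flag exists):

```bash
# hypothetical: serve OPT on port 8080 instead of the default 3000
openllm start opt --port 8080
```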
-
### Describe the bug
`mpt` is listed under the supported models, but it is not available in the `openllm build` command. This is the error message:
```
openllm build mpt
Usage: openllm build [OPTIONS] {flan-t5|dolly-v…
```
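A hedged way to cross-check which models the installed CLI actually accepts (assuming this OpenLLM version ships an `openllm models` subcommand):

```bash
# list the models the installed openllm CLI claims to support (assumed subcommand)
openllm models
```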
-
### Describe the bug
When attempting to use OpenLLM to run a fine-tuned model, it fails to download any files with a `.bin` extension (usually model weights).
This issue results in missing file…
-
### Feature request
As is done with Triton Inference Server, it would be great to integrate vLLM (https://github.com/vllm-project/vllm) as a highly optimized engine for LLM generation based on cont…
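For reference, a minimal sketch of vLLM's offline API that such an integration could wrap (the model name and sampling parameters are illustrative only, and this snippet is untested here):

```python
from vllm import LLM, SamplingParams

# vLLM engine; continuous batching and PagedAttention are handled internally
llm = LLM(model="facebook/opt-125m")
sampling = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["Hello, my name is"], sampling)
for out in outputs:
    print(out.outputs[0].text)
```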
-
### Describe the bug
`bentoml containerize` fails
The stacktrace:
```
Building OCI-compliant image for vendor_ranking:124f3a586b498bde251a152bfbe73415495273c8 with buildx
Encountered except…
```
-
### Describe the bug
Following the stopgap measure recommended in [issue 299](https://github.com/bentoml/OpenLLM/issues/299), I installed OpenLLM with the `--no-binary` flag and tried to launch and query…
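For context, pip's `--no-binary` option names the packages to build from source rather than install from a wheel; an install of that form looks like this (the exact invocation from issue 299 is not reproduced here):

```bash
# build openllm from source instead of using the prebuilt wheel (sketch)
pip install openllm --no-binary openllm
```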
-
### Describe the bug
When trying to run a LLaMA 13B model and query it, I encounter a type error.
### To reproduce
Installed OpenLLM by running
```bash
pip install "openllm[llama, vllm, fine-tun…