offline-llm Search Results

1000+ results
for offline-llm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OpenPPL/ppl.llm.serving #60

关于性能分析的一点疑惑

## What are the problems?(screenshots or detailed error messages) 想问下有性能分析的工具嘛？profiler相关，还是只能用nsight profile这种自己去看一些算子性能 ## What are the types of GPU/CPU you are using? GPU：A100-80G-SXM4 ## What…

Zhiy-Zhang updated 5 months ago
1
mit-han-lab/llm-awq #184

No such file or directory: "VILA1.5-13b-AWQ/llm/model-00001-…

I've done the following: > Alternatively, one may also skip the quantization process and directy download the quantized VILA-1.5 checkpoints from [here](https://huggingface.co/Efficient-Large-Model…

kousun12 updated 3 months ago
8
intel/AI-Playground #81

Unable to open AI playground.. It hangs at loading screen.

## Describe the bug Unable to open AI playground.. It hang at loading screen. Tried installed the Latest Microsoft Visual C++ Redistributable Version - The latest version is 14.40.33816.0 No Pytho…

jaact updated 3 days ago
4
coleam00/bolt.new-any-llm #20

Open in a devcontainer?

**Is your feature request related to a problem? Please describe:** I would like to run this in a docker container to get the obvious benefits of containers. **Describe the solution you'd like:**…

sumith updated 3 days ago
3
vllm-project/vllm #3154

Generation with Prefix-cache are slower than the ones withou…

I'm running the tutorial [vllm/offline_inference_with_prefix.py](https://github.com/vllm-project/vllm/blob/main/examples/offline_inference_with_prefix.py) and measuring the generation times, again bel…

vin136 updated 5 months ago
11
shalb/charts #7

Make `HUGGINGFACE_OFFLINE` configurable

Currently [HUGGINGFACE_OFFLINE=1](https://github.com/shalb/charts/blob/7c29f2185336ed8d9cb14ffae9942f6b95462d12/huggingface-model/templates/application.yaml#L101-L102) is hardcoded in the helm templat…

slyt updated 3 months ago
3
intel-analytics/ipex-llm #10820

Llama-CPP Install Issue - Windows

The newest drivers are in use, the system is a Ryzen 2700x CPU with 16GB of RAM and a 16GB A770 GPU on Windows 11. The instructions in the docs were followed precisely. Upon attempting to execut…

ElliottDyson updated 6 months ago
4
kiwix/kiwix-js #1239

Consider adding support for calling a local (or remote) LLM …

This is highly speculative in terms of usefulness, and the UI would need to be considered carefully. Use case would be for summarizing articles retrieved from the ZIM. Over time, it might be possible …

Jaifroid updated 5 months ago
4
NVIDIA/spark-rapids-tools #1125

[FEA] Be able to recommend specific GPU SKU according to SQL…

**Is your feature request related to a problem? Please describe.** We are able to infer the recommendation by the qualification tool but the recommendation is based on vague GPUs. In recent experi…

wjxiz1992 updated 3 months ago
2
vllm-project/vllm #9153

[Bug]: InternVL bounding box prediction does not work

### Your current environment The output of `python collect_env.py` ```text python collect_env.py Collecting environment information... PyTorch version: 2.4.0+cu121 Is debug build: False C…

MoritzLaurer updated 6 hours ago
14

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for offline-llm

1000+ results
for offline-llm