-
I'd like to load data from a CSV file, rather than a safetensors file, and use it as the initial values.
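As a starting point, the CSV side can be handled with plain std I/O before copying the values into a tensor (e.g. with tch's `Tensor::from_slice` in recent versions, then a reshape). The sketch below is only the parsing half and assumes one row per line of comma-separated f32 values; the `parse_csv` helper is hypothetical, not part of tch.

```rust
use std::io::{BufRead, Cursor};

/// Parse comma-separated f32 values, one row per line.
/// Returns the flat values plus the (rows, cols) shape, ready to be
/// copied into a tensor (integration with VarStore not shown here).
fn parse_csv<R: BufRead>(reader: R) -> Result<(Vec<f32>, (usize, usize)), Box<dyn std::error::Error>> {
    let mut values = Vec::new();
    let (mut rows, mut cols) = (0, 0);
    for line in reader.lines() {
        let line = line?;
        if line.trim().is_empty() {
            continue; // skip blank lines
        }
        let row: Vec<f32> = line
            .split(',')
            .map(|s| s.trim().parse::<f32>())
            .collect::<Result<_, _>>()?;
        cols = row.len();
        rows += 1;
        values.extend(row);
    }
    Ok((values, (rows, cols)))
}

fn main() {
    let data = "1.0,2.0\n3.0,4.0\n";
    let (vals, shape) = parse_csv(Cursor::new(data)).unwrap();
    assert_eq!(shape, (2, 2));
    assert_eq!(vals, vec![1.0, 2.0, 3.0, 4.0]);
    println!("ok");
}
```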
VarStore
```rust
pub struct VarStore {
    pub variables_: Arc<Mutex<Variables>>,
    // …
```
-
### Check for existing issues
- [X] Completed
### Describe the bug / provide steps to reproduce it
Summary: Attempting to override the `default_model` does not apply when using the `openai` p…
-
gen_vulkan_shaders failed at
```
Command::new(vulkan_shaders_gen_bin)
.args([
"--glslc".as_ref(), "glslc".as_ref(),
"--input-dir".as_ref(), vulkan_shaders_src…
```
-
### System Info
TGI version: 2.2.0 (but I tested 2.3.0 too)
Machine: 8x H100 (640 GB GPU RAM)
```
2024-09-25T14:29:44.260160Z INFO text_generation_launcher: Runtime environment:
Target: x86_64-unkn…
```
-
**Describe the bug**
A clear and concise description of what the bug is.
**To Reproduce**
Steps to reproduce the behavior:
1. Dora start daemon: `dora up`
2. Start a new dataflow: `dora start …
-
Can't utilize GPU on Mac with
```
llama_cpp_rs = { git = "https://github.com/mdrokz/rust-llama.cpp", version = "0.3.0", features = [
"metal",
] }
```
Code
```
use llama_cpp_rs::{
opti…
```
-
Hi there,
First thank you for unsloth, it's great!
I've finetuned a llama-3-8b-Instruct-bnb-4bit and pushed it to hf hub. When I try to deploy it using [hf Inference Endpoints](https://huggingfa…
-
### Summary
There are various LLM inference libraries. WasmEdge already integrated llama.cpp, but we want to bring more to the community.
### Details
Already supported:
1. PyTorch
2. TFLi…
hydai updated 1 month ago
-
With the recent advent of large models (take Llama 3.1 405b, for example!), distributed inference support is a must! We currently support naive device mapping, which works by allowing a combination of…
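Naive device mapping of the kind described can be sketched as splitting a model's layers into contiguous chunks, one per device; this is an illustrative sketch under that assumption, not the project's actual API, and `map_layers` is a hypothetical name.

```rust
/// Assign each of `n_layers` transformer layers a device index in
/// 0..n_devices, as contiguous chunks; earlier devices absorb the
/// remainder when the layer count doesn't divide evenly.
fn map_layers(n_layers: usize, n_devices: usize) -> Vec<usize> {
    let base = n_layers / n_devices;
    let rem = n_layers % n_devices;
    let mut mapping = Vec::with_capacity(n_layers);
    for device in 0..n_devices {
        // first `rem` devices get one extra layer
        let count = base + if device < rem { 1 } else { 0 };
        for _ in 0..count {
            mapping.push(device);
        }
    }
    mapping
}

fn main() {
    // 10 layers over 4 devices: chunks of 3, 3, 2, 2
    assert_eq!(map_layers(10, 4), vec![0, 0, 0, 1, 1, 1, 2, 2, 3, 3]);
    println!("ok");
}
```

A real implementation would also have to weigh per-device memory and keep the embedding and head layers co-located where required, which is why purely even splits are called "naive".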
-
I'm trying to deploy Llama3 8b on GKE using optimum but am running into some trouble.
Following instructions here: https://github.com/huggingface/optimum-tpu/tree/main/text-generation-inference. I bu…