-
When I quantized the Qwen2.5-1.5B-Instruct model following **"Quantizing the GGUF with AWQ Scale"** in the [docs](https://qwen.readthedocs.io/en/latest/quantization/llama.cpp.html), it showed that th…
-
### Discussed in https://github.com/langchain-ai/langchain/discussions/27404
Originally posted by **kodychik** October 16, 2024
### Checked
- [X] I searched existing ideas and did not find …
-
A continuation of task #15. It should include an in-depth description of the technology behind LLMs and of their training and inference. Finish the section.
This issue should neatly be tied together …
-
**Is your feature request related to a problem? Please describe.**
The currently deployed version of instructlab requires llama_cpp version 0.2.79, and I want to run the new IBM Granite architecture, w…
-
I would like support for the following architectures:
- Mamba
- MambaByte
- Mamba-2
- Mamba-hybrid (mamba + transformer)
- Mamba-2-hybrid (mamba2 + transformer)
These architectures are becoming qu…
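For context on what supporting these architectures entails, the core of Mamba-style models is a selective state-space scan rather than attention. The sketch below is a minimal, illustrative NumPy version of that recurrence (discretized diagonal SSM with input-dependent step sizes); the function name, shapes, and parameterization are simplifying assumptions for exposition, not the actual llama.cpp or reference implementation.

```python
import numpy as np

def selective_scan(x, delta, A, B, C):
    """Illustrative Mamba-style selective scan.

    x:     (T, d) input sequence
    delta: (T, d) input-dependent step sizes
    A:     (d, n) per-channel diagonal state matrix (negative for stability)
    B, C:  (T, n) input-dependent projection and readout vectors
    Returns (T, d) outputs.
    """
    T, d = x.shape
    n = A.shape[1]
    h = np.zeros((d, n))          # hidden state, one n-dim state per channel
    ys = np.zeros((T, d))
    for t in range(T):
        dA = np.exp(delta[t][:, None] * A)       # discretized state transition
        dB = delta[t][:, None] * B[t][None, :]   # discretized input projection
        h = dA * h + dB * x[t][:, None]          # recurrent state update
        ys[t] = h @ C[t]                         # readout per channel
    return ys
```

The hybrid variants interleave blocks like this with standard transformer attention layers, which is why supporting them touches both the graph builder and the KV-cache/state management.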
-
When I quantized the Qwen2.5-1.5B-Instruct model following the "GGUF Export" section of examples.md in the docs, it showed that the quantization was complete and I obtained the GGUF model. But when I load …
-
### Name and Version
```
.\llama-cli.exe --version
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 2 CUDA devices:
Device 0: NVIDIA…
-
More details here: https://docs.google.com/document/d/11_6pvPzd956QONIxHuDP155eBRrd89xSC1tYVRy3KvI/edit#heading=h.z0eti03fxfmv
-
### Name and Version
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: yes
ggml_cuda_init: found 1 CUDA devices:
Device 0: NVIDIA GeForce RTX 4090, compute capab…
-
- Aggregated measures:
  - Difficult to aggregate measures of individual dimensions into a single index
  - Directly ask the LLM for an aggregate measure? It is not transparent and difficult to pr…
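One transparent alternative to asking the LLM for an aggregate score is a simple weighted average over the per-dimension measures, where the weights are explicit and auditable. The sketch below is a hypothetical illustration; the dimension names and weights are made up, not taken from the discussion above.

```python
def aggregate_index(scores, weights):
    """Combine per-dimension scores (each in [0, 1]) into one index.

    scores and weights are dicts keyed by dimension name; weights are
    normalized so the result stays in [0, 1]. Both the weights and the
    per-dimension scores remain inspectable, unlike a single opaque
    LLM-produced aggregate.
    """
    assert set(scores) == set(weights), "scores and weights must cover the same dimensions"
    total = sum(weights.values())
    return sum(scores[k] * weights[k] for k in scores) / total

# Hypothetical usage: two dimensions, accuracy weighted twice as heavily.
index = aggregate_index(
    {"fluency": 0.9, "accuracy": 0.6},
    {"fluency": 1.0, "accuracy": 2.0},
)
# index == (0.9 * 1.0 + 0.6 * 2.0) / 3.0 == 0.7
```

Because the weighting is explicit, disagreements about the index reduce to disagreements about the weights, which is easier to probe than an opaque aggregate.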