-
When running an FP16 GGUF model fully offloaded to the GPU with the Vulkan backend, performance is much worse than running on an AVX2 CPU. Quantized models, however, perform much faster when offloaded to …
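For reference, "fully offloaded" here looks roughly like the sketch below. It uses the llama-cpp-python bindings as an assumption (the report may use the llama.cpp CLI directly), and the model path and prompt are placeholders:

```python
# Sketch only: assumes a Vulkan-enabled build of llama-cpp-python.
# The model path and prompt are placeholders, not taken from the report.
from llama_cpp import Llama

# n_gpu_layers=-1 offloads every layer to the GPU (Vulkan backend in this setup).
llm = Llama(model_path="models/model-f16.gguf", n_gpu_layers=-1)

# Compare tokens/s against a CPU-only run (n_gpu_layers=0) on the same prompt.
out = llm("Explain what the Vulkan backend does in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```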
-
### Your current environment
Collecting environment information...
INFO 08-28 14:32:56 importing.py:10] Triton not installed; certain GPU-related functions will not be available.
WARNING 08-28 14:3…
-
**Problem Link:**
Check out the issue on GitHub: [Issue #2432](https://github.com/janhq/jan/issues/2432).
**Why This Matters:**
Jan is designed to work best with newer technology, using something…
-
This is a suggestion to make the documentation and user experience more finetuning-script agnostic.
So, currently:
- `finetune/lora.py` writes a `.../lit_model_lora_finetuned.pth`…
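To illustrate the coupling, downstream code currently has to hard-code whichever filename the chosen finetuning script happens to write. A minimal sketch, assuming a hypothetical output directory (only the filename comes from `finetune/lora.py` above):

```python
# Sketch of the current coupling: the consumer must know which finetune
# script ran in order to guess the checkpoint filename.
import torch

# Hypothetical directory; only the "lit_model_lora_finetuned.pth" filename
# comes from finetune/lora.py as described above.
ckpt_path = "out/lora/lit_model_lora_finetuned.pth"
checkpoint = torch.load(ckpt_path, map_location="cpu")
print(type(checkpoint))
```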
-
**Describe the bug**
```
python -m examples.serving.causal-lm.llama-2-chat --pretrained_model_name_or_path="TinyLlama/TinyLlama-1.1B-Chat-v1.0" --max_sequence_length=1024 --max_new_tokens=256 …
```
-
I am able to run this code with no problem in the miniconda venv that I installed solely for MLX:
from mlx_lm import load, generate
model, tokenizer = load("/Users/joy/mlx_model/solar_q8")
But …
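For reference, a complete minimal run with mlx_lm looks like the sketch below (same local model directory as above; the prompt is a placeholder):

```python
from mlx_lm import load, generate

# Same local quantized model directory as in the snippet above.
model, tokenizer = load("/Users/joy/mlx_model/solar_q8")

# verbose=True also prints generation speed, which helps compare environments.
response = generate(model, tokenizer, prompt="Hello, how are you?", verbose=True)
print(response)
```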
-
When evaluating the LM, the logging information that used to provide task documentation for the task manager is no longer available.
```bash
# Execution command
NUMEXPR_MAX_THREADS=72 lm_eval --model hf …
```
-
### 🔧 Proposed code refactoring
Instead of pushing our custom `h2oai_pipeline.py` to HF, we should use new chat template features.
See example: https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat…
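As a rough sketch of the replacement (the model id is the TinyLlama chat model referenced above; the messages are placeholders), the standard `apply_chat_template` API covers the prompt formatting that `h2oai_pipeline.py` currently handles:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is a chat template?"},
]

# The template stored with the tokenizer renders the conversation into the
# model's expected prompt format, so no custom pipeline code is needed.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```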
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
Hi
Could you please add some code for loading pretrained models from Hugging Face?
I downloaded a lightweight model in .bin format, but it didn't work.
My model:
https://huggingface.co/karpathy/tinyll…
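In the meantime, a generic sketch of fetching a checkpoint file from the Hub (the repo id and filename below are placeholders, not the exact files from the link above; how the file is consumed afterwards depends on the project):

```python
import torch
from huggingface_hub import hf_hub_download

# Placeholder repo id and filename; substitute the actual ones from the model page.
ckpt_path = hf_hub_download(repo_id="some-user/some-model", filename="pytorch_model.bin")

# A .bin checkpoint is typically a plain torch state dict.
state_dict = torch.load(ckpt_path, map_location="cpu")
print(type(state_dict))
```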