-
Hi, when I use the following command to evaluate Llama-2 7B on wikitext2:
lm_eval --model hf --model_args pretrained=meta-llama/Llama-2-7b-hf --tasks wikitext --device cuda:0 --batch_size 1
…
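For context on what the harness reports here: the wikitext task derives word-level perplexity from summed token log-likelihoods, i.e. `exp(-(total log-likelihood) / word count)`. A minimal stdlib sketch of that reduction (the log-likelihood values below are made-up numbers for illustration, not real model output):

```python
import math

def word_perplexity(token_logprobs, num_words):
    """Word-normalized perplexity:
    exp(-(sum of token log-likelihoods) / number of words)."""
    return math.exp(-sum(token_logprobs) / num_words)

# Hypothetical per-token log-likelihoods, for illustration only.
logprobs = [-2.1, -0.3, -1.7, -0.9]
print(round(word_perplexity(logprobs, num_words=4), 4))  # → 3.4903
```

Lower is better; a batch size of 1 only affects speed, not the reported perplexity.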
-
[meta engineering blog post](https://engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/)
- Meta requires massive computational power to train large lang…
-
- Laravel-mongodb Version: 4.2.0
- PHP Version: 8.3.4
- Database Driver & Version: php8.3-mongodb latest
### Description:
I have a model with a MySQL collection, e.g.
$product, with a meta-field…
-
I have fine-tuned the "meta-llama-3.1-8b-bnb-4bit" model using Unsloth. I have downloaded the LoRA weights and am able to run inference with them on a Colab GPU.
But I want to use this fine-tuned model for …
-
## Problem
Currently, the UI makes it impossible to tell if a model has finished streaming its response back to the user, or if it is still underway and is just taking a long time to calculate the re…
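One way a client-side UI can distinguish the two cases with OpenAI-style streaming backends: the server terminates the SSE stream with a `data: [DONE]` sentinel, so the UI can flip from "streaming" to "finished" only when that line arrives. A minimal sketch of the check (the sample lines are made up):

```python
def stream_finished(sse_lines):
    """Return True once the OpenAI-style [DONE] sentinel has arrived."""
    return any(line.strip() == "data: [DONE]" for line in sse_lines)

in_flight = ['data: {"choices": [{"delta": {"content": "Hel"}}]}']
complete = in_flight + ["data: [DONE]"]
print(stream_finished(in_flight), stream_finished(complete))  # → False True
```

Until the sentinel (or a closed connection) is observed, the response should be rendered as still in progress, however long the gap between chunks.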
-
### Your current environment
vllm version: 0.5.4
GPU: 24 GB memory
### 🐛 Describe the bug
```bash
CUDA_VISIBLE_DEVICES=0 vllm serve mistralai/Mistral-7B-Instruct-v0.3 --api-key yyy --port 1…
```
-
### What is the issue?
I've had this issue for a while with earlier versions of Ollama as well as the latest, on an Intel SPR 8480+ and an RTX 4090. The num_gpu parameter has been removed from the model file, so I can…
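If it helps as a workaround: Ollama's HTTP API accepts runtime parameters such as `num_gpu` per request via the `options` object of `/api/generate`, independent of the Modelfile. A sketch of building such a request body (model name, prompt, and value are placeholders):

```python
import json

def generate_payload(model, prompt, num_gpu):
    """Request body for Ollama's /api/generate; num_gpu (layers to offload
    to the GPU) is passed per-request via `options`."""
    return json.dumps({
        "model": model,
        "prompt": prompt,
        "options": {"num_gpu": num_gpu},
    })

body = generate_payload("llama3.1", "hello", num_gpu=20)
print(json.loads(body)["options"]["num_gpu"])  # → 20
```

POSTing that body to `http://localhost:11434/api/generate` applies the option for that request only.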
-
### System Info
I got this error when I tried to use the sentiment-classification pipeline with "nvidia/Mistral-NeMo-Minitron-8B-Base". It works fine with Llama 3.1.
TypeError: MistralForSequenceClas…
-
### Your current environment
vllm v0.5.4
Setup A) a single Docker container with vLLM, no pipeline parallelism
```
docker run ... vllm/vllm-openai:v0.5.4 --model "meta-llama/Meta-Llama-3.1-70B-…
```
-
### What is the issue?
I tried a 1xH100 box and got an error during installation; I got the same output from another, bigger 2xH100 box too:
```
root@C.11391672:~$ curl -fsSL https://ollama.com/instal…
```