evaluate-llm Search Results

1000+ results for evaluate-llm

Best match


vllm-project/vllm #1968

RuntimeError: Inplace update to inference tensor outside Inf…

1*8H100 DGX BOX Torch version: 2.1.1 CUDA version: 12.1 VLLM: 0.2.3 The inference works just fine in tensor parallel 1 but when using **tp > 1** I am getting this error below: WARNING 12-0…

imraviagrawal updated 2 months ago
14
EleutherAI/lm-evaluation-harness #2431

vllm with tensor_parallel_mode is not working at all because…

`CUDA_VISIBLE_DEVICES=0,1 lm_eval --model vllm \ --model_args pretrained=/home/jovyan/data-vol-1/models/meta-llama__Llama3.1-70B-Instruct,tensor_parallel_size=2,dtype=auto,gpu_memory_utilization=…

95jinchul updated 1 month ago
1
X-LANCE/SLAM-LLM #132

The answer is given to SLAM as input during training, wouldn…

### System Info PyTorch version: 2.4.0+cu121 ### Information - [X] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug I was fine-tuning SLAM on my own da…

Yahya-Saleh updated 1 month ago
1
whittle-org/whittle #175

support for structural pruning methods

**Is your feature request related to a problem? Please describe.** We should provide an interface for structural pruning methods, such as N pruning based on weight magnitude or methods like Wanda,…

aaronkl updated 1 week ago
3
FSoft-AI4Code/CodeMMLU #3

How can i get the answers in the dataset?

The CodeMMLU is a great piece of work! I noticed that the dataset provides task_id, question, and choices columns, but is there an answer column? How should I handle this dataset if I want to f…

Guozhenyuan updated 1 month ago
2
harvard-hbs-d3/d3-open-webui #1

Open WebUI still hallucinating quotes

We updated: enabled OCR and changed Top k to 40. We used the "Generative AI and the Nature of Work" paper and it still hallucinated 3 quotes. This ticket is to have a conversation between D3 and AM. …

ndbolligerD3 updated 6 days ago
1
huggingface/evaluate #433

Evaluate LLM models like llama/alpaca using evaluate library…

Hi team, thanks for open source this awesome tool. I am new to the tool and try to ask some questions on LLM evaluation 1. Seems `evaluate` already create some evaluators (Some libs call it tasks I…

Jeffwan updated 1 year ago
2
AkihikoWatanabe/paper_notes #1501

Scaling LLM Test-Time Compute Optimally can be More Effectiv…

# URL - https://arxiv.org/abs/2408.03314 # Authors - Charlie Snell - Jaehoon Lee - Kelvin Xu - Aviral Kumar # Abstract - Enabling LLMs to improve their outputs by using more test-time comput…

AkihikoWatanabe updated 2 weeks ago
2
ShaotongLi-Max/Sakura_Florescence_Prediction #1

Peer review 1

**Summary** This paper investigates the relationship between temperature and cherry blossom bloom duration in Japan using a dual model approach. By combining historical and modern data from satellite…

demainwang updated 5 days ago
1
confident-ai/deepeval #979

Knowledge retention metric does not work

**Describe the bug** Running tests for Knowledge Retention (following the documentation: [https://docs.confident-ai.com/docs/metrics-knowledge-retention]) generates error: TypeError: Claude.generate(…

domciakocan updated 2 months ago
1

