-
The configuration of LLM eval currently sits only within YAML files in [factgenie/llm-eval](https://github.com/kasnerz/factgenie/tree/main/factgenie/llm-eval). The files contain parameters such as pr…
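Loading one of those YAML configs might look roughly like this (a sketch only — the parameter names below are illustrative assumptions, not factgenie's actual schema):

```python
import yaml

# Illustrative config content; factgenie's real keys may differ.
raw = """
model: gpt-4o-mini
api_url: https://api.openai.com/v1
prompt_template: |
  Annotate the errors in the following text: {text}
"""

config = yaml.safe_load(raw)
# config is now a plain dict, e.g. config["model"] -> "gpt-4o-mini"
```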
-
I ran the evaluation script with max_num_samples=-1, and it was interrupted by an error after (or right at the end of?) evaluation on the wikicorpus-e-to-j task.
Data processing and so on were done exactly as described in the README.
I have also confirmed that the run completed without errors with max_num_samples=100.
Judging from the error message, the corpus-level BLEU …
-
## What is the problem?
If you change the model's name, the llm-eval ID listed at http://10.10.24.15:5000/llm_eval will also change, because the ID is derived from the config content, including…
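The described behavior can be sketched as a content hash over the config (a minimal illustration of why renaming the model changes the ID; factgenie's actual ID scheme may differ):

```python
import hashlib
import json

def config_id(config: dict) -> str:
    # Deterministic ID derived from the config content: any field change,
    # including the model name, produces a different ID.
    canonical = json.dumps(config, sort_keys=True)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:8]

id_a = config_id({"model": "gpt-4", "prompt": "..."})
id_b = config_id({"model": "gpt-4o", "prompt": "..."})
# id_a != id_b: the rename alone gives the eval a new identity.
```

This is exactly the property the issue flags: the ID is stable only as long as the config is byte-for-byte identical.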
-
I tried to run eval, but ran into some issues:
ModuleNotFoundError: No module named 'accessory.model.LLM.pointbert'
ModuleNotFoundError: No module named 'accessory.model.LLM.llama_qformerv2'
…
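One stdlib-only way to check up front which of these optional submodules are importable before launching eval (the module names are taken from the errors above):

```python
import importlib.util

def missing_modules(names):
    # Report modules that cannot be found without actually importing
    # their heavy contents. find_spec imports parent packages, so a
    # missing top-level package raises ModuleNotFoundError.
    missing = []
    for name in names:
        try:
            if importlib.util.find_spec(name) is None:
                missing.append(name)
        except ModuleNotFoundError:
            missing.append(name)
    return missing

required = [
    "accessory.model.LLM.pointbert",
    "accessory.model.LLM.llama_qformerv2",
]
# missing_modules(required) lists whichever are absent in this env.
```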
-
Implement offline inference with FastGen, using `offline_inference_example.py` from https://github.com/llm-jp/llm-jp-eval/pull/115 as a reference.
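The overall shape of such a script might look like the sketch below. The `generate` function is a placeholder standing in for the FastGen (DeepSpeed-MII) pipeline call, and the JSONL prompt/output format is an assumption modeled on typical offline-inference examples, not the PR's actual code:

```python
import json

def generate(prompt: str) -> str:
    # Placeholder: in the real script this would call the FastGen
    # (DeepSpeed-MII) pipeline instead of returning a constant.
    return "<generated>"

def run_offline_inference(prompts, out_path):
    # Run all prompts offline and dump prompt/output pairs as JSONL.
    with open(out_path, "w", encoding="utf-8") as f:
        for prompt in prompts:
            record = {"prompt": prompt, "output": generate(prompt)}
            f.write(json.dumps(record, ensure_ascii=False) + "\n")

run_offline_inference(["Translate: hello"], "results.jsonl")
```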
-
G-Eval includes "Auto Chain-of-Thoughts for NLG Evaluation" as a component, where the CoT steps used to carry out evaluation are produced by an LLM. Neither the paper nor this repo, however, includes the prompt defi…
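The Auto-CoT mechanism can be sketched as two prompt stages (the templates below are illustrative only — the exact prompt wording is precisely what this issue says is missing from the paper and repo):

```python
# Stage 1: ask an LLM to produce evaluation steps from the task
# definition and criteria (the "auto" chain-of-thought).
COT_GENERATION_PROMPT = (
    "You will be given a task description and evaluation criteria.\n"
    "Generate a numbered list of evaluation steps.\n\n"
    "Task: {task}\nCriteria: {criteria}\n\nEvaluation steps:"
)

# Stage 2: embed the generated steps into the scoring prompt that is
# then run on each (source, output) pair.
EVALUATION_PROMPT = (
    "Evaluation steps:\n{cot_steps}\n\n"
    "Source: {source}\nOutput: {output}\n\n"
    "Score (1-5):"
)

def build_eval_prompt(cot_steps: str, source: str, output: str) -> str:
    return EVALUATION_PROMPT.format(
        cot_steps=cot_steps, source=source, output=output
    )
```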
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expe…
-
- [ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
> “WARNING:ragas.llms.output_parser:Failed to parse …
-
As a user, I would like to be informed about the summarization effectiveness of my chosen LLM endpoint.
I would like to be able to evaluate an endpoint against a known, tested framework, to evaluat…
-
When I run `evaluate` with any model of VertexAI, I get several warnings that say
> Gapic client context issue detected. This can occur due to parallelization.
And sometimes the execution of eva…