llm-eval Search Results

1000+ results
for llm-eval

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

LeoLin4258/rwkvcn-docs #13

Uupdate the benchmark of RWKV

目前计划添加的数据： - [Uncheatable Eval](https://github.com/Jellyfish042/uncheatable_eval)：使用最新的动态数据测试 LLM 性能，包含 RWKV - [RULER_RWKV](https://github.com/Ojiyumm/RULER_RWKV)：RWKV 模型的 [RULER](https://arxiv.org/…

shoumenchougou updated 6 days ago
2
huggingface/evaluate #615

Benchmark evaluation for language models.

Not sure if this feature belongs to this library or would it require a complete separate library. I am proposing the creation of a library where llm benchmarks can be ran. For example, evaluating a mo…

mina58 updated 1 month ago
1
meta-llama/llama-recipes #645

upgrade typing_extensions version

### System Info PyTorch: 2.3 Cuda: 12.1 ### Information - [ ] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug I got error when i ran the command generated from …

lxning updated 1 month ago
1
langchain-ai/langchain #26335

ChatOllama is not supporting bind_tools as good as ChatGroq …

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a…

raj-acrivon updated 2 weeks ago
3
run-llama/llama_index #15939

[Question]: Issues with Context Generation and Metric Suppor…

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question When evaluating a RAG retrieval service using the llama-index evaluation method, I encou…

adityamity updated 2 weeks ago
31
horseee/LLM-Pruner #58

Evaluation：UnicodeDecodeError: 'utf-8' codec can't decode by…

Thank you very much for doing such great open-source work! i try: CUDA_VISIBLE_DEVICES=X bash scripts/evaluate.sh PATH_OR_NAME_TO_BASE_MODEL PATH_TO_SAVE_TUNE_MODEL PATH_TO_PRUNE_MODEL EPOCHS_YOU…

manlenzzz updated 3 weeks ago
1
geekan/MetaGPT #1462

API openrouter does not work

I tried the documentation: llm: api_type: 'openrouter' base_url: 'https://openrouter.ai/api/v1' api_key: 'sk...' model: meta-llama/llama-3-70b-instruct:nitro Then I got this issu…

edisop updated 1 month ago
2
AkihikoWatanabe/paper_notes #1418

LLM-jp-3 1.8B・3.7B・13B の公開, LLM.jp, 2024.09

https://llmc.nii.ac.jp/topics/post-707/

AkihikoWatanabe updated 1 week ago
3
EleutherAI/lm-evaluation-harness #2294

Worse evaluation performance with PEFT adaptors

Hello, thank you for providing this excellent model and repository. I encountered an issue while conducting my experiments with your codebase, and I’d appreciate your insights. In my experiments, I…

YananLi18 updated 2 weeks ago
1
All-Hands-AI/OpenHands #4157

[Bug]: Sandbox image build failed on eval

### Is there an existing issue for the same bug? - [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting - [X] I have checked the existing iss…

neubig updated 15 hours ago
7

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for llm-eval

1000+ results
for llm-eval