evaluating-models Search Results

1000+ results
for evaluating-models


irthomasthomas/undecidability #908

[2310.06770] SWE-bench: Can Language Models Resolve Real-Wor…

- [ ] [[2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://arxiv.org/abs/2310.06770) # [SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://ar…

ShellLM updated 2 weeks ago
1
sgravel/tracts #35

Problems of installing tracts and running example files

Dear Prof. Gravel, I am a researcher studying the genetic ancestry of the indigenous Siraya people in Taiwan. I have greatly benefited from your papers working on inferring admixture history based on…

wenyako updated 3 days ago
6
microsoft/InnerEye-DeepLearning #840

Inaccurate documentation for evaluating against pre-trained…

### Is there an existing issue for this? - [X] I have searched the existing issues ### Issue summary Some documentation is lacking / inaccurate for evaluating against pre-trained models. ### What …

peterhessey updated 1 year ago
1
evalplus/evalplus #228

Request for evaluating models running on TGI

Hi, I was going through this repo and I looked at the arguments that may be specified along with the model being evaluated. I see that in the engine arguments, there's no TGI but there is HuggingFa…

samin-batra updated 5 days ago
1
McGill-NLP/llm2vec #123

Unable to load merged model for MTEB evaluation

I have trained a model using supervised contrastive. I saved the model using - `l2v.save('/llm2vec_models/final_merged_model', merge_before_save=True, save_config=True)` Now when I try to run m…

sandeep-krutrim updated 1 month ago
2
ollama/ollama #6565

Does ollama have the feature to save model response in log fi…

OS: Linux ollama version: 0.3.7-rc5 model: starcoder2:3b I am deploying ollama for code completion and set OLLAMA_DEBUG=1, but the log file only saves the model request but not the model response …

keezen updated 1 week ago
2
OCA/automation #5

[16.0] automation_oca : can't use context_today() or datetim…

## Module automation_oca ## Describe the bug I create a new automation with a filter on write date ex : [("stage_id", "in", [10]),('write_date','

gaelTorrecillas updated 2 weeks ago
2
rmusser01/tldw #195

Evaluation: RAG implementation

Issue is to track evaluation of RAG implementations. Frameworks: - F Papers: - F - F One-Offs: - https://github.com/microsoft/promptflow/tree/main/examples/flows/evaluation/eval-qna-rag…

rmusser01 updated 1 week ago
1
cornellius-gp/gpytorch #1242

Evaluating a set of available Gaussian Process models

Dear Team, I am currently working on a project to predict vehicle trajectories using Gaussian Process regression. In my work, there is a need to train a set of Gaussian Process models each meant fo…

AshAswin updated 3 years ago
3
Paitesanshi/LLM-Agent-Survey #27

Introducing a new paper on role-playing LLM agents (ACL 2024…

Hi, what a fantastic resource for developing intelligent LLM agents! I wanted to highlight a recent paper presented at ACL 2024 Findings: [TimeChara: Evaluating Point-in-Time Character Hallucinatio…

ahnjaewoo updated 2 weeks ago
1
