-
- [ ] [[2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://arxiv.org/abs/2310.06770)
# [SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://ar…
-
Dear Prof. Gravel,
I am a researcher studying the genetic ancestry of the indigenous Siraya people in Taiwan. I have greatly benefited from your papers on inferring admixture history based on…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Issue summary
Some documentation is lacking / inaccurate for evaluating against pre-trained models.
### What …
-
Hi,
I was going through this repo and I looked at the arguments that may be specified along with the model being evaluated. I see that in the engine arguments, there's no TGI but there is HuggingFa…
-
I have trained a model using supervised contrastive learning and saved it using:
`l2v.save('/llm2vec_models/final_merged_model', merge_before_save=True, save_config=True)`
Now when I try to run m…
-
OS: Linux
ollama version: 0.3.7-rc5
model: starcoder2:3b
I am deploying ollama for code completion and set OLLAMA_DEBUG=1, but the log file only saves the model request and not the model response …
-
## Module
automation_oca
## Describe the bug
I create a new automation with a filter on write date, e.g.: [("stage_id", "in", [10]),('write_date','
-
This issue tracks the evaluation of RAG implementations.
Frameworks:
- F
Papers:
- F
- F
One-Offs:
- https://github.com/microsoft/promptflow/tree/main/examples/flows/evaluation/eval-qna-rag…
-
Dear Team,
I am currently working on a project to predict vehicle trajectories using Gaussian Process regression. In my work, there is a need to train a set of Gaussian Process models each meant fo…
-
Hi, what a fantastic resource for developing intelligent LLM agents!
I wanted to highlight a recent paper presented at ACL 2024 Findings: [TimeChara: Evaluating Point-in-Time Character Hallucinatio…