-
Very sorry for the late reply. Regarding evaluation, you can see the latest version of our paper: assets/06203.pdf. We evaluate our model in Our proposed [Scenario Understanding Benchmar…
-
Excellent work! Why isaid classes without background?
-
Thanks for your wonderful works!
As noted in your paper, there seems to be a lack of public benchmarks for academic documents. Would you kindly consider releasing your test dataset as a benchmark, …
-
## Original Task
Citing from the original course task:
> Training a strong Hebrew Sentence Encoder from a pretrained Decoder While recent years
have brought many additions to the open-source set …
-
**Submitting author:** @LC-SAR (Ling Chang)
**Repository:** https://github.com/AlignSAR/alignSAR
**Branch with paper.md** (empty if default branch): main
**Version:** v1.1
**Editor:** @rwegener2
**Rev…
-
**What would you like to be added/modified:**
1. Build a collaborative code intelligent agent alignment dataset for LLMs:
- The dataset should include behavioral trajectories, feedback, and i…
-
I encountered several issues when trying to train on the lom dataset and reproduce the author's results:
1. First, I found that in lmo2poet.py, the division method for the lmo dataset is quite diff…
-
Current scenario: only some datasets are compressed in the `data.h5` files using LZF.
For homogeneity, it would be good to compress **all** the datasets using the same compression. We should bench…
-
Replicating the steps and guides from the KaggleX Cohort 4 Project guide, fine-tune and deploy a Gemini model.
Using LoRA, QLoRA, or any other fine-tuning approach:
- [ ] Train an LLM model on the LP…
-
When following the [Python benchmark guide](https://github.com/lancedb/lance/blob/main/python/DEVELOPMENT.md#benchmarks) I get the following errors:
```$ pytest python/benchmarks -m "not slow"
===…