-
Hi, I have been trying to run the run_evaluation.sh with the provided checkpoints downloaded and unzipped to the checkpoints directory. I am running into this error:
evaluate.py: error: argument -…
-
For multilingual arc, the [original implementation](https://github.com/nlp-uoregon/mlmm-evaluation/blob/main/lm_eval/tasks/multilingual_arc.py) has 25 shots but in lm_eval, it doesn't
https://github.…
-
Hi,
I am currently working on the finetuning of "distiluse-base-multilingual-cased-v1", using MultipleNegativesRankingLoss and RerankingEvaluator, over a dataset of 700k (query, sentence) pairs. I'…
-
Hi all,
dataset:
https://huggingface.co/datasets/aari1995/false_friends_en_de
[associated paper](https://drive.google.com/file/d/1jgq0nBnV-UiYNxbKNrrr2gxDEHm-DMKH/view?usp=share_link)
this d…
-
Step 1: Train a model:
```
model_name = "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2"
model = SetFitModel.from_pretrained(
model_name,
multi_target_strategy="multi-outpu…
-
- **My train file has 8000 sentences but when i implemented this code it shows number of samples =817**
INFO:__main__:Creating features from dataset file at data/
8000
817
100%|██████████| 817/8…
-
Hi, thank you for your great work.
distiluse-base-multilingual-cased has one more dense layer compared to the pool-only model. How is this dense layer added?
We are constructing a Chinese long text…
-
* See #12 for file format of the input
* Compute Spearman’s rank correlation coefficient between this scoreand the human judgments
* Consider variations to use Euclidean distance and other metri…
-
Hey,
the file simlex-german.txt (https://github.com/nmrksic/eval-multilingual-simlex/blob/master/evaluation/simlex-german.txt) contains a wrong score value.
In the original german translation fr…
-
I am trying to build a Abstractive PreSumm model for Korean
At the beginning, I used bert-multilingual model but I've found its tokenizer was poor so I've decided to use a sentecepiece which was tr…