-
For a large dataset of about 10M QA pairs,
would accuracy improve if we divided the dataset by sentence length,
fed each part to a different training model, and decoded it acco…
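
One way to make the question concrete is to bucket the pairs by sentence length before routing each bucket to its own model. The following is a minimal sketch only: the function name `bucket_by_length`, the whitespace tokenization, and the boundary values are assumptions for illustration, not something taken from the project under discussion.

```python
from collections import defaultdict


def bucket_by_length(pairs, boundaries=(10, 20, 40)):
    """Group (question, answer) pairs by the whitespace token count of the question.

    `boundaries` are hypothetical upper limits; bucket i holds questions whose
    length is <= boundaries[i], and longer questions go into one final bucket.
    """
    buckets = defaultdict(list)
    for question, answer in pairs:
        n_tokens = len(question.split())
        # Index of the first boundary the length does not exceed;
        # lengths above every boundary fall into the last bucket.
        bucket_id = next(
            (i for i, limit in enumerate(boundaries) if n_tokens <= limit),
            len(boundaries),
        )
        buckets[bucket_id].append((question, answer))
    return dict(buckets)


# Toy data, invented for the example.
pairs = [
    ("what is nlp", "natural language processing"),
    ("how do i split a very long question into shorter pieces before training a model", "by length"),
]
buckets = bucket_by_length(pairs)
```

Each bucket could then be used to train a separate model, with the same length rule applied at decoding time to pick the model. Whether this improves accuracy is an empirical question that the sketch does not answer.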
-
Dear author,
I would like to ask about the timestamps in the re-aligned annotation files of the How2Sign dataset. Are "_START_REALIGNED END_REALIGNED_" the timestamps corresponding to sentences in reco…
-
Hello,
I have been looking into this model for a few days. I wanted to train it on a new dataset, so I took a sample dataset of 15 sentence pairs with human gold scores. Trained mo…
-
I was going to comment on #745, but I think this is a more general discussion about vocabulary building, although I don't know whether it would be considered a meta issue.
## Character cove…
-
## Adding a Dataset
- **Name:** MedSTS
- **Description:** 1,068 sentence pairs annotated by two medical experts with semantic similarity scores of 0-5 (low to high similarity).
- **Task:** STS
- *…
-
Hey, I have downloaded the YouCook2 features and am training a common space learning network to associate positive sentence-clip pairs for video retrieval.
I have loaded the video features and the tex…
-
The schema requires entries (gene pair units) with the following (minimum) information:
genePairId | gene1 | gene2 | docid | scope
- options for scope: document > paragraph > sentence > event | …
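
The minimum fields listed above could be represented as a small validated record. This is a sketch under stated assumptions: the class name `GenePairEntry`, the Python field names, and the string types are hypothetical, and the set of scopes covers only the four options visible in the text, which is truncated and may list more.

```python
from dataclasses import dataclass

# Scope values visible in the text, ordered from coarsest to finest.
# Assumption: the truncated source may define additional options.
SCOPES = ("document", "paragraph", "sentence", "event")


@dataclass(frozen=True)
class GenePairEntry:
    """One gene pair unit with the minimum required information."""

    gene_pair_id: str  # genePairId
    gene1: str
    gene2: str
    docid: str
    scope: str

    def __post_init__(self):
        # Reject scopes outside the known set so bad rows fail early.
        if self.scope not in SCOPES:
            raise ValueError(f"unknown scope: {self.scope!r}")


# Illustrative values only; the identifiers are invented for the example.
entry = GenePairEntry("gp1", "geneA", "geneB", "doc42", "sentence")
```

Validating `scope` at construction time keeps malformed entries out of later processing steps.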
-
Reading lines...
Read 141382 sentence pairs
Trimmed to 11132 sentence pairs
Counting words...
Counted words:
eng 2953
fra 4540
start training...
Traceback (most recent call last):
File "tra…
-
* Name of dataset: syneval
* URL of dataset: https://github.com/BeckyMarvin/LM_syneval
* License of dataset: EMNLP 2018
* Short description of dataset and use case(s): a collection of tasks that ev…
-
@nreimers While training a cross-encoder I get this error. Training completes, and the error appears when evaluation starts. What is the solution?