-
**Describe the bug**
In normal mode when pressing the key sequence ci( or ci[ of ci{, the cursor does not jump to after the opening parenthesis, bracket or brace and does not remove the word in the p…
-
I try to train a mask_align model with default config in the repo (only change data paths) and DE-EN training data from https://github.com/lilt/alignment-scripts. In some of training steps the losses …
-
The model's output is a torch.cuda.FloatTensor. How can I get real score between 2 sentences?
-
Looks like the train time is pretty long on AWS instances with K80s. Adding MultiGPU data parallelism would be a good way to mitigate this (as done in https://www.tensorflow.org/tutorials/using_gpu#us…
-
We want to **fine-tune** 'bert-large-nli-stsb-mean-tokens' on **multi-label classification task**. So that we can use the output model to get **embeddings** out.
We have a bunch of sentences classif…
-
Running on M2 Max Macbook Pro with 32GB memory, inference only.
Have found that encoding is very slow compared to BGE-Large-en-v1.5 (couple hundred ms on BGE-Large and 12+ seconds on BGE-M3). The e…
-
Hi, I came across FactKB in a project to evaluate summarization and found it useful for my study. I tried to run the code and found that the pretrained model only accepts max 512 tokens which seems to…
-
First, thanks for writing this. I'm smack in the middle of your target audience, an everyday programmer without much knowledge of cryptography, and reading the first few sections has proved enlighteni…
-
Hi there,
thank you very much for this extremely helpful/useful library!
quick question: are there multilingual cross-encoders available? Going through the docs, I could not find an explicit refe…
-
Hello,
I use the multi-lingual translation task and find some issues:
1. When I use the round_robin_dataset and multi_corpus_sampled_dataset for more than 20 language pairs, the data loading will …