-
Hi,
I'm trying to fine-tune a Persian BERT model on a binary classification dataset prepared on Persian.(paraphrase, non-paraphrase)
the thing is happening is before first epoch is completely done, …
-
Lots of multilingual datasets listed here https://docs.google.com/spreadsheets/d/1qf0iYejG-9RgEEi13qB_SK_178-eNaeJDmSDNSj260A/edit?gid=1875159366#gid=1875159366 from https://blog.voyageai.com/2024/06/…
-
The readme makes it sound very simple: "Replace bert with xphonebert"
Looking a bit closer looks like it's quite a feat to make StyleTTS2 talk in non-english languages (https://github.com/yl4579/Styl…
-
Would be ideal to add the list of actual language codes to the [multilingual README](https://github.com/google-research/bert/blob/master/multilingual.md), so other systems can do lookups, and to preve…
-
https://huggingface.co/transformers/model_doc/wav2vec2.html
SpeechRecognition using BERT
-
https://www.w3.org/TR/css-fonts-5/#generic-font-families
https://www.w3.org/TR/css-fonts-4/#generic-font-families
Per https://github.com/w3c/csswg-drafts/issues/8128#issuecomment-1962180465 i woul…
-
Some language pairs are oddly missing from ...WikiMatrix/list_of_bitexts.txt, against my intuitions on which ones would have more data and thus more matching sentences.
For example, Armenian (`hy`)…
-
I've been looking at implementing some alternative hanging punctuation code as envisionned in https://github.com/koreader/koreader/issues/2844#issuecomment-464483142.
I figured I may need, in lvtex…
-
Hi, I encounter the same problem as in https://github.com/facebookresearch/LAMA/issues/10.
And I found the reason why 2 examples are filtered is that the `obj_label` are `1970s` and `1990s`. And in `…
-
This is going to collect missing spaces a fter a period as discussed in https://github.com/petergtz/alexa-wikipedia/issues/37.