-
Hi,
Just stumbling across this tool, which looks promising!
I've been wondering:
- Is there an option to display the list of available datasets (and their available languages) on OPUS? That'd b…
-
Hi,
I have been confused for several days.
I followed the steps of `PBSMT/run.sh` to do my work, and I think the most important step is "Running MUSE to generate cross-lingual embeddings". I aligned t…
-
Hello,
I hope you can help me understand some part of the rescaling logic.
In the `score` method, rescaling is done using:
https://github.com/Tiiiger/bert_score/blob/3a974d46e484f892fd88ad93eff85…
-
Hi,
Is there a way to query the data in such a way that I could get all available corpora that have a certain language? For example, right now I would need to create a monolingual aggregated corpu…
-
I'm going to implement translation using monolingual corpora. I'm trying to use MASS-unsupNMT for my translation implementation. I ran MASS-unsupNMT on the English-French dataset. Now I …
-
## Abstract (Summary) 🕵🏻‍♂️
In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as a single language model pre-trained from monolingual corpora in 104 languages, is **…
-
Thank you very much for this dataset! While exploring it, I noticed that quite a few of the entries in the monolingual dataset actually have code-switching present. For example:
```
jimharper…
-
Hi,
I am new to the NLP community and thus I don't know if this is a known issue.
I think that Google Scholar misses a few citations to my work. I don't know if the issue is that they don't parse…
-
I am trying to create a Finnish-language chatbot.
Question: Is it possible to create dialog pairs using OpusTools?
Something like:
`No joo se on totta Mitä on mielessäsi`
Any help, hints etc.. appreciat…
-
Hi, we tried to reproduce back-translation on EN->DE following [_Understanding Back-Translation at Scale_](https://arxiv.org/pdf/1808.09381.pdf), but the BLEU score did not improve much by adding monol…