-
How can we add the abkhazian language?
There are a few resources like https://gitlab.com/Bachstelze/alp and https://github.com/danielinux7/Multilingual-Parallel-Corpus .
Can we port those models to …
-
I am interested in integrating InstructIR into MTEB. Currently, the dataset for InstructIR is only available on GitHub (https://github.com/kaistAI/InstructIR) and not on Hugging Face. Could you advise…
-
### A URL for this dataset
https://fedora.clarin-d.uni-saarland.de/rsc/
### Dataset description
> The Royal Society Corpus (RSC) is based on the first two centuries of the [Philosophical Transactio…
-
A few very common words in Portuguese are not included in the stopword list. At least `é`, `ser` and `ter` should be included for consistency, since these are verbs whose other inflected forms are alr…
-
Hello,
I've just downloaded IDyOM onto my laptop to use as part of my master's thesis, however after following the database management procedure as seen in the wiki section, IDyOM seems ubale to re…
-
Post questions here for one or both of week's orienting readings:
Evans, James and Pedro Aceves. 2016. [“Machine Translation: Mining Text for Social Theory”](https://www.annualreviews.org/doi/abs/…
-
-
Hi @vivekv73y and Sowmya :wave:
Please read the key below to understand _how_ to respond to the feedback provided. Some items will require you to take action while others only need some tho…
-
### Dataset name
NorBench
### Dataset link
https://github.com/ltgoslo/norbench
### Dataset languages
- [ ] Danish
- [ ] Swedish
- [X] Norwegian (Bokmål or Nynorsk)
- [ ] Icelandic
- [ ] Faroese
-…
-
I'm trying to process a set of 10,000 files using JStylo (with 12 possible authors), and it takes an extremely long time to generate the features (using the WriteLimits set). I've had it running for o…