-
I have a corpus of ~75,000 abstracts that I want to make a KG out of using OntoGPT. After 4 hours, it only got through 50 documents -- not super promising!
I took a look through the docs to see if …
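For scale, here is a back-of-the-envelope extrapolation of the numbers above. It assumes throughput stays constant and documents are processed one at a time; both are assumptions, not measurements.

```python
# Figures taken from the report above.
docs_done = 50
hours_elapsed = 4
corpus_size = 75_000

# Assumption: the observed rate holds for the whole corpus.
docs_per_hour = docs_done / hours_elapsed
total_hours = corpus_size / docs_per_hour
total_days = total_hours / 24

print(f"{docs_per_hour} docs/hour, about {total_days:.0f} days for the full corpus")
```

At that rate the full corpus would take roughly 250 days, so some form of batching or parallelism would be needed.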
-
https://github.com/project-anuvaad/anuvaad-parallel-corpus
-
https://camel.abudhabi.nyu.edu/arabacquis/
https://camel.abudhabi.nyu.edu/madar-parallel-corpus/
-
The current code in the tutorial for aggregating coherence at various numbers of topics is very memory-intensive and can cause Python to crash. This is because it aggregates all of the LDA models in t…
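One possible way around this, sketched under assumptions: keep only the coherence score for each topic count and discard each model before fitting the next. The function name and the `train_model` / `score_model` callables are hypothetical placeholders, not the tutorial's code.

```python
import gc
from typing import Callable, Dict, Iterable, TypeVar

Model = TypeVar("Model")


def coherence_by_num_topics(
    topic_counts: Iterable[int],
    train_model: Callable[[int], Model],    # hypothetical: fits one model for k topics
    score_model: Callable[[Model], float],  # hypothetical: returns that model's coherence
) -> Dict[int, float]:
    """Return {num_topics: coherence}, holding at most one model at a time."""
    scores: Dict[int, float] = {}
    for k in topic_counts:
        model = train_model(k)          # fit a single model for this topic count
        scores[k] = score_model(model)  # keep only the scalar score
        del model                       # drop the reference before the next fit
        gc.collect()                    # optional: prompt collection of large objects
    return scores
```

If the tutorial uses gensim (an assumption), `train_model` could wrap `LdaModel(corpus=..., id2word=..., num_topics=k)` and `score_model` could wrap `CoherenceModel(model=..., texts=..., dictionary=..., coherence="c_v").get_coherence()`. The best model can then be refitted once the preferred topic count is known.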
-
Some repositories use an internal sample identifier that is not mapped to the referencing PIDs.
Each repository case is specific:
ACDH-CH:
* no mapping between the URL record ID and the HDL for the OAI service
…
-
Hi,
I am a beginner in machine translation, and I would like to ask how to use my own parallel corpus for training and translation. What are the specific commands and steps for training and translation? Is th…
-
Is this even possible? With minimal a priori knowledge, can we separate sentences in all languages and all scripts well enough that, when combined with a Gale-Church sentence aligner, we can get de…
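As a rough baseline for what "minimal a priori knowledge" could mean, here is a sketch that splits on a small set of sentence-final punctuation marks from several scripts. The `TERMINATORS` set and the `split_sentences` name are illustrative assumptions, and the list is deliberately incomplete.

```python
import re
from typing import List

# Hypothetical, incomplete set of sentence-final marks:
# Latin, CJK fullwidth, Devanagari danda, Urdu full stop, Arabic question mark.
TERMINATORS = ".!?。！？।۔؟"

# Capture the run of terminators so it can be re-attached to its sentence.
_SPLIT_RE = re.compile(rf"([{re.escape(TERMINATORS)}]+)\s*")


def split_sentences(text: str) -> List[str]:
    """Split text after any run of known terminators; no language detection."""
    pieces = _SPLIT_RE.split(text)
    sentences: List[str] = []
    # pieces alternates: body, terminator, body, terminator, ..., trailing body
    for i in range(0, len(pieces) - 1, 2):
        sentence = (pieces[i] + pieces[i + 1]).strip()
        if sentence:
            sentences.append(sentence)
    tail = pieces[-1].strip()  # text after the last terminator, if any
    if tail:
        sentences.append(tail)
    return sentences
```

This over-splits on abbreviations and decimal numbers, and it does nothing for scripts that do not mark sentence ends with punctuation, such as Thai. It is only a starting point before any length-based alignment.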
-
Since BERT is based on the Transformer architecture, is there any reason to use BERT embeddings for an NMT model that is already a transformer?
My take is that BERT embeddings are trained on a very lar…
-
This issue was created automatically with bugzilla2github
# Bugzilla Bug 2687
Date: 2020-10-06T14:10:09+02:00
From: Børre Gaup <>
To: Chiara Argese <>
CC: borre.gaup, lene.antonsen, sm…
-
Like Korp, Karp and Lärka, add a link to these articles?
- [*SVALA: Annotation of Second-Language Learner Text Based on Mostly Automatic Alignment of Parallel Corpora*](https://www.ep.liu.se/ecp/ar…