corpora Search Results - Githubissues

1000+ results
for corpora

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

zaidalyafeai/Arabert #1

Corpora

Here we combine all the datasets we can collect - [OSCAR's CommonCrawl Dataset](https://traces1.inria.fr/oscar/) - [Arabic BERT Corpus](https://www.kaggle.com/abedkhooli/arabic-bert-corpus) - [Hi…

zaidalyafeai updated 4 years ago
1
centerforaisafety/wmdp #15

Unable to reproduce results

Hi, I was tried to run the experiments on `run_rmu_zephyr.ipynb`, but for the evaluation, I was unable to use the same batch size as in the original code due to limited GPU memory. I was running th…

hnanhtuan updated 2 weeks ago
1
PolMine/RcppCWB #94

Build failure of 0.6.5: `RcppCWB.so, 6): Symbol not found: _…

@PolMine For some reason this version fails for me: ``` ---> Building R-RcppCWB xinstall: mkdir /opt/local/var/macports/build/_opt_PPCSnowLeopardPorts_R_R-RcppCWB/R-RcppCWB/work/build Executing: …

barracuda156 updated 4 days ago
2
EhimeNLP/MEAT #2

Training data not accessible

The link shared in footnote : http://www.statmt.org/wmt20/quality-estimation-task.html for downloading the "publicly available bilingual corpora that were used to train the target machine translation…

BarahFazili updated 2 weeks ago
1
sillsdev/silnlp #466

Extract functions not extracting properly

bulk_extract_corpora and extract_corpora do not remove all lemmas and strong numbers from translations such as hbo_uhb and others from Door43

SirBac0n updated 1 month ago
4
RichardLitt/language-niche-research #1

Subtitle corpora

To do: - [x] Get relevant n-grams of the corpora. - [ ] Compare different n-grams for co-occurrence in both English and US corpora. - [ ] Check out surprisal tool - used to be in NLTK. Find out why …

RichardLitt updated 9 years ago
4
sillsdev/serval #476

Keyterm data always gets added - and then we always train

Should we add a separate flag for "only pretranslate"? Or should we automagically work if there is no matching corpora, we don't include the keyterms?

johnml1135 updated 4 weeks ago
5
arg-tech/corpora_to_csv #1

corpora_to_csv

debelatesfaye updated 7 months ago
1
own-pt/openWordnet-PT #118

parallel corpora

http://opus.lingfil.uu.se/ http://www.statmt.org/europarl/ How can we use them?

arademaker updated 6 years ago
1
clarin-eric/resource-families-issues #178

CLIPS : corpora e lessici di italiano parlato e scritto

http://hdl.handle.net/11372/LRT-865 - [ ] Unclear annotation - [ ] Missing licence

jakoble updated 4 weeks ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for corpora

1000+ results
for corpora