-
I am hoping to measure some java or python benchmarks with fuzzbench. So I migrate a java library (java-xmlbuilder) from OSS-Fuzz by copying Dockerfile, build.sh and XmlBuilderFuzzer.java. Then I try…
-
INFO:mteb.evaluation.MTEB:
## Evaluating 1 tasks:
─────────────────────────────── Selected tasks ────────────────────────────────
Retrieval
- MSMARCOv2, s2p
INFO:mteb.evaluation.MTEB:
…
-
There should be an optional header section, that (potentially) holds the following information:
+ which version of annatto the workflow was successfully run with. This would require a write-back to t…
-
Could you clear to what should the Step 4. Format to Simpler Json Files do .
my case : i have my own data-set . i am trying to apply these steps on it. Now I performed to Step 3. Sentence Splittin…
-
A la https://github.com/tensorflow/datasets/issues/120, it would be helpful to have an estimate of how large each dataset is before downloading. Ideally, a breakdown by feature would be nice.
Curr…
-
Hi
I am trying the example.py, however got the error below. Is this a lib issue?
`Traceback (most recent call last):
File "example.py", line 94, in
ex.generate_corpus(article_links)
File "/…
jdxyw updated
7 years ago
-
* Find the Persian news Corpus
* Define a corpus file standard. (To be discussed with other Corpus builders) - Most probably one sentence in each line
* Upload the zipped version of the corpu…
-
When attempting to load the DGS Corpus's default configuration on either my own workstation or in Colab, I run out of memory and crash.
Here are some screenshots
![image](https://github.com/sign-…
-
* Find the Persian Wiki Dump
* Cleanse it ( Remove the XML/HTML tags)
* Define a corpus file standard. (To be discussed with other Corpus builders) - Most probably one sentence in each line …
-
I am getting an error while running vocab builder.
Code and files used for vocab bulider:
!git clone https://github.com/kwonmha/bert-vocab-builder.git
!wget https://github.com/LydiaXiaohongLi/Al…