-
* Check out https://github.com/isamplesorg/navocab
* Run `vocab uijson` as part of the build
* Need to run one for each type of vocabulary
* Make sure that we include `-e` when we run the materials…
-
Follow on from https://github.com/IATI/D-Portal/issues/239 (which is nearly _10_ years old!)
- The following Regions are on the default ``recipient-region`` vocabulary: https://codelists.codeforiat…
-
Distinct from #4, there are other issues that are more complex and might not make the first cut of problems to solve.
This list is from the community. See the [post](https://groups.google.com/a/cla…
-
Currently we have the fastText subword embeddings available, but they are not included in the pre-trained word embeddings tutorial, which is often the first tutorial people check out. It would be great to…
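
For context on what "subword" means here, below is a minimal sketch of how fastText-style character n-grams are usually derived from a word: the word is wrapped in `<` and `>` boundary markers and every substring of length 3 to 6 is collected. The function name `char_ngrams` and its parameters are hypothetical and are not this library's API. The real model also keeps the whole word as a separate unit, which this sketch leaves out.

```python
def char_ngrams(word, min_n=3, max_n=6):
    """Return fastText-style character n-grams for `word` (illustrative helper)."""
    # Boundary markers let prefixes and suffixes be told apart from inner n-grams.
    token = f"<{word}>"
    grams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n across the marked-up token.
        for i in range(len(token) - n + 1):
            grams.append(token[i:i + n])
    return grams


if __name__ == "__main__":
    # Trigrams only, for the word "where".
    print(char_ngrams("where", min_n=3, max_n=3))
```

A word vector is then typically formed by combining the vectors of these n-grams, which is why out-of-vocabulary words can still get an embedding.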
-
I was wondering if some fields in the datasets could be "search" links? For example, you have the list of authors in a dataset, and each author's name is a link to search all datasets of the author's…
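
To make the idea concrete, here is a rough sketch of how such a link could be built from an author name. Everything in it is an assumption for illustration: the `author_search_link` helper, the `https://example.org/search` endpoint, and the `author` query parameter are all hypothetical, and the real search URL scheme may differ.

```python
from urllib.parse import urlencode


def author_search_link(base_url, author):
    """Build a search URL for one author (hypothetical endpoint and parameter)."""
    # urlencode escapes spaces and special characters in the name.
    query = urlencode({"author": author})
    return f"{base_url}?{query}"


if __name__ == "__main__":
    # Placeholder base URL; replace with the site's actual search page.
    print(author_search_link("https://example.org/search", "Jane Doe"))
```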
-
```
Hi,
I'm trying to do keyphrase extraction for my documents against the LCSH SKOS
dictionary (about 400 MB). The problem I'm experiencing is an out-of-memory error... I
had a look at the code and as far as I …
-
Hi,
I'm interested in contributing to implementing the BPE tokenizer.
Since we're using gpt-2 encoding (as shown in the preprocessors), I think we can use the original implementation of `tiktoke…
-
... for when the annotation provides some feature or functionality to the target resource(s), either directly or by using the body resource(s).
For example, a client would benefit from knowing that…