-
## Original Task
Citing from the original course task:
> Training a strong Hebrew Sentence Encoder from a pretrained Decoder While recent years
have brought many additions to the open-source set …
-
RTL languages shouldn't affect training, but doing so will require some work on the Firefox side. This meta bug tracks any work that is needed. We should complete a subset of the easier to segment LTR…
-
A long time ago I've built a program to convert the Hebrew Wiktionary [dumps](https://dumps.wikimedia.org/hewiktionary/latest/) to very structure json file.
this is the project:
https://github.com…
-
I would like to ask for @alonbl feedback/greenlight before preparing my PR. I am interested in addressing several issues I see in the current Hebrew transliteration:
1. 05ef (triple yod)- can now be …
-
In order to apply LLM2Vec to DictaLM we need:
- [x] Identify base model - https://huggingface.co/collections/dicta-il/dicta-lm-20-collection-661bbda397df671e4a430c27
- [x] Enable bi-directional at…
-
## Goal
As a developer, I want to acquire the data from MARBLE to use their mappings from words to lexical domains. These mappings indicate to which semantic domain a verse (or word or phrase) belong…
-
Document Embeddings does not allow local models and therefore creates a privacy hazard.
As I don't assume that this was done due to malicious design by the Bioinformatics Lab at University of Ljubl…
-
Create a short document that specifies datasets the projects wishes to create, extend and support.
This includes:
- Expected Licence
- Data format
- Meta data format and attributes
- Usability …
-
Hi there 👋
Let's translate the course to `Hebrew` so that the whole community can benefit from this resource 🌎!
Below are the chapters and files that need translating - let us know here if you'd…
-
Hi! Love the project!! 🤩
Are there any plans or thoughts about implementing a mechanism to handle (convert?) existing, old, non-neutral text?
I'm a software engineer with background in ML and I'm c…