-
Turkish LLM Fine-Tune Datasets:
- https://huggingface.co/datasets/umarigan/openhermespreference_tr
- https://huggingface.co/datasets/umarigan/openhermes_tr
- https://huggingface.co/datasets/umari…
-
Here's a thread to add more languages to lmqg as well as https://autoqg.net/ . If you would like to contribute, please comment here with a potential QA dataset we can use to train QAG model on the lan…
-
Hi @savasy, thank's for your great work on those tasks, do you have some plans for fine tuning bert on the [POS](https://github.com/mrm8488/shared_colab_notebooks/blob/master/fine_tuning_on_UDPOS_Engl…
-
Hey, I tried to train [DistilBERTurk](https://huggingface.co/dbmdz/distilbert-base-turkish-cased) model for question answering by using [run_squad.py](https://github.com/huggingface/transformers/blob/…
-
Hi
I am trying to run a code with wikipedia of config 20200501.es, getting:
Traceback (most recent call last):
File "run_mlm_t5.py", line 608, in
main()
File "run_mlm_t5.py", line 359,…
-
I understand that the sentences in IMST originate in the [METU Turkish Corpus (MTC)](https://ii.metu.edu.tr/metu-corpora-research-group) and while the original corpus contains whole documents, in the …
-
Hi there 👋
Let's translate the course to Turkish so that the whole community can benefit from this resource 🌎!
Below are the chapters and files that need translating - let us know here if yo…
-
The readme makes it sound very simple: "Replace bert with xphonebert"
Looking a bit closer looks like it's quite a feat to make StyleTTS2 talk in non-english languages (https://github.com/yl4579/Styl…
-
Thank you for providing these excellent datasets. I am currently using the "Vitamin and Supplements", "Beyaz Perde All Movies", and "Beyaz Perde Best Movies" datasets from this repository for a sentim…
-
In this issue you can either:
- Add papers that you think are interesting to read and discuss (please stick to the format).
- vote: should be done using :+1: on comments
Example: https://githu…