-
Recent discussions have suggested that the UD documentation could benefit from a more detailed definition of "word". We can use this issue to discuss the existing definition and possible improvements.…
-
Dataloader name: `m3exam/m3exam.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?m3exam
| Dataset| m3exam |
|-------------|---|
| Description | M3Exam is a novel benchmar…
-
I recently got this issued when testing any kind of gemini model. I 'm now using Maker Suite API. And in Google AI Studio from Maker Suite some time I also see this problem but it not so often as the …
-
**Issue by [monday0rsunday](https://github.com/monday0rsunday)**
_Fri Dec 5 07:09:24 2014_
_Originally opened as https://github.com/codelucas/newspaper/issues/93_
----
I try to use newspaper for v…
-
I usually extend vocab to make the model closer to Vietnames language. The code is below. However, it seems that the tokenizer of LLaMA-3 is no longer work with SentencePiece. Even LlamaTokenizer is n…
-
Hi, thanks for the wonderful library. The Italian (also Portuguese) languages hang my app that uses nspell. Is there anything new on that front (there is a closed issue on Italian dic). Will that prob…
-
@sergiolaverde0
I've opened this issue, so we can communicate about the language install feature. I'll close this issue when the feature is finished.
I've tried to install korean and list insta…
-
What can we improve on AI side?
-
The example from https://github.com/microsoft/DeepSpeed-MII/blob/main/README.md:
```
import mii
mii_configs = {"tensor_parallel": 1, "dtype": "fp16"}
mii.deploy(task="text-generation",
…
-
Hi undertheseanlp,
Your work and passion are great. I'm your big fan and I have been using underthesea package a lot of my times when doing a NLP project.
Recently, I have a bug when using the `word…