-
Hi @tenkus47,
Is it possible to have more details in README in order to try this repo locally? (ie. database, env variables example and other necessary stuff) Thank you and congrats,
Lionel
-
### TensorFlow version
2.17.0
### OS platform and distribution
Google Colab
### Current behavior?
Hi, everyone.
I am practicing implementing a Transformer model that machine translat…
-
Got This Below error in Notebook 5_2_munging_frankenstein.ipynb
Please hep on this
LookupError Traceback (most recent call last)
in ()
----> 1 tokenizer = nltk.dat…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
### Question
Hi,
I'm training a SequenceTagger to do the NER task using my customed dataset with the customed features. After training was done, I got a file named test.tsv which is the prediction…
-
Right now the different options for the available segmenters can only be found in the code:
```
whitespace=nltk.WhitespaceTokenizer
linebreaks=LinebreakTokenizer
blanklines=nltk.BlanklineTokeniz…
-
Hi @goodbai-nlp ,
This is great work! Thanks for making your model available on huggingface. Makes things easier.
However, I am not sure I follow the instructions for generating AMRs. I want to…
-
Currently we have no good way to train a `vocab.json` and `merges.txt` file for the `BytePairTokenizer`. This is the vocabulary format used by gpt-2, RoBERTa, and DeBERTa v1. It would be nice to allow…
-
Hi,
I am trying to use [boun-tabi-LMG/TURNA](https://huggingface.co/boun-tabi-LMG/TURNA), a Turkish T5 model, with sentence-transformers as it has been specifically pre-trained for Turkish.
Whil…
-
Thanks for your awesome work. I have some questions about the concept of initial tokens and the implementation of learnable initial tokens.
1. In Fig.2, you reach the conclusion that existing LLMs …