-
Hi,
Is there any script in this repository for further training a pretrained BERT model on unlabeled data?
-
Hi Team,
First of all, kudos on your work!
The KeyBERT model produces near-duplicate keywords such as "Optimize" and "Optimization".
Could you please help me in resolving this issue?
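One possible workaround, sketched here as a post-processing step rather than anything built into KeyBERT: filter the returned keywords with a string-similarity check from the standard library. The function name `drop_near_duplicates` and the `threshold` value are assumptions chosen for illustration, and the right threshold will depend on the data.

```python
from difflib import SequenceMatcher


def drop_near_duplicates(keywords, threshold=0.65):
    """Keep the first keyword of each group of near-identical spellings.

    `threshold` is a hypothetical cut-off on difflib's similarity ratio;
    keywords at or above it (compared case-insensitively against an
    already kept keyword) are treated as duplicates and dropped.
    """
    kept = []
    for word in keywords:
        lowered = word.lower()
        is_duplicate = any(
            SequenceMatcher(None, lowered, k.lower()).ratio() >= threshold
            for k in kept
        )
        if not is_duplicate:
            kept.append(word)
    return kept


# Example keyword list (made up for illustration)
keywords = ["Optimize", "Optimization", "Latency", "Throughput"]
print(drop_near_duplicates(keywords))
```

This only compares surface spellings, so it will not catch synonyms that are spelled differently; stemming or lemmatizing the input text before extraction is another option worth trying.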
-
@QMSS-G5063-2022/teaching_team
SHA: e4784df1f3f35076831f60c1332a5a65d0cf0b56
-
When transcribing a 3-minute audio file with basic parameters and no stem, the resulting .srt file contains only part of the original audio: sometimes it's the start, sometimes the end, and sometimes somet…
-
In this issue you can either:
- **Add papers** that you think are interesting to read and discuss (please stick to the format).
- **Vote**: use :+1: on comments
-
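Are the two .pt files for en2de and de2en here produced by the `Obtain the masks` step? I only get one file per language pair; what do I need to do to get two files? I'd appreciate an answer, thanks~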
![image](https://user-images.githubusercontent.com/105580413/180939459-7134541f-72d6-4fe4-80d2-cd5c00a98ff2.png…
-
I'm interested in contributing scripts which allow users to incorporate data augmentation techniques directly without using external libraries.
I can start with stuff like synonym replacement, random…
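As a rough illustration of what a dependency-free synonym replacement script could look like, here is a minimal sketch using only the standard library. The `SYNONYMS` table, the function name `synonym_replace`, and its parameters are all hypothetical; a real contribution would need a proper synonym source and tokenization.

```python
import random

# Hypothetical toy synonym table; a real script would load a larger resource.
SYNONYMS = {
    "quick": ["fast", "rapid"],
    "happy": ["glad", "joyful"],
}


def synonym_replace(sentence, n=1, rng=None):
    """Replace up to `n` whitespace-separated tokens that have known synonyms."""
    rng = rng or random.Random()
    tokens = sentence.split()
    # Positions of tokens that appear in the synonym table
    candidates = [i for i, tok in enumerate(tokens) if tok.lower() in SYNONYMS]
    rng.shuffle(candidates)
    for i in candidates[:n]:
        tokens[i] = rng.choice(SYNONYMS[tokens[i].lower()])
    return " ".join(tokens)


# Passing a seeded Random makes the augmentation reproducible
print(synonym_replace("the quick dog is happy", n=2, rng=random.Random(0)))
```

Accepting the random generator as a parameter keeps the function deterministic in tests while still allowing fresh randomness in normal use.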
-
Hi, I trained a **langid model** with my dataset following these [steps](https://stanfordnlp.github.io/stanza/langid.html#training-your-own-model) and ending with this method:
```shell
python -m st…
```
-
The file `Irish_Data > processed_ga_files_for_BERT_runs > train.txt` mentioned in issue #32 is severely out of date, and there is no documentation of which settings were used. Please update and add a readme.…
-
This thread is a master thread for collecting problems and reports related to incorrect and/or problematic predictions of the pre-trained models.
## Why a master thread instead of separate issues?
…