-
@d97hah suggested that we could also use Latent Semantic Analysis or Random Indexing to compare similarity between texts.
-
### 问题描述
conda env:
```
paddlenlp 2.5.2 pypi_0 pypi
paddlepaddle-gpu 2.3.2 py37_gpu_cuda10.2_many_linux https://mirrors.tuna.tsinghua.…
-
For the title field we get strange values from CMDI
We get the title "mar24_09" instead of "Swedish Goteborg Corpus" for:
http://ckan.dasish.eu/ckan/dataset/68de715e04f6ac2a4faf8d7e5a017174b4ea096d368…
-
## TODO Languages:
Top languages with at least three tasks per language:
- [ ] Spanish
- [x] https://huggingface.co/datasets/squad_es
- [ ] https://huggingface.co/datasets/ehealth_kd
…
-
### Metadata
Authors: Prajit Ramachandran, Peter J. Liu and Quoc V. Le
Organization: Google Brain
Conference: EMNLP 2017
Link: https://goo.gl/n2cKG9
-
In this issue you can either:
- Add papers that you think are interesting to read and discuss (please stick to the format).
- vote: should be done using :+1: on comments
Example: https://githu…
-
### Metadata
- Authors: Anuroop Sriram, Heewoo Jun, Sanjeev Satheesh and Adam Coates
- Organization: Baidu Research, Sunnyvale, CA, USA.
- Release Date: 2017 on Arxiv
- Link: https://arxiv.org/pdf…
-
Hello,
I am trying to further-pretrain the official BARThez model (French BART) checkpoint available at moussaKam/barthez with the denoising task.
The command used was the following :
```
ex…
-
I have a few questions that I hope will not much of your time.
- Is there support for IPA or some other phonetic pronunciation for words that are incorrectly pronounced or that you have a specific …
-