monolingual-corpora Search Results

133 results
for monolingual-corpora

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

HedvigS/gramfinder-typology-of-terminology-and-searching-OCRed-grammars #4

distributional semantics

@d97hah suggested that we could also use Latent Semantic Analysis or Random Indexing to compare similarity between texts.

HedvigS updated 8 years ago
12
PaddlePaddle/PaddleNLP #5768

mBART使用时翻译不完整 Translation using mBART is not entirely done.

### 问题描述 conda env: ``` paddlenlp 2.5.2 pypi_0 pypi paddlepaddle-gpu 2.3.2 py37_gpu_cuda10.2_many_linux https://mirrors.tuna.tsinghua.…

holyseven updated 1 year ago
4
DASISH/md-mapping #8

Default mapping for title in cmdi-mapping have strange value…

For the title field we get strange values from CMDI We get the title "mar24_09" instead of "Swedish Goteborg Corpus" for: http://ckan.dasish.eu/ckan/dataset/68de715e04f6ac2a4faf8d7e5a017174b4ea096d368…

borsna updated 10 years ago
4
malteos/finetune-evaluation-harness #10

Multilingual tasks

## TODO Languages: Top languages with at least three tasks per language: - [ ] Spanish - [x] https://huggingface.co/datasets/squad_es - [ ] https://huggingface.co/datasets/ehealth_kd …

malteos updated 1 year ago
2
howardyclo/papernotes #7

Unsupervised Pretraining for Sequence to Sequence Learning

### Metadata Authors: Prajit Ramachandran, Peter J. Liu and Quoc V. Le Organization: Google Brain Conference: EMNLP 2017 Link: https://goo.gl/n2cKG9

howardyclo updated 5 years ago
1
masakhane-io/masakhane-reading-group #1

Papers Voting

In this issue you can either: - Add papers that you think are interesting to read and discuss (please stick to the format). - vote: should be done using :+1: on comments Example: https://githu…

jaderabbit updated 4 years ago
22
howardyclo/papernotes #6

Cold Fusion: Training Seq2Seq Models Together with Language …

### Metadata - Authors: Anuroop Sriram, Heewoo Jun, Sanjeev Satheesh and Adam Coates - Organization: Baidu Research, Sunnyvale, CA, USA. - Release Date: 2017 on Arxiv - Link: https://arxiv.org/pdf…

howardyclo updated 5 years ago
6
prajdabre/yanmtt #28

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in …

Hello, I am trying to further-pretrain the official BARThez model (French BART) checkpoint available at moussaKam/barthez with the denoising task. The command used was the following : ``` ex…

oliviersalaun updated 2 years ago
4
DigitalPhonetics/IMS-Toucan #195

Toucan Questions

I have a few questions that I hope will not much of your time. - Is there support for IPA or some other phonetic pronunciation for words that are incorrectly pronounced or that you have a specific …

MrEdwards007 updated 3 days ago
36
apertium/apertium-init #51

Add CI configs

sushain97 updated 3 years ago
27

上一页 1...1 2 3 4 5 6 7...14 下一页

133 results for monolingual-corpora

133 results
for monolingual-corpora