subword-segmentation Search Results

152 results
for subword-segmentation


marian-nmt/marian-dev #667

marian-decoder stops on line without words

### Bug description When a line that starts with too many encoded apostrophes (i.e. &apos;) is passed as input, marian-decoder stops on it, ignoring the rest of the input. For example, giving it …

jelmervdl updated 4 years ago
5
marian-nmt/marian-dev #658

[Question] Is the sentencepiece alpha in Marian CLI the one …

Is the `--sentencepiece-alphas` in Marian CLI the same as the alpha on https://github.com/google/sentencepiece/blob/master/src/bpe_model.h#L43 to support BPE dropout when called at https://github.com/…

alvations updated 4 years ago
9
google/sentencepiece #399

Access to trained models' parameters experimented on NMT exp…

Hi, First of all, thank you for your great work and nice library. I was inspired by your work which tries to inform the NMT model "the word composition". I'm currently doing my research on the ef…

JJumSSu updated 4 years ago
1
google/sentencepiece #371

Subword regularization on BPE models

As described by @eric-haibin-lin in https://github.com/google/sentencepiece/issues/335 it is currently not possible to use `SampleEncodeAsPieces`, `SampleEncodeAs{Pieces,Ids}` on a BPE model (displays…

nicolaspanel updated 4 years ago
13
flairNLP/flair #779

BPE Embeddings

Hi I want to test Flair (and also Bert and ELMo) embeddings for NMT. I currently use SentencePiece to segment my corpus as it significantly provides best performances over other methods I can…

valentinmace updated 4 years ago
8
google/sentencepiece #461

Query in subword regularization, in readme.

Following line is mentioned at the beginning of Subword regularization in README.md. >To enable subword regularization, you would like to integrate SentencePiece library (C++/Python) into the NMT …

rossbrown9879 updated 4 years ago
1
lvapeab/nmt-keras #130

Regd Rare Words/OOV Tokens ?

Need a few clarifications regarding how to handle rare words and heuristics in the [configuration](https://github.com/lvapeab/nmt-keras/blob/master/config.py#L70) - How does heuristic 2 handle case…

VP007-py updated 4 years ago
9
OpenNMT/OpenNMT-py #1625

Segmentation fault after applying subwords methods

You need to apply some subwords methods. Have look [here](http://forum.opennmt.net/t/using-sentencepiece-byte-pair-encoding-on-model/3027). _Originally posted by @francoishernandez in https://githu…

aastha19 updated 5 years ago
1
PaddlePaddle/ERNIE #397

ERNIE tiny finetune_classifier error

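I downloaded the ERNIE tiny config and, when launching finetune_classifier, followed the README instructions: # 1 In the online GPU container environment: ``` The ERNIE tiny model uses subword-granularity input, so word segmentation must be added to data preprocessing and sentence piece used for tokenization. segmentation and tokenization …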

leyiwang updated 4 years ago
4
VKCOM/YouTokenToMe #37

BPE-Dropout support

> It stochastically > corrupts the segmentation procedure of BPE, > which leads to producing multiple segmentations within the same fixed BPE framework. > Using BPE-dropout during training and th…

kalaidin updated 4 years ago
1
