-
I still have the problem, also mentioned a lot on the Hugging Face challenge Discord earlier, that pyctcdecode doesn't really like putting spaces in the transcription, e.g.:
`hetcontenenschi…
iskaj · updated 9 months ago
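A common cause of missing spaces is the vocabulary's word-delimiter token (wav2vec2-style vocabs use `|`) not being mapped back to a real space in the labels handed to the decoder. A minimal greedy-CTC sketch of the effect — toy vocab and frames, illustrative only, not the pyctcdecode API:

```python
def ctc_greedy_decode(frame_ids, labels, blank_id=0):
    """Collapse repeated frame ids, drop blanks, then map ids to characters."""
    out = []
    prev = None
    for i in frame_ids:
        if i != prev and i != blank_id:
            out.append(labels[i])
        prev = i
    return "".join(out)

# Toy vocab: index 0 is the CTC blank, "|" is the word delimiter.
raw_labels = ["<pad>", "h", "e", "t", "|"]
frames = [1, 1, 0, 2, 3, 0, 4, 4, 1, 2]

# With the delimiter left as "|", no spaces appear in the transcription:
print(ctc_greedy_decode(frames, raw_labels))    # "het|he"

# Mapping "|" -> " " before building the decoder restores the spaces:
fixed_labels = [" " if l == "|" else l for l in raw_labels]
print(ctc_greedy_decode(frames, fixed_labels))  # "het he"
```

The same delimiter-to-space remapping applies to the label list passed to a beam-search decoder.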
-
Three types of docid representations are introduced in the paper "Transformer Memory as a Differentiable Search Index," namely, `Unstructured Atomic Identifiers`, `Naively Structured String Identifier…
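For illustration, the three representations can be sketched roughly as follows — toy values and formatting, not code from the paper:

```python
doc_id = 42915

# 1. Unstructured atomic identifier: one dedicated vocabulary token per
#    document, emitted in a single decoding step.
atomic = f"<doc_{doc_id}>"

# 2. Naively structured string identifier: the integer treated as a digit
#    string that the decoder emits token by token.
naive = " ".join(str(doc_id))  # "4 2 9 1 5"

# 3. Semantically structured identifier: a path through a cluster hierarchy,
#    so semantically similar documents share docid prefixes (toy example).
cluster, subcluster, leaf = 3, 7, 15
semantic = f"{cluster}-{subcluster}-{leaf}"

print(atomic, naive, semantic)
```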
-
Hi, I tested GPT2 on the Weibo news summarization data provided at https://github.com/YunwenTechnology/Unilm (randomly picking 10,000 articles as the training set and 1,000 as the test set) and found that ROUGE-1 is below 20%, while the result reported for UniLM is 40.58%. What might be the reason? Is GPT2 simply not good at this task?
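As a sanity check, it can help to compute ROUGE-1 by hand and rule out tokenization mismatches. A minimal unigram-overlap F1 sketch (Chinese summaries are usually scored at the character level; toolkits differ in details such as aggregation, so treat this as illustrative):

```python
from collections import Counter

def rouge_1_f(candidate_tokens, reference_tokens):
    """Unigram-overlap ROUGE-1 F1 (one common convention; toolkits differ)."""
    cand, ref = Counter(candidate_tokens), Counter(reference_tokens)
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

# Character-level scoring for Chinese text:
cand = list("新闻摘要测试")
ref  = list("新闻摘要生成")
print(round(rouge_1_f(cand, ref), 3))  # 0.667
```

A large gap between two systems can come from scoring word-level versus character-level, so it is worth confirming both numbers were produced the same way.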
-
According to the paper, the latent variable is introduced to enable better one-to-many generation. But a seq2seq model can already generate multiple outputs by sampling (rather than deterministic beam search), so in principle seq2seq by itself already has one-to-many generation capability, and the paper's claim that a plain seq2seq cannot do one-to-many generation well does not seem to hold.
So what is the point of the latent variable? Also, I don't see a regularization term on the latent variable, so how is its distribution prevented from degenerating into a one-hot distribution (i.e., becoming only…
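For reference, in a standard (C)VAE the regularizer in question is the KL term of the ELBO, KL(q(z|x) ‖ p(z)), which penalizes a posterior that collapses toward a point mass. For a diagonal-Gaussian posterior against a standard-normal prior it has a closed form, sketched below (illustrative, not code from the paper being discussed):

```python
import math

def gaussian_kl(mu, logvar):
    """Closed-form KL( N(mu, diag(exp(logvar))) || N(0, I) )."""
    return sum(0.5 * (math.exp(lv) + m * m - 1.0 - lv)
               for m, lv in zip(mu, logvar))

# Posterior equal to the prior: zero penalty.
print(gaussian_kl([0.0], [0.0]))   # 0.0

# Posterior shrinking toward a point mass (logvar -> -inf): the -logvar
# term blows up, so degenerate, near-deterministic posteriors are penalized.
print(gaussian_kl([0.0], [-10.0]))
```

If a model truly has no such term in its objective, the question of what keeps the latent distribution from degenerating is a fair one.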
-
### Branch/Tag/Commit
main
### Docker Image Version
nvcr.io/nvidia/pytorch:22.09-py3
### GPU name
V100
### CUDA Driver
Driver Version: 450.80.02 CUDA Version: 11.8
### Reproduced Steps
```…
-
### Your current environment
v0.5.2. The vLLM environment is not the issue, so I will skip the collection step.
### 🐛 Describe the bug
I am running benchmark tests and noticed one potential problem. …
-
I'm getting this traceback when running CLIP captioning:
```
Traceback (most recent call last):
File "C:\Automatic1111\extensions\sd_smartprocess\smartprocess.py", line 360, in …
-
Sorry for bothering you. I just want to say your implementation is really nice, and I want to learn NLP too, but the dataset you provided via the GitHub link doesn't look like the file that you locat…
-
Hello, thank you for your contribution. However, I notice that all mBART models exceed 2 GB. Do you have any plan to fix this issue?
-
Hello! I'm trying to use PrefixTuning with the T5 model. After reading the source code in seq2seq, I gather that, generally speaking, the prefix is added to the BART model via the parameter _past_key_values…
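Conceptually, a prefix is just extra key/value pairs prepended ahead of the real ones before attention is computed, which is why the `past_key_values` machinery can carry it. A pure-Python toy sketch of that mechanism (single head, tiny dimensions; names and shapes are illustrative, not the actual library API):

```python
import math

def attention(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    m = max(scores)                       # numerically stable softmax
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# Keys/values computed from the actual input tokens:
keys   = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0]]

# Prefix tuning prepends trainable key/value pairs, so every position can
# attend to the prefix while the frozen model's weights stay untouched:
prefix_k = [[0.5, 0.5]]
prefix_v = [[9.0, 9.0]]

query = [1.0, 0.0]
with_prefix    = attention(query, prefix_k + keys, prefix_v + values)
without_prefix = attention(query, keys, values)
print(with_prefix != without_prefix)  # True: the prefix changed the output
```

For an encoder-decoder model like T5 the same idea applies per attention block, which is why adapting a BART-oriented prefix implementation mostly means wiring the prefix tensors into each of T5's attention layers.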