unilm Search Results - Githubissues

757 results
for unilm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/unilm #1264

Beit3 for image captioning

I want to use Beit3 using weight beit3_large_patch16_480_coco_captioning for image captioning on my custom images. I have download the weights and .spm file and using the following command: !python -…

rohitpaul23 updated 10 months ago
2
microsoft/unilm #1492

Cannt download the BEATs_iter3+ (AS2M) (cpt2)

This XML file does not appear to have any style information associated with it. The document tree is shown below. AuthenticationFailed Server failed to authenticate the request. Make sure the valu…

aleeyang updated 1 month ago
10
huggingface/transformers #16410

Edgeformer

# 🌟 New model addition ## Model description [EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation](https://arxiv.org/abs/2202.07959) EdgeFormer: A Parameter-Efficien…

patrickvonplaten updated 1 year ago
17
liucongg/GPT2-NewsTitle #21

微博新闻摘要数据测试集性能很差

大佬你好，我用https://github.com/YunwenTechnology/Unilm 提供的微博新闻摘要数据（从中随机挑选10000篇作为训练集，1000篇作为测试集）测试了下GPT2，发现rouge-1只有不到20%，而UniLM给出的结果有40.58%，请问这大概是什么原因？是GPT2的效果就是不好吗

xdnjust updated 1 year ago
5
microsoft/unilm #1408

train text diffuser on customized dataset

**Describe** Model I am using textdiffuser: Hi, I am training textdiffuser using my customized dataset, and I wonder how to build segmentation mask information. It seems that there is no code for g…

lwb2099 updated 1 month ago
2
chenjie97/SimBert_PyTorch #2

请问博主的uinlm体现在哪里，transformers里的bert的attention默认是双向的

SusannaWull updated 2 years ago
4
ollama/ollama #2821

Can we have the newest 1-bit model

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits https://thegenerality.com/agi/ https://arxiv.org/abs/2402.17764

chuangtc updated 4 months ago
1
artitw/text2text #9

Fine-Tuning process

Hi! I would like to know the process of fine-tuning UniLM with inverted SQUAD (hardware, training time, number of steps, parameters, etc.) Would that be possible? Thanks in advance!

ghost updated 3 years ago
3
microsoft/unilm #833

RuntimeError: gather_out_cuda(): Expected dtype int64 for in…

**Describe the bug** I am using UniLM-V1 https://github.com/microsoft/unilm/tree/master/unilm-v1/src/biunilm/decode_seq2seq.py. for generation using beam size 3 for indian languages, and getting abo…

Aniruddha-JU updated 1 year ago
2
tossyi/paper-reading #3

[2021] Adapt-and-Distill: Developing Small, Fast and Effecti…

## Paper Link https://arxiv.org/abs/2106.13474 https://github.com/microsoft/unilm/tree/master/adalm ## Upload 2021/06/25 ## What is paper about? ## Paper Contributions ## Key Points …

tossyi updated 2 years ago
2

上一页 1...3 4 5 6 7 8 9...76 下一页

757 results for unilm

757 results
for unilm