fastbert Search Results

46 results
for fastbert

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

autoliuweijie/FastBERT #14

Miss attention FLOPS?

Hi, I found in MultiHeadedAttention, thop only count the FLOPS of linear layer, missing the attention operation.

YeDeming updated 4 years ago
4
autoliuweijie/FastBERT #16

复现时的问题

你好，我在复现您的实验（没有进行任何修改）的时候在主干网络的训练时准确率是逐渐提高的，在蒸馏阶段验证集和测试集的acc每一个epoch都和主干网络的最后一个epoch相同，请问是我哪里出错了吗？

1125690278 updated 3 years ago
8
utterworks/fast-bert #19

Classification Metrics usage

How can i use the confusion matrix for each class and the other metrics in this link https://github.com/kaushaltrivedi/fast-bert/issues/17 ??

ahmedbahaaeldin updated 2 years ago
13
utterworks/fast-bert #175

Sagemaker training fails with error : UnexpectedStatusExcept…

I tried following this tutorial. https://medium.com/@kaushaltrivedi/train-and-deploy-mighty-transformer-nlp-models-using-fastbert-and-aws-sagemaker-cc4303c51cf3 The training fails, please see belo…

pssnew2pro updated 4 years ago
2
utterworks/fast-bert #18

weights not initialized when saving/loading

When i train a fastbert model and save it using save_and_reload(), the model output is not consistent with the models output before saving. code to reproduce: ``` from fast_bert import BertClas…

SorenJ89 updated 5 years ago
4
autoliuweijie/FastBERT #4

复现效果中GPU推理加速比较低

你好，我在复现论文效果时遇到两个问题，请教一下。 1. 当我训练子分类器时，得到的效果没有直接用true label训练效果好； 2. 最终推理时，我在CPU上得到了11x的速度提升，但是GPU上只有2x。下面是我分享复现时的细节，并非全部与所问问题相关： - 我用的是中文二分类数据集，40w作为训练集，3w作为测试集，后面的效果都是在测试集上得出的； - teacher分类器和s…

dawson-chen updated 3 years ago
6
microsoft/semantic-kernel #478

Update Tokenizer to use Microsoft.ML.Tokenizers library

The existing tokenizer implementation supports only GPT models. The [Microsoft.ML.Tokenizers](https://www.nuget.org/packages/Microsoft.ML.Tokenizers/0.21.0-preview.22621.2) package provides a …

luisquintanilla updated 6 months ago
14
s-nlp/detox #9

T5 paraphraser baseline

Hello! Could you please share some details about the T5 paraphraser baseline? Namely, which model was used -- the original or the one fine-tuned on the subset of ParaNMT? And what parameters for ge…

BunnyNoBugs updated 1 year ago
5
fly51fly/aicoco #4

爱可可老师一周热门分享

fly51fly updated 4 years ago
99
alibaba/BladeDISC #937

About control flow

Hi, I want to know how your compiler deal with control flow. For example FastBert, which need dynamically exit. Or for some pipeline relation extraction model, which the number of token-pair are chang…

JaheimLee updated 1 year ago
1

上一页 1...1 2 3 4 5...5 下一页

46 results for fastbert

46 results
for fastbert