-
或许是因为找工作的原因,或者是在数据、模型、任务的繁多设置之间迷了方向,现在回过头看自己之前做的分词实验,觉得非常失败。
首先是很多信息没有记录全,导致现在回头看根本不知道到底是怎么做的了。
其次,数据结果太冗余。表现在两点,一是把prf、acc全给记录下来了。详细信息当然得保存下来,但是不应该展示在这里。二是为了做实验而做实验。我觉得在模型比较阶段,测试在开发集的效果就可以了——因为之前的实…
-
Hello,
I have the following issue. I have looked through all existing issues and it seems that it is a new issue. It happens when I try to train a model. The errors happen for both dummy data (I go…
-
你好,请问直接densnet连接ctc和加上lstm再连ctc哪个效果会比较好,作者有试过吗?
如果想加lstm的话,训练代码需要做什么改变吗
-
读了论文,我的理解文中的Position Attention如下:
```Python
# feature_map: (HW, channels)特征图
# w: (len_seq, channels)位置编码
q = w # (len_seq, channels) 实际上位置编码就是一个可训练参数,基本上就是个fc层
k = unet(feature_map) # (HW , …
-
Hi, I'm trying to extract bert features by `extract_bert_features.sh`. I find that the token features are extracted based on a document-level, which generates embeddings based on a sequence of sentenc…
-
What would be the main steps for building a real-time decoder on top of EESEN?
I read in the EESEN paper that composing the tokens, lexicon and grammar speeds up decoding a great deal, and I'd li…
-
Joint Learning of Domain Classification and Out-of-Domain Detection with Dynamic Class Weighting for Satisficing False Acceptance Rates
Joo-Kyung Kim, Young-Bum Kim. Amazon Alexa. Interspeech 2018
…
-
https://www.sciencedirect.com/science/article/abs/pii/S0950584923001830
# Bibtex
```
@article{cai2023software,
title={A software vulnerability detection method based on deep learning with comp…
-
Hello, How are you. First Thank You For Fast Replay on The Issue Thanks alot. You got Me Every Time on The Track.
My Question today is on architecture and Design you say on paper " you first empl…
-
Attention is not Explanation
Sarthak Jain, Byron C. Wallace
Accepted as NAACL 2019 Long Paper. Draft Version
https://arxiv.org/abs/1902.10186
github: https://github.com/successar/AttentionEx…