-
Post questions here for this week's exemplary readings: 2. W. Guo, A. Caliskan. 2020. “Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Bia…
-
Hello,
So I finetuned voicecraft on the french common voice-french dataset. It's quite exciting since it's my first time working on LLM and on full audio model (not just spectrogram -> classificat…
-
I encountered an issue when attempting to trace a CrossEncoder model using torch.jit.trace. The error occurs during the tracing process when calling the forward method. Below is a minimal reproducible…
-
## 0. Paper
[Universal Sentence Encoder](https://arxiv.org/abs/1803.11175)
Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Ce ́spedes…
a1da4 updated
3 years ago
-
http://8.129.175.102/lfd2022fall-poster-session/25.html
-
I got problem when I run(using my own data)
```
python gens/gen_bert.py \
--model albert-xxlarge-v1 \
--dataset ACE05 \
--save_attention 1 \
--save_path ./wv/albert.ace05_with_he…
-
Hi !
I'm currently using the pretrained 'bert-base-nli-stsb-mean-tokens' model.
I was wondering if it was possible to add a WordWeight layer in the model and still work with the trained BERT param…
-
Hi,
I am trying to understand how Deep Explain works with classification models in text. For the same, I have taken up a simple 5-Class sentence classifier, with the following network architecture:…
damzC updated
5 years ago
-
There is a sentence in chapter 18, just before section 18.6, that says:
> Although in principle the `@fVal` attribute could point to any kind of feature value, its use is not recommended for simple…
bansp updated
4 weeks ago
-
As mentioned by `https://github.com/allenai/longformer/issues/75#issuecomment-651295800` , the default is only local attention. So how to generate good global attention? any suggestions about it?
I…