-
Hello @EricLBuehler, opening this issue as part of T5 Seq2Seq model architecture support in mistral.rs. (As discussed)
Relates to: #156
-
Hi, thanks for your wonderful work!
But I have some questions about your open code. I saw that **"--init_shallow_word"** is used in gpt2 model(GPT2LMHeadModel), so the prev_key and prev_value can be …
-
Hi
I am executing the Eval code on the spider data with picard using docker image
1. provided 777 permission to mentioned 2 directories
2. 'make eval'
3. inside docker -> Install stanza pack…
-
### Describe the issue as clearly as possible:
I love the Outlines project, especially because it allows for various model configurations while maintaining constraint.
However, I have been unable to…
-
.
Instructions for updating:
Use tf.cast instead.
Traceback (most recent call last):
File "webserver.py", line 149, in
model=model, input_pipeline=input_pipeline_infer, batch_size=batch_si…
-
## Issue
I am trying to use the Captum library to get attributions for my sequence-to-sequence T5 model. However the attributions are returned to the input token of the **decoder** rather than the i…
-
I get this error message when I try to run train.py:
`Traceback (most recent call last):
File "train.py", line 15, in
tf.app.run()
File "/usr/local/lib/python2.7/dist-packages/tensorflow/pyt…
urjan updated
8 years ago
-
Hello,
I was trying to load a finetuned model for the VSR task. I followed the indications on the repository and the jupyter notebook (below you can see that I tried to import modules from the avhu…
-
Why does the KeyError occurs?I can not understand.
Can you help me?Thank you
![](http://ofht327si.bkt.clouddn.com/2017-09-12%2006-50-24%E5%B1%8F%E5%B9%95%E6%88%AA%E5%9B%BE.png)
![](http://ofht327si…
-
Right now the code is just one big file. Break it down into classes. Make it GOT-independent, that is add code to use any corpus of data
vshan updated
7 years ago