-
Hi,
Thank you for your great work developing TorchScale.
I have been trying to use your codebase but it seems like some modules such as "fairseq.models.squad" that were available during the de…
-
Sorry for bothering you and this may be a dumb question:
What is the Complex type used for here?
I'm not very good at math, so it would help if you could explain why complex numbers are needed.
-
Hi,
Thank you for your great work!
When I use your example code to compare the inference latency with a Transformer-based LLM, the result does not match what the paper reports (15.6X). Could you please …
-
I would like to use this repo for my job. I cannot do so until you add a license to the repo. Can you please do so soon?
-
I've been trying to set up Kosmos-2 as described in https://github.com/microsoft/unilm/tree/master/kosmos-2#setup, but it seems like dependency conflicts are preventing a successful in…
-
Hey kyegomez,
I'm interested in trying out the implementation.
Is it already possible to use a base model for this?
-
Hi, when I train with RetNet's parallel mode, it is very slow. I checked the GPU memory usage and it is very low. What's going on?
Thank you!
-
When I open the link https://publicmodel.blob.core.windows.net/torchscale/vocab/dict.txt, I get:
This XML file does not appear to have any style information associated with it. The document tree is shown below.
Pu…
-
**Describe**
If I only want to use one of the models in the repo, I have to download the whole repo.
But this is not necessary.
It is difficult to download the whole repo quickly in a short period…
-
Hi, I plan to reproduce the results of the WMT-17 translation task as presented in the DeepNet paper. Could you please let me know what the command for running the script shou…