-
Hi,
I am trying to run the model on Sagemaker, but I am getting the following warning:
"Some weights of the model checkpoint at declare-lab/flacuna-13b-v1.0 were not used when initializing Llama…
-
### Describe the bug
When I tried to load the safetensors file into models/checkpoint, I encountered the following error and could not load it. These safetensors files were used in the stable-diffusi…
-
Hi,
I used this GCN to predict node scores. The MSE loss is not decreasing much: it only went from 0.2 to 0.1963. I think it can still be reduced. I tried adding another conv layer, but …
-
## Description
I am using Polygraphy to evaluate the accuracy of a super-resolution model. However, when I use the "mark all" option, the memory quickly increases and fills up all my available memory…
-
Hello,
Can anyone help me with how to run the core42/jais-13b-chat model with ctranslate2? I ran the conversion script but ran into an error. Script used:
```ct2-transformers-converter --model core42/…
-
### Background and motivation
Hi, thanks for your work.
However, while trying to migrate my PyTorch code to OneFlow, I found that there are only a few APIs in oneflow.distributions. So this part is …
-
Thanks for your wonderful work on SimA! I'm trying to replicate your DeiT-S->SimA result, but I can't find any hyperparameter settings. Are all hyperparameters, including drop-path, the same as…
-
Hi,
First of all, thank you for this great project!
I am trying to work with a fully observable environment and I am doing so by making the following changes to env.py:
`def make_env(env_ke…
-
### 🚀 The feature, motivation and pitch
Hello community, I wanted to share a piece of code that helps me execute Neighbor Search among millions of candidates within a minute. I found this very easy a…