-
Hi,
Many thanks for the implementation!
In the paper, the authors say that the DT policy chooses an action given the previous K states and returns-to-go. However, as far as I can see, in your co…
-
## Expected Behavior
Concatenating categorical variables is a powerful feature engineering technique, often used in competitions. You could watch the 9 minuts of this video for understanding the to…
-
This is similar to what was discussed in https://github.com/huggingface/transformers/issues/9205, which proposed not to random init weights on `from_pretrained`, but this time it's about resume - curr…
-
In the TypeScript compiler, we have the concept of a "transformer". A transformer is a function which "transforms" one AST to another. TypeScript ships many such transforms, including all of the vario…
-
Hi,
I wanted to try it out for testing purpose. For that I downloaded both the `llava_med_in_text_60k_delta.zip` and llama weights. but when I tried to run following command -
```
python3 -m ll…
-
**What happened**:
Recently, a customer had an issue running a Dask-based PyTorch workflow that would quietly and inexplicably (no errors or worker logs) kill kernels on clusters or even operat…
-
Announcement is designed to publish using discussion.
-
Hey there,
I just wanted to make you aware of potential issues with shipping Sponges Mixins within your jar.
When using SpongeForge your version of Mixins is preferred over the version from the Sp…
-
i use unsloth to fine tune llama 3-8B..., after traning complete i save this model to hugging face by using 'push_to_hub', but it shows these files :
.gitattributes
README.md
adapter_config.json
…
-
Although the fact that HaTeX uses Haskell syntax already is a significant advantage for writing documents in it (as opposed to in LaTeX itself), “raw HaTeX” also has a few issues that keep bugging me:…