-
I see people are trying to extract the Mistral-22b ancestor from the MoE model by averaging the MLP layers and wondered if the 'model stock' method in Mergekit could be inverted:
- Use the averaged…
-
It's by far the largest crate in terms of binary size (2.5× as big as `main`). Building it takes almost 40 seconds and 1.8 GB resident on my machine (with debugging enabled, optimization disabled).
…
-
The current version of the scoring function `compute_scores` seems to weigh non-rare states higher than rare ones. Instead of selecting states that explored rare value combinations, it prefers the one…
-
This has been discussed in
* https://github.com/quarto-dev/quarto-cli/issues/6013
* https://github.com/quarto-dev/quarto-cli/discussions/6119
Currently there is `--metadata` and `--metadata-fi…
cderv updated
4 months ago
-
Stumbled upon this and was curious if it was still in development
-
Implement GPU version of `scipy.*` functions in `cupyx.scipy.*` namespace.
This is a tracker issue that summarizes the implementation status of each SciPy public module in CuPy. See the [comparison…
-
## Description
Looking at the SecureDrop code, I see that many commits are signed. That's great! But, I couldn't find a signing policy. This makes the signatures less useful. Anyone could creat…
-
Before I get started, I would like to thank the people behind Zig for their awesome work. Zig is the first language in a long time I can even potentially see replacing C/C++ for me, which is exciting.…
-
## Bevy version
0.8.1
## What you did
I've been working on bringing the [Spine](http://esotericsoftware.com/) runtime to Bevy and one of the export options is "Premultiplied Alpha." This is a…
-
Open sesame.pdb with Try Harder option set to 2. This will load a database with about 3500 mesh objects each with 515 variables defined.The GUI (and pan/zoom ops in the Viewer) can exhibit 1020 second…