-
Hey there,
First off, huge thanks for all the hard work on this amazing physics engine! It's been a blast to work with, and I appreciate all the effort that's gone into it.
I've been tinkering a…
-
![5D190FA6EE718064BEC8DBD812DCF1B3](https://github.com/user-attachments/assets/f7fd1920-6046-46e7-9162-f6b30ee15a8e)
I downloaded siglip-so400m-patch14-384 and write down the path. What else do I n…
-
### 🚀 The feature, motivation and pitch
Gemma-2 and new Ministral models use alternating sliding window and full attention layers to reduce the size of the KV cache.
The KV cache is a huge inferen…
-
**Is your feature request related to a problem? Please describe.**
Tabor is currently limited in that it does not allow users to describe the desired spatial referencing system for their layers.
*…
-
g++ -c -o layers/activation_layer.o -Wall -std=c++11 -shared -fPIC -Wno-error=deprecated-declarations -I/opt/nvidia/deepstream/deepstream/sources/includes -I/usr/local/cuda-12.6/include layers/activa…
-
在用qlora在两张32GbV100上微调Llama-3___2-3B-Instruct时最后保存模型的时候报错
slurm脚本为
```#!/bin/bash
#SBATCH --job-name=openrlhf
#SBATCH --partition=gpu_v100
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=1
#SBATCH…
-
Your idea is great but I seem to be having trouble reproducing it.:RuntimeError: Error(s) in loading state_dict for LlamaForCausalLM:
Missing key(s) in state_dict: "model.layers.0.mamba.dt_bias", "mo…
-
If i create a new nuxt layer, and install this module. when i deploy it to cloud flare i get a 500.
steps to reproduce:
-npx nuxi init --template layer test-layer
- cd test-layer
- npm install …
-
### Is there an existing issue for this feature request?
- [X] I have searched the existing issues
### Is your feature request related to a problem?
When printing objects that have large flat areas…
-
This is extracted from #2988 as it was closed after the `jj annotate` command got added.
Since there are many systems like Codesearch or Codereview tools which want to integrate with `jj-lib` we s…