-
-
At present, for SPECTUB there is a problem that when doing forward projection, gradient calculations etc, the code wants to use multi-threading, but the SPECTUB matrix cannot (unless all views are cac…
-
### Your current environment
vllm version = 0.6.1
### Model Input Dumps
_No response_
### 🐛 Describe the bug
The output of `command:`
vllm version = 0.6.1. InternVLChat is in lis…
-
Am I right that this only works with "flat" height maps: F(x, y) = z where x, y, z are cartesian coordinates? For example, there is SRTM DEM set where each tile represents a part of Earth surface, so …
-
(venv) xiao@spider:~/ChatGLM-Finetuning$ CUDA_VISIBLE_DEVICES=0 deepspeed --master_port 8888 train.py \
> --train_path data/spo_0.json \
> --model_name_or_path ChatGL…
xxtyy updated
11 months ago
-
**Describe the bug**
for creating composite products (even when I ignore atmospheric correction) out of ABI imagery, peak memory usage exceeds 30 GB. I suspect something may be going wrong, as it als…
-
I think I would like to get into a proper way of handling AD on manifolds. I know we have quite some issues open here (#17, JuliaManifolds/Manifolds.jl#42, JuliaManifolds/ManifoldDiff.jl#27, JuliaMani…
-
## Abstract
- Adaptive methods such as Adam, Adagrad, RMSprop performa well in initial portion of training, but have been found to generalize poorly compared to SGD at the end
- Propose SWATS, a sim…
-
Looking through the code, I notice that there are mini-batches consisting of just negative examples that appear to be ignored entirely. If the code ignores certain combinations, how does using GradCac…
-
## 🐛 Bug
I trying to run linformer model with DistributedDataParallel from [this repo](https://github.com/tatp22/linformer-pytorch)
## To Reproduce
run this script
```python
import os
im…