gradient-projection Search Results

1000+ results
for gradient-projection

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

junhwi/next-gen-ai #15

24/03/10

hippothewild updated 7 months ago
5
UCL/STIR #458

openmp changes to allow projection matrix parallelisation

At present, for SPECTUB there is a problem that when doing forward projection, gradient calculations etc, the code wants to use multi-threading, but the SPECTUB matrix cannot (unless all views are cac…

KrisThielemans updated 4 years ago
2
vllm-project/vllm #9495

[Feature]: LoRA support for InternVLChatModel

### Your current environment vllm version = 0.6.1 ### Model Input Dumps _No response_ ### 🐛 Describe the bug The output of `command:` vllm version = 0.6.1. InternVLChat is in lis…

AkshataABhat updated 5 days ago
11
fogleman/hmm #17

Work with curved height maps?

Am I right that this only works with "flat" height maps: F(x, y) = z where x, y, z are cartesian coordinates? For example, there is SRTM DEM set where each tile represents a part of Earth surface, so …

okla updated 2 years ago
3
liucongg/ChatGLM-Finetuning #116

OSError: ChatGLM2-6B is not a local folder and is not a vali…

(venv) xiao@spider:~/ChatGLM-Finetuning$ CUDA_VISIBLE_DEVICES=0 deepspeed --master_port 8888 train.py \ > --train_path data/spo_0.json \ > --model_name_or_path ChatGL…

xxtyy updated 11 months ago
4
pytroll/satpy #2764

Too much memory usage for composite processing

**Describe the bug** for creating composite products (even when I ignore atmospheric correction) out of ABI imagery, peak memory usage exceeds 30 GB. I suspect something may be going wrong, as it als…

akasom89 updated 7 months ago
2
JuliaManifolds/ManifoldDiff.jl #28

Taking AD seriously

I think I would like to get into a proper way of handling AD on manifolds. I know we have quite some issues open here (#17, JuliaManifolds/Manifolds.jl#42, JuliaManifolds/ManifoldDiff.jl#27, JuliaMani…

kellertuer updated 1 year ago
10
kweonwooj/papers #76

Improving Generalization Performance by Switching from Adam …

## Abstract - Adaptive methods such as Adam, Adagrad, RMSprop performa well in initial portion of training, but have been found to generalize poorly compared to SGD at the end - Propose SWATS, a sim…

kweonwooj updated 5 years ago
2
luyug/GradCache #7

How does this provide the same gradient as a larger batch si…

Looking through the code, I notice that there are mini-batches consisting of just negative examples that appear to be ignored entirely. If the code ignores certain combinations, how does using GradCac…

sameerkhanna786 updated 2 years ago
6
pytorch/pytorch #46166

Error with DistributedDataParallel with specific model

## 🐛 Bug I trying to run linformer model with DistributedDataParallel from [this repo](https://github.com/tatp22/linformer-pytorch) ## To Reproduce run this script ```python import os im…

blizda updated 3 years ago
5

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for gradient-projection

1000+ results
for gradient-projection