-
![image](https://github.com/syncdoth/RetNet/assets/902005/8eef7829-88ae-49e1-a65f-cd882268e688)
I am trying to compare this with other transformer architectures, but as soon as training starts, the gradi…
-
**Describe the bug**
Running `CNNClassifier` with nested_univ data type (multivariate time series) gives a loss/accuracy of 0.5 on each epoch. Each iteration takes 4 ms, and I highly suspect no actua…
-
While running experiments on my own dataset, the code raised the following error:
![QQ图片20231207193728](https://github.com/MAZiqing/FEDformer/assets/42921883/e86a02cd-30e1-45ca-965e-26e50a1698b7)
The parameters I used are:
`
export CUDA_VISIBLE_DEVICES=1
for model in…
-
### Your current environment
vllm=0.6.3
### Model Input Dumps
You are using a model of type qwen2_vl to instantiate a model of type . This is not supported for all configurations of models and can …
-
When I tried to perform multi-target regression on tabular data, the model's prediction output was "None", as shown in the screenshot below.
![Multitarget_issue_screen](https://github.com/user-attachm…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N…
-
There are several projects aiming to make inference on CPU efficient.
The first part is research:
- Which project works best,
- Which is compatible with the Refact license,
- Which doesn't bloat the dock…
-
Hi, I'm interested in using the OPOI subset of the DPAv4.2 dataset from your paper.
Is it possible there is a mixup in the code?
As I see it, the `convert_to_h5.py` selects the window [0:400_0…
-
When I tested sgdet on custom images, the results seemed to be wrong. I used this command to test:
`CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --master_port 10027 --nproc_per_node=1 t…
-
I started testing all the models in this repo against the Flux 0.11.4 release on Julia 1.5 (ref https://github.com/FluxML/ML-Coordination-Tracker/issues/9). In this issue I will collect all encountered…