-
This is the script I used for fine-tuning.
```
export HF_DATASETS_OFFLINE=1
export TRANSFORMERS_OFFLINE=1
export PDSH_RCMD_TYPE=ssh
# NCCL setting
export GLOO_SOCKET_IFNAME=bond0
export NCCL_SO…
```
-
I noticed that the training script says "--fp8_base^". After I trained a quick model and tried it in Comfy, I got model weight errors when running Flux in fp16. Does anyone know what setting I use…
-
Hello, I'd like to find the ideal learning rate for my DeepAR model and dataset. However, after reading the available tutorials, I'm not sure I understand how to do so.
In the "Usage" section of the Learn…
-
I'm not in the field of deep learning or computer science, but I found this work very interesting. I am unsure what I should do if I want to use the trained model for prediction. Can I achie…
-
Dear author,
I am deeply thankful for the code you have generously shared, and I have learned a great deal from your paper. However, while studying your code, I ran into a question regarding t…
-
- Collaborative Filtering: there is little data about the works themselves (roughly just the artist, tags, and pages)
- Implicit Feedback: because there is nothing like ratings
In this case, Implicit Matrix Factorization using methods like Alternating Least Squares (ALS) can…
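As an illustration of implicit matrix factorization with ALS, here is a minimal NumPy sketch using the commonly used confidence-weighted formulation (preference `p = 1` where an interaction exists, confidence `c = 1 + alpha * r`). The function name `implicit_als`, the hyperparameters `alpha`, `reg`, and `factors`, and the toy interaction matrix are all assumptions made for this example and are not taken from the original post. It uses dense matrices and is only meant for small data.

```python
import numpy as np

def implicit_als(R, factors=2, alpha=40.0, reg=0.1, iters=10, seed=0):
    """Confidence-weighted ALS on a dense user x item interaction matrix R.

    Hypothetical helper for illustration; R holds non-negative counts.
    """
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    P = (R > 0).astype(float)   # binary preference
    C = 1.0 + alpha * R         # confidence in each preference
    X = rng.normal(scale=0.1, size=(n_users, factors))  # user factors
    Y = rng.normal(scale=0.1, size=(n_items, factors))  # item factors
    ridge = reg * np.eye(factors)

    def solve_side(fixed, C_side, P_side):
        # For each row k, solve (F^T C_k F + reg*I) x = F^T C_k p_k.
        out = np.empty((C_side.shape[0], factors))
        for k in range(C_side.shape[0]):
            weighted = fixed.T * C_side[k]   # shape: factors x n_fixed
            A = weighted @ fixed + ridge
            b = weighted @ P_side[k]
            out[k] = np.linalg.solve(A, b)
        return out

    for _ in range(iters):
        X = solve_side(Y, C, P)        # update users with items fixed
        Y = solve_side(X, C.T, P.T)    # update items with users fixed
    return X, Y

# Toy data (assumed): two groups of users with disjoint item interactions.
R = np.array([
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
], dtype=float)

X, Y = implicit_als(R)
scores = X @ Y.T   # higher score = stronger predicted preference
```

For real workloads a sparse implementation would be the more practical choice; the dense loops here are only intended to make the alternating update rule visible.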
-
Feature request:
- Add a field for free tags in predictions
- Allow me to download the predictions I've made
Rationale:
In general, I'd expect my forecasting biases to be topic dependent and horizon …
-
Sorry, I reviewed the code carefully, and we cannot reproduce the results presented in the paper. Additionally, is it reasonable to use test set data during the information subgraph acquisition proces…
-
Hi, first of all, I'm not involved in the development of machine learning, AI, or that sort of thing. I'm an IoT engineer who is interested in integrating my home solar with this Quartz Solar …
-
[Understanding black-box predictions via influence functions](https://arxiv.org/abs/1703.04730)
How can we explain the predictions of a black-box model? In this paper, we use influence functions -- a…