-
Hi @huanzhang12, it's me again hahaha.
So far I have been using the IBM ART to generate CLEVER scores, and I understand that you work with them to keep the repos updated. And I just have a question…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
# 模型加载代码
model = AutoModel.from_pretrained(pre_model_path, trust_remote_code=True, torch_dty…
-
For transformer models with small to medium-sized gemms, the advantages of using fp8 cublasLt gemms may be overshadowed by the additional computational overhead introduced by memory loads in the quant…
-
Hi, I'm trying to use Integrated Gradients on a simple DQN model in my MacBook using the MPS backend.
`model = IntegratedGradients(model)`
`attribution = xai.attribute(torch.tensor(ob, dtype = to…
-
Hi, I have a use case where I believe fine-tuning the model with few of the params are freezed will be beneficial. I've modified the `init_from_ckpt` function in `ldm/models/diffusion/ddpm.py` as foll…
-
I am testing the adjoint method to calculate the gradients from a SemiImplicitEuler solver. I met errors when calculate the gradients using BacksolveAdjoint method. Here is a working example. It woul…
-
I applied this model on my dataset of images, converted them into arrays and feed them to the model. The model gets compiled after that. I am also getting the model summary, but when i try to fit the…
-
I am just trying out NGB and the **LinAlgError** occured. It seems the matrix has a determinant of zero, according to this post https://stackoverflow.com/questions/10326015/singular-matrix-issue-with-…
-
## Motivation
Model ensembling is appealing in the RL context with a range of use cases, e.g., critic ensembles and parallel inference of multiple agents with the same actor structure. And I believ…
-
### Motivation and description
Currently the `scan` method is used to mark the nodes before applying the actual backpropagation in the graph.
https://github.com/FluxML/Tracker.jl/blob/master/src/…