-
include data from optimizer.
-
Hi,
Thanks for the nice package.
I am encountering issues when trying to use `Param_Discrete_Numeric`.
If I understand the code correctly, the idea is to use this as a continuous variable dur…
-
Hello! Thank you for the clean + user friendly codebase!
I'm trying to finetune the VQ-VAE tokenizer and noticed some keys might be missing from the pretrained checkpoint listed on huggingface: `"o…
-
The CPUOptimizerOffload class is very clever, but it relies too heavily on CUDA streams, which aren't available without a CUDA device.
It should use `torch.cpu.Stream` and `torch.cpu.current_stream` instead.
a…
-
Feature request for the Gradient Low-Rank Projection (GaLore) optimizer.
The GaLore optimizer projects gradients into a low-rank subspace to dramatically reduce memory usage. The arXiv paper is [here](https://arxiv.…
-
Does that mean the Adam optimizer is not converging, or is there some other issue with the code?
Below is the code and error:
loader2 = {'train_input': train_X1, 'train_label': train_Y1, 'test_input': t…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
0.9.0
### Reproduction
With Opacus, it is enough to wrap the training function with `privacy_engine.make_private`. For SFT, where should I make this change?
model = Ne…
-
I currently use a nested BaseModel as output of a signature for a project.
```
class Bar(BaseModel):
    val: str = Field(desc="value desc")

class Foo(BaseModel):
    bar: Bar = Field(desc="bar …
-
We need to make sure Everest has a way of validating the generic options for the `optimizer` section of the config, including the generic options that are passed to the underlying optimizer. This …
-
| | |
| --- | --- |
| Bugzilla Link | [50482](https://llvm.org/bz50482) |
| Version | trunk |
| OS | Linux |
| CC | @DougGregor,@fhahn,@ojeda,@Kojoley,@RalfJung,@zygoloid,@wjristow |
## Exte…