-
**Describe the bug**
I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below:
…
-
The package needs an example of adding models and GAM would be a natural next step. See [here](https://github.com/stan-dev/rstanarm/blob/master/R/stan_gamm4.R) and [here](https://github.com/stan-dev/…
-
For logistic regression, the rope range is a function of 3/sqrt(pi), as this is the sd of the logistic distribution (on the latent scale).
However, this is the _conditional_ sd, which is akin to sigm…
-
OS: Linux Ubuntu 22.04
MZN version: 2.8.5
Hello! I am currently working on a MiniZinc model involving float and integer variables, using a linear solver (in my case, SCIP 8.0.4). In the model, I h…
-
Hi,
I am a new user of Catboost and I was wondering if it is possible to implement model tree in catboost (or in gradient boosting regression trees in general).
My feeling is that by using linear …
-
Core error reporting:
" At t = 0.00115731 and h = 9.14149e-14, the corrector convergence test failed repeatedly or with |h| = hmin.
ier POST FCVODE()= -4
time = 0
SUNDIALS_ERROR: FCVODE() ret…
-
We have an error when we run the model with sharding overrides.
Here, we run simple MNIST benchmark model with passed sharding overrides through the tt-forge-fe. All configurations/overrides are hard…
-
Assuming most other defaults are used, models trained with `--model="ScaleShiftMACE"` appear to be different in at least one important, nonobvious way from models trained with `--model="MACE"`.
Spe…
-
Hello, I have some questions about the details related to dire ft in the paper. Is Dire FT using new categories of data (such as bedrooms mentioned in the article) to continue training on the original…
-
Traceback (most recent call last):
File "/mnt/share/cq8/kennxiao/code/NF4Quant/test.py", line 41, in
prepare_model_flute(
File "/data/miniconda3/envs/env-3.9.2/lib/python3.9/site-packages/…