-
Great work, this is really interesting. I just read the README, I hope to have some time to look at the code later but there are so many interesting hobby projects to work on!
About the global poo…
-
### Regression?
Yes
### System Info and Version
System/Version info
```sh
Hyprland, built from branch at commit 9a09eac79b85c846e3a865a9078a3f8ff65a9259 (props: bump version to 0.42.0).
Dat…
-
Post a reading of your own that uses deep learning for social science analysis and understanding, with a focus on network, graph, or tabular data.
-
Here are some weights from 9x9 to 25x25.(converted from #205(85a93684) lz weight),
https://drive.google.com/open?id=1jQjCbK1xWGBglD402WpBMyeEieDG-h83
and a very rough CPP code.
https://drive.…
-
For example,I have two RTX 3090 GPUs, and both the model and ref_model are 14 billion parameter models. I need to distribute these two models evenly across the two cards for training.
this is my code…
-
```
Traceback (most recent call last):aded
File "Sakura_DPO.py", line 318, in
fire.Fire(train)
File "/root/miniconda3/lib/python3.8/site-packages/fire/core.py", line 141, in Fire
com…
-
**Describe the bug**
I try to finetune `llama3-8B` model with multi nodes but get an AtrributeError when finishing loading mcore format checkpoint and starting to build datasets, the error is below:
…
-
Hi,
`batch_T (int) – number of time-steps per sample batch`
I don't understand the effect of `batch_T` in samplers. I see another `batch_T` in R2D1 too. So what is the difference? What is the relati…
-
#### DEADLINES
Company logos, descriptions, banners, advertising pages, tote bag inserts and similar must be provided by the applicable deadlines for inclusion in the promotional materials for PyCon.…
-
### Active Development
Yes
### Move source code
I'd like to discuss this more
### Tool name
Spatial Economics Toolbox for Fisheries
### Tool abbreviation
FishSET
### Author(s)
…