-
Hey,
thank you for your great work, I am interested in it a lot. I have a question about parallelization. Is it possible to solve many ups at the same time, as mini baches in deep learning? Thank you…
-
Hi and congrats on the package!
I'm one of the reviewers for the JOSS paper you submitted, so here I'll list my questions and concerns about the documentation. This issue will be updated as my read…
-
Official docs:
https://book.cairo-lang.org/ch02-02-data-types.html#type-conversion
https://book.cairo-lang.org/ch05-02-an-example-program-using-structs.html#custom-type-conversion
The idea is to…
-
Hello,
we have noticed some unexpected behaviors when fine-tuning a llama 3 model on 1 gpu and when fine-tuning the same model on the same data set with 2 gpus in parallel mode. See the attached te…
-
### System Info
- `transformers` version: 4.44.2
- Platform: Linux-5.10.220-209.869.amzn2.x86_64-x86_64-with-glibc2.26
- Python version: 3.10.14
- Huggingface_hub version: 0.25.1
- Safetensors ve…
-
Hello, thank you for your work.
I am reproducing PureACL and I have some questions.
1. GPU memory
I experimented on the RTX A5000 (24G) with the default setting: batch size 3, Adam optimizer with …
-
@jrhastro said @jkrick was interested in feedback I have on the notebook review process based on our experience with a similar process at STScI, so here I'm offering some suggestions on that basis. A…
-
Your tests look reasonable, I think checking for errors like NaN, inf etc is a good use of them for this kind of machine learning work. I think the workflow itself, when you connect the parts together…
-
Parallel processing and feature segmentation methods are well-established in the field of deep learning, particularly in network architectures such as those described in the Inception series. If the p…
-
To evaluate the behavior of the two agent types—**IndividualAgent** (competitive, individualistic behavior) and **SystemAgent** (collaborative, cooperative behavior)—design a series of experiments tha…