-
**Describe the bug**
When running `merge_lora_weights/merge.py` with TP and PP set to 1 on a fine-tuned minitron checkpoint, I run into the following error:
```sh
raise RuntimeError(f"world_size ({w…
-
# High cardinality predictors for #TidyTuesday museums in the UK | Julia Silge
A data science blog
[https://juliasilge.com/blog/uk-museums/](https://juliasilge.com/blog/uk-museums/)
-
I tried training the paraphraser with gpt2 (small) as the large model would not fit my 1080 Ti. Everything went alright until the last iteration, where I got the error below. The final checkpoint seem…
-
hello, Im having a rough time trying to this program works, I was using Monika AI just fine, so I wanted to try this, but Im having a hard time trying to make it works:
I follow all the steps, inst…
-
I am using Anaconda to build my own project. I am using Python version 3.10.14 and downloaded Ollama, pulled Mistral for my LLM, and pulled Nomic-Embed-Text for my embedding model. I followed the inst…
-
From jira-archive created by [sl-service-account](https://github.com/sl-service-account): secondlife/jira-archive#6503
# Information
Since there was recently talk of work possibly returning to M…
-
Thank you for the awesome and interesting research and project. I was wondering if anyone has encountered the following error when using multiple gpu. I have 4 Titan V gpus and to use them I've set th…
-
I am trying to predict answers for a new text with questions with a base albert model fine-tuned on squad2.0. These are my parameters:
python -m run_squad_v2
--albert_config_file albert/m…
-
### What is your suggestion?
Hi, I would like to be able to record package revisions in lockfiles.
I want to be able to record exactly what binaries went into a build, including which package revi…
-
Hi all,
Thanks for your excellent work. I met the following problem when using `triton == 2.0.0`. Forward succeed, but backward failed. How can I solve it, thanks.
`error: 'scf.for' op expects reg…