-
Let's start by saying that, based on my reading of various papers, Model Parallelism (MP) is a very inconsistent term. One can slice vertically or horizontally. One can implement a naive slow version …
-
## Description
In finetunine_classifier.py, the metric is updated on each batch and the result is read after each epoch, in both train() and evaluate().
However, there are some wrong settings:
1. We …
-
# The Problem
There has to be a better way than having a Python script running 24/7 on an EC2 instance. I will need to look into different ways to get the data.
The crux of the issue is how the da…
-
I've found out the hard way that most newer micro SD cards report "SD card init failed" on the Sdrive, even ones of the same manufacturer/model/size that were working great a few months ago.
Up to now P…
-
When I run RWKV-LM-Lora, I get this error.
The model I use:
RWKV-4-Raven-1B5-v9-Eng99%-Other1%-20230411-ctx4096.pth
The command I run in WSL2:
python3 train.py --load_model /home/wubo/chatRWKV…
-
I am trying to use the framework to continue pretraining llama3-8B. I have converted the HF checkpoint into nanotron format and the generated tokens seem reasonable.
I use the following setting to…
-
While preparing the benchmark for eager and dynamo using the code from the fork https://github.com/tfogal/NeMo, I get errors in the dynamo case.
## 🐛 Bug
After fixing [1187](https://github.com/Ligh…
-
It is my understanding from the videos that batch size should have no influence on convergence. But I have cases where increasing the batch size will lead to underfitting, or where decreasing the batc…
-
https://docs.starrocks.io/docs/sql-reference/sql-functions/table-functions/files/
-
Hi All,
I am using StreamingLens in my Spark Structured Streaming application, but it always shows the same logs. BatchId is getting updated, but **Streaming Query State: NONEWBATCHES** remains the same.…