-
bash training/finetune_RedPajama-INCITE-Chat-3B-v1.sh
My configuration changes are as follows:
--lr 1e-5 --seq-length 2048 --batch-size 8 --micro-batch-size 1 --gradient-accumulate-step 1 \
--num-layers…
-
Hi,
I'm using multi-node training and need to know how to calculate the hyperparameter values in the train_redpajama script. Could you elaborate on how to set these values?
Here are …
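For context, Megatron-style training scripts typically tie these flags together as global batch size = micro batch size × gradient-accumulation steps × data-parallel size. A minimal sketch of the arithmetic (the variable names are illustrative, and it is an assumption that train_redpajama follows exactly this convention):
```
# Illustrative arithmetic only; assumes the usual Megatron-style convention.
NNODES=2                    # number of machines
GPUS_PER_NODE=8
MICRO_BATCH_SIZE=1          # per-GPU batch per forward/backward step
GRAD_ACCUM_STEPS=4
# Data-parallel size = total GPUs / (tensor parallel * pipeline parallel)
TP=1; PP=1
DP_SIZE=$(( NNODES * GPUS_PER_NODE / (TP * PP) ))
GLOBAL_BATCH_SIZE=$(( MICRO_BATCH_SIZE * GRAD_ACCUM_STEPS * DP_SIZE ))
echo "global batch size: ${GLOBAL_BATCH_SIZE}"   # 1 * 4 * 16 = 64
```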
-
```
bash scripts/apptainer_run_quality_signals.sh \
--config configs/rp_v2.0.conf \
--dump_id "2022-49" \
--input_base_uri "file:///path/to/data/root" \
--output_base_uri "file:///path/to…
```
-
## Issue
I encountered a deadlock while running a JAX-based LLM training script on a TPU-v4-32 pod. I SSH'd into worker 0 and ran the script there directly, instead of using `--worker all --command "..."…
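In multi-host JAX, every host in the pod must launch the same program; starting it on a single worker hangs at collective initialization. A minimal sketch of launching on all workers at once (the TPU name, zone, and script path are placeholders):
```
# Run the identical script on every worker VM of the pod.
# "my-tpu-v4-32", the zone, and train.py are illustrative placeholders.
gcloud compute tpus tpu-vm ssh my-tpu-v4-32 \
  --zone=us-central2-b \
  --worker=all \
  --command="python3 train.py"
```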
-
Can you provide an example of how to launch a training instance? How can one choose the LLaMA model size (350M, 750M, .. 7B, etc.)? Thanks in advance.
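In most LLaMA-style training codebases the model size is not a single flag; it is determined by the architecture hyperparameters. A minimal sketch using Megatron-style flag names (the script name and flags are assumptions and vary by repo, but the 7B shape itself, 32 layers / hidden size 4096 / 32 attention heads, matches the LLaMA paper):
```
# Hypothetical launch; flag names are illustrative and vary by codebase.
# LLaMA-7B architecture: 32 layers, hidden size 4096, 32 attention heads.
python pretrain.py \
  --num-layers 32 \
  --hidden-size 4096 \
  --num-attention-heads 32 \
  --seq-length 2048
```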
-
- [ ] [SciPhi/AgentSearch-V1 · Datasets at Hugging Face](https://huggingface.co/datasets/SciPhi/AgentSearch-V1)
#### Getting Started
The AgentSearch-V1 dataset is a comprehensive collection of over …
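One way to pull the dataset locally (assuming the `huggingface-cli` tool that ships with `huggingface_hub` is installed):
```
# Download the full dataset repo from the Hugging Face Hub.
pip install -U "huggingface_hub[cli]"
huggingface-cli download SciPhi/AgentSearch-V1 --repo-type dataset
```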
-
Hi :wave:
I was wondering if there are any ongoing initiatives for training a new Vicuna model based on the fully open-source [OpenLLaMA](https://github.com/openlm-research/open_llama)? This would ul…
-
Hello,
Thank you very much for open-sourcing such an interesting project. I followed the steps as prompted and ran a test. However, an error occurred during the rendering process, which caused th…
-
Dear RedPajama team,
I apologize if this is not the right place to ask questions, but I was curious about several aspects of your project and couldn't find a better way to reach out.
I'm a…
-
### 🐛 Describe the bug
Command run: colossalai run --nproc_per_node 8 --master_port 8822 --hostfile /home/edcuser/models/ColossalAI/examples/language/llama/hostfile.txt --master_addr 192.168.30.3 pretrain…
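For reference, the file passed to `--hostfile` is a plain list of reachable hostnames or IP addresses, one per line. A minimal sketch (the second address is illustrative):
```
# Illustrative hostfile for a 2-node run; one host per line.
cat > hostfile.txt <<'EOF'
192.168.30.3
192.168.30.4
EOF
```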