Open SeekPoint opened 1 year ago
and how to install alpaca-rlhf
(gh_alpaca-rlhf) amd00@asus00:~/llm_dev/alpaca-rlhf$ sh run.sh --num_gpus 1 ./alpaca_rlhf/deepspeed_chat/training/step1_supervised_finetuning/main.py --sft_only_data_path MultiTurnAlpaca --data_output_path ./rlhf-tmp/ --model_name_or_path ~/hf_model/llama-7b-hf --per_device_train_batch_size 2 --per_device_eval_batch_size 2 --max_seq_len 128 --learning_rate 3e-4 --num_train_epochs 1 --gradient_accumulation_steps 8 --num_warmup_steps 100 --output_dir ./rlhf/actor --lora_dim 8 --lora_module_name q_proj,k_proj --only_optimize_lora --deepspeed --zero_stage 2
start 20230602162350--------------------------------------------------
[2023-06-02 16:23:51,869] [WARNING] [runner.py:191:fetch_hostfile] Unable to find hostfile, will proceed with training with local resources only.
╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│ /home/amd00/anaconda3/envs/gh_alpaca-rlhf/bin/deepspeed:6 in
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1085      G   /usr/lib/xorg/Xorg                  4MiB |
|    0   N/A  N/A      1967      G   /usr/lib/xorg/Xorg                  4MiB |
|    0   N/A  N/A    259783      C   ...Speed-Chat/bin/python3.10      755MiB |
+-----------------------------------------------------------------------------+
(gh_alpaca-rlhf) amd00@asus00:~/llm_dev/alpaca-rlhf$
I have a single 3090, so I changed --num_gpus to 1.
Also, how do I install alpaca-rlhf?