lmops Search Results - Githubissues

48 results
for lmops

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/LMOps #128

Backward pass is invalid for module in evaluation mode durin…

Hi, I'm experiencing an Assertion Error during training of miniLLM using ZeRO with optimizer and parameter offload on a single H100 GPU. It seems as though deepspeed's parameter offload script is gett…

Ispanicus updated 11 months ago
4
microsoft/LMOps #119

[minillm] typo in bash

https://github.com/microsoft/LMOps/blob/80d7d4a0ba8d61ca7be6cae72d06cf71dda3e9e0/minillm/scripts/gpt2/eval/eval_main_self_inst.sh#L18C32-L18C32 > CKPT="${BASE_PATH}/results/gpt2/${CKPT_NAME}/" It …

wutaiqiang updated 1 year ago
1
microsoft/LMOps #95

Uprise: Error while running inference

I'm running the inference script with `bash inference_hf.sh`. But I'm getting some error related to path. ``` [2023-10-17 18:06:41,654][root][INFO] - Total encoded queries tensor torch.Size([277, …

dittops updated 1 year ago
1
microsoft/LMOps #94

Uprise: load persistent id instruction was encountered

I'm trying to run the generate_dense_embeddings script with the following command ``` python DPR/generate_dense_embeddings.py model_file=/root/LMOps/uprise/archive/data.pkl ctx_src=dpr_uprise sh…

dittops updated 1 year ago
4
microsoft/LMOps #80

I am getting "NameError: name 'overall_cls' is not defined" …

Hello all, when I run python raw2read.py I am getting "NameError: name 'overall_cls' error. Here I am providing part log. Help me in fixing in this issue. PS C:\Users\rajas\Desktop\AI_Research\LMO…

rajvadiyala updated 1 year ago
7
microsoft/LMOps #71

huggingface_hub.utils._validators.HFValidationError: Repo id…

When I run this instruction `bash scripts/opt/tools/process_data_dolly.sh /PATH/TO/MiniLLM # Process Dolly Train / Validation Data`，it has some error messages like 'huggingface_hub.utils._validators.H…

WenTingTseng updated 1 year ago
1
microsoft/LMOps #101

[miniLLM] The evaluation might be wrong when using dp_size >…

At the evaluation phase of llama-7b/gpt2-xlarge whose `MP_size=1`, I try to use 8 gpus to accelerate the evaluation phase. The code is `scripts/gpt2/eval/run_eval.sh`. I simplify this code to only …

cailinhang updated 1 year ago
3
microsoft/LMOps #70

code releasing of llm_retriever

Looking forward to your code releasing of llm_retriever :)

Mr-lonely0 updated 1 year ago
1
microsoft/LMOps #58

[minillm] how to eval sft/llama-13B with 1 A100 GPU or 4 A10…

I want to run scripts/llama/eval/eval_main_dolly.sh to evaluate sft/llama-13B, I have access to 1 A100 gpu OR 4 A10 gpus, how should I modify the scripts/llama/eval/eval_main_dolly.sh file to get it w…

SleepEarlyLiveLong updated 1 year ago
2
HumanSignal/Adala #41

Support timeout and retry

openai always timeout or raise exception, is there a plan to support openai request timeout and retry?

yinggoga updated 1 year ago
5

上一页 1...1 2 3 4 5...5 下一页

48 results for lmops

48 results
for lmops