lmops Search Results - Githubissues

48 results
for lmops

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/LMOps #171

[tuna] Libraries are conflicting and/or very aged

So disappointed of what is released here. these are just non working pieces. Funny that in train.py for example you have: from custom import CustomTrainer, but custom is actually have only TunaTrain…

batawfic updated 9 months ago
5
microsoft/LMOps #263

[MiniLLM] The "processed_data.tar" data link is invalid.

I used the code in the guide to download the data, but found that the URL was no longer valid. `DLINK=$(echo -n "aHR0cHM6Ly9jb252ZXJzYXRpb25odWIuYmxvYi5jb3JlLndpbmRvd3MubmV0L2JlaXQtc2hhcmUtcHVibGljL…

shhn1 updated 2 months ago
2
microsoft/LMOps #264

[MiniLLM] mismatch between formula and implementation (gradL…

``` def _pg_loss( self, logprobs: TensorType["batch_size", "response_size"], old_logprobs: TensorType["batch_size", "response_size"], advantages: TensorType["b…

lancerts updated 2 months ago
2
microsoft/LMOps #211

[MiniLLM] Processed RoBERTa Corpus dataset download

Unable to download processed RoBERTa Corpus only. Also encountering repeated interruptions during download of the full processed_data.tar with an error indicating dead links, possibly due to incomple…

AKaubay updated 2 months ago
2
jongwooko/distillm #10

Download the training/evaluation data

How can I download the training/evaluation intruction-response data? Can you tell me how to download the data? I tried but failed. link: https://conversationhub.blob.core.windows.net/beit-share…

ypw-lbj updated 4 months ago
2
microsoft/LMOps #93

[minillm] apply minillm to other LLMs like Baichuan/Qianwen

Thank you for your awesome work minillm that explored the knowledge distillation for LLMs. I noticed that minillm supports the gpt2/gptj/opt and llama series models only, my question is how should I d…

SleepEarlyLiveLong updated 4 months ago
6
microsoft/LMOps #237

[UPRISE]CUDA out of memory. Tried to allocate 3.25 GiB. GPU

When run the code:bash inference.sh,it occurs an error. ![image](https://github.com/microsoft/LMOps/assets/61148892/fa43a14f-e8c0-4dbe-a8e9-06556c0a3384) ![image](https://github.com/microsoft/LMOps/…

zhouchang123 updated 5 months ago
4
microsoft/LMOps #238

[UPRISE]When training the uprise,a problem happened.

Here is the config and the log info. ![image](https://github.com/microsoft/LMOps/assets/61148892/ac8c172b-30f8-43f8-865b-59bf315fa7f1) Here is the wrong message. ![image](https://github.com/microso…

zhouchang123 updated 4 months ago
17
microsoft/LMOps #222

Is reward_fn equal to log_softmax

I noticed that the `scores` in `reward_fn` is actually equal to `logits_i - logsumexp(logits)`. I think this expression can be calculated directly by `log_softmax`. Why not use `log_softmax`? htt…

EganGu updated 6 months ago
2
huggingface/peft #1520

integrate ResLoRA

### Feature request Integrate ResLoRA into PEFT. Paper: https://arxiv.org/abs/2402.18039 Code from Microsoft: https://github.com/microsoft/LMOps/tree/main/reslora ### Motivation I find interestin…

hllj updated 7 months ago
2

上一页 1...1 2 3 4 5...5 下一页

48 results for lmops

48 results
for lmops