issues
search
microsoft
/
LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
https://aka.ms/GeneralAI
MIT License
3.6k
stars
274
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
add implementation of ResLoRA.
#172
hgtttttt
closed
7 months ago
0
[tuna] Libraries are conflicting and/or very aged
#171
batawfic
opened
7 months ago
5
[MiniLLM] About the gradient accumulation in finetune.py
#170
songmzhang
closed
7 months ago
2
Learning Law
#169
t1101675
closed
7 months ago
1
[MiniLLM]Why dolly only has 12435 training samples?
#168
yumath
closed
7 months ago
2
【MiniLLM】About the number of training data of dolly
#167
songmzhang
closed
7 months ago
4
Bump cryptography from 41.0.2 to 42.0.4 in /minillm/transformers/examples/research_projects/decision_transformer
#166
dependabot[bot]
closed
7 months ago
1
Bump cryptography from 41.0.2 to 42.0.2 in /minillm/transformers/examples/research_projects/decision_transformer
#165
dependabot[bot]
closed
7 months ago
1
Questions about the free-law data used in the paper "Adapt LLM to domains"
#164
WUHU-G
opened
7 months ago
2
[MiniLLM]LLama sft on Dolly hard to reproduce results in paper.
#163
yumath
closed
7 months ago
2
Bump cryptography from 41.0.2 to 42.0.0 in /minillm/transformers/examples/research_projects/decision_transformer
#162
dependabot[bot]
closed
7 months ago
1
Bump dash from 2.3.0 to 2.15.0 in /minillm/transformers/examples/research_projects/decision_transformer
#161
dependabot[bot]
closed
7 months ago
1
[MiniLLM] sft of llama2-7b out of memory on V100
#160
yumath
closed
8 months ago
2
Bump aiohttp from 3.8.5 to 3.9.2 in /minillm/transformers/examples/research_projects/decision_transformer
#159
dependabot[bot]
closed
7 months ago
1
Bump pillow from 10.0.1 to 10.2.0 in /minillm/transformers/examples/research_projects/decision_transformer
#158
dependabot[bot]
closed
7 months ago
1
why is the mpu/cross_entropy missing a softmax_logits_t
#157
155394551lzk
closed
7 months ago
2
top-p < 1 fails inf assertion
#156
artsobolev
opened
8 months ago
1
iS LLMA lossless?
#155
riyaj8888
closed
7 months ago
1
Bump jinja2 from 2.11.3 to 3.1.3 in /minillm/transformers/examples/research_projects/decision_transformer
#154
dependabot[bot]
closed
7 months ago
1
Bump gitpython from 3.1.32 to 3.1.41 in /minillm/transformers/examples/research_projects/decision_transformer
#153
dependabot[bot]
closed
7 months ago
1
Bump gitpython from 3.1.32 to 3.1.41 in /minillm/transformers/examples/research_projects/distillation
#152
dependabot[bot]
closed
7 months ago
1
prompt_optimization
#151
chensimian
opened
8 months ago
2
Bump fonttools from 4.31.1 to 4.43.0 in /minillm/transformers/examples/research_projects/decision_transformer
#150
dependabot[bot]
closed
7 months ago
1
Paper:ADAPTING LARGE LANGUAGE MODELS VIA READING COMPREHENSION
#149
J-G-Y
closed
9 months ago
3
AdaptLLM models with Llama Index
#148
mirix
closed
7 months ago
6
The file name is missing l
#147
ycp1027
closed
9 months ago
2
use_bf16_for_qwen
#146
SleepEarlyLiveLong
closed
9 months ago
0
Details for GPT4 evaluation
#145
jongwooko
opened
9 months ago
0
add a patch for the intergration of qwen and qwen_parallel into minillm
#144
SleepEarlyLiveLong
closed
9 months ago
0
Integrate qwen and qwen_parallel into minillm pipeline
#143
SleepEarlyLiveLong
closed
9 months ago
1
Bump transformers from 4.28 to 4.36.0 in /llm_retriever
#142
dependabot[bot]
closed
9 months ago
1
Bump transformers from 4.26.1 to 4.36.0 in /minillm/transformers/examples/tensorflow/language-modeling-tpu
#141
dependabot[bot]
closed
9 months ago
1
Bump transformers from 4.26.0 to 4.36.0 in /minillm/transformers/examples/research_projects/vqgan-clip
#140
dependabot[bot]
closed
9 months ago
1
Bump transformers from 4.21.2 to 4.36.0 in /promptist/trlx/docs
#139
dependabot[bot]
closed
9 months ago
1
Bump transformers from 4.21.1 to 4.36.0 in /minillm/transformers/examples/research_projects/codeparrot/examples
#138
dependabot[bot]
closed
9 months ago
1
Bump transformers from 4.19.0 to 4.36.0 in /minillm/transformers/examples/research_projects/codeparrot
#137
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/pplm
#136
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/bertology
#135
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/bertabs
#134
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/bert-loses-patience
#133
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/deebert
#132
dependabot[bot]
closed
9 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/adversarial
#131
dependabot[bot]
closed
9 months ago
1
the logits between MP=1 and MP=4 is different when control all other variables to be the same
#130
SleepEarlyLiveLong
closed
9 months ago
9
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#129
dongzhiwen1218
opened
9 months ago
0
Backward pass is invalid for module in evaluation mode during minillm training with ZeRO parameter offload
#128
Ispanicus
closed
9 months ago
4
Timeout Error in all_gather during evaluate_ppo() on 2 H100 GPUs with miniLLM and ZeRO
#127
Ispanicus
opened
9 months ago
2
SFT data and pretrain data problem
#126
Emperorizzis
opened
9 months ago
0
MiniLLM: logit_processor generation fix in the model parallel setup
#125
artsobolev
closed
9 months ago
0
dolly/RoBERTa Corpus dataset download
#124
AInkCode
closed
9 months ago
2
MiniLLM: BOS token is missing in training, but present during evaluation
#123
artsobolev
closed
9 months ago
2
Previous
Next