issues
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
https://aka.ms/GeneralAI
MIT License
3.71k stars
283 forks
Bump pillow from 10.0.1 to 10.3.0 in /minillm/transformers/examples/research_projects/decision_transformer
#184
dependabot[bot]
closed
5 months ago
1
【MiniLLM】is it normal to get negative loss at some step?
#183
lllyyyqqq
closed
1 month ago
1
Questions about task datasets used in the paper "Adapt LLM to domains"
#182
Lydia-yang
closed
7 months ago
2
RoBERTa Corpus
#181
stephencurry-web
closed
1 month ago
1
The update method in the UCB algorithm is inconsistent with the paper and code
#180
kerala21
opened
7 months ago
2
Bump black from 22.1.0 to 24.3.0 in /minillm/transformers/examples/research_projects/decision_transformer
#179
dependabot[bot]
closed
5 months ago
1
[llm_retriever] Questions about the dataset
#178
OStars
opened
8 months ago
0
ModuleNotFoundError: No module named 'deepspeed'
#177
qxpBlog
closed
1 month ago
1
ImportError: cannot import name 'mpu' from 'transformers'
#176
qxpBlog
closed
8 months ago
4
Missing Jailbreak dataset from protegi?
#175
tboen1
opened
8 months ago
2
Update train.py
#174
haorannlp
closed
8 months ago
0
Bump transformers from 4.35.2 to 4.36.0 in /reslora
#173
dependabot[bot]
closed
8 months ago
0
add implementation of ResLoRA.
#172
hgtttttt
closed
8 months ago
0
[tuna] Libraries are conflicting and/or very aged
#171
batawfic
opened
8 months ago
5
[MiniLLM] About the gradient accumulation in finetune.py
#170
songmzhang
closed
8 months ago
2
Learning Law
#169
t1101675
closed
8 months ago
1
[MiniLLM]Why dolly only has 12435 training samples?
#168
yumath
closed
9 months ago
2
【MiniLLM】About the number of training data of dolly
#167
songmzhang
closed
9 months ago
4
Bump cryptography from 41.0.2 to 42.0.4 in /minillm/transformers/examples/research_projects/decision_transformer
#166
dependabot[bot]
closed
9 months ago
1
Bump cryptography from 41.0.2 to 42.0.2 in /minillm/transformers/examples/research_projects/decision_transformer
#165
dependabot[bot]
closed
9 months ago
1
Questions about the free-law data used in the paper "Adapt LLM to domains"
#164
WUHU-G
opened
9 months ago
2
[MiniLLM]LLama sft on Dolly hard to reproduce results in paper.
#163
yumath
closed
9 months ago
2
Bump cryptography from 41.0.2 to 42.0.0 in /minillm/transformers/examples/research_projects/decision_transformer
#162
dependabot[bot]
closed
9 months ago
1
Bump dash from 2.3.0 to 2.15.0 in /minillm/transformers/examples/research_projects/decision_transformer
#161
dependabot[bot]
closed
9 months ago
1
[MiniLLM] sft of llama2-7b out of memory on V100
#160
yumath
closed
9 months ago
2
Bump aiohttp from 3.8.5 to 3.9.2 in /minillm/transformers/examples/research_projects/decision_transformer
#159
dependabot[bot]
closed
9 months ago
1
Bump pillow from 10.0.1 to 10.2.0 in /minillm/transformers/examples/research_projects/decision_transformer
#158
dependabot[bot]
closed
9 months ago
1
why is the mpu/cross_entropy missing a softmax_logits_t
#157
155394551lzk
closed
9 months ago
2
top-p < 1 fails inf assertion
#156
artsobolev
opened
10 months ago
1
Is LLMA lossless?
#155
riyaj8888
closed
9 months ago
1
Bump jinja2 from 2.11.3 to 3.1.3 in /minillm/transformers/examples/research_projects/decision_transformer
#154
dependabot[bot]
closed
9 months ago
1
Bump gitpython from 3.1.32 to 3.1.41 in /minillm/transformers/examples/research_projects/decision_transformer
#153
dependabot[bot]
closed
9 months ago
1
Bump gitpython from 3.1.32 to 3.1.41 in /minillm/transformers/examples/research_projects/distillation
#152
dependabot[bot]
closed
9 months ago
1
prompt_optimization
#151
chensimian
opened
10 months ago
2
Bump fonttools from 4.31.1 to 4.43.0 in /minillm/transformers/examples/research_projects/decision_transformer
#150
dependabot[bot]
closed
9 months ago
1
Paper:ADAPTING LARGE LANGUAGE MODELS VIA READING COMPREHENSION
#149
J-G-Y
closed
10 months ago
3
AdaptLLM models with Llama Index
#148
mirix
closed
8 months ago
6
The file name is missing l
#147
ycp1027
closed
10 months ago
2
use_bf16_for_qwen
#146
SleepEarlyLiveLong
closed
11 months ago
0
Details for GPT4 evaluation
#145
jongwooko
opened
11 months ago
0
add a patch for the integration of qwen and qwen_parallel into minillm
#144
SleepEarlyLiveLong
closed
11 months ago
0
Integrate qwen and qwen_parallel into minillm pipeline
#143
SleepEarlyLiveLong
closed
11 months ago
1
Bump transformers from 4.28 to 4.36.0 in /llm_retriever
#142
dependabot[bot]
closed
11 months ago
1
Bump transformers from 4.26.1 to 4.36.0 in /minillm/transformers/examples/tensorflow/language-modeling-tpu
#141
dependabot[bot]
closed
11 months ago
1
Bump transformers from 4.26.0 to 4.36.0 in /minillm/transformers/examples/research_projects/vqgan-clip
#140
dependabot[bot]
closed
11 months ago
1
Bump transformers from 4.21.2 to 4.36.0 in /promptist/trlx/docs
#139
dependabot[bot]
closed
11 months ago
1
Bump transformers from 4.21.1 to 4.36.0 in /minillm/transformers/examples/research_projects/codeparrot/examples
#138
dependabot[bot]
closed
11 months ago
1
Bump transformers from 4.19.0 to 4.36.0 in /minillm/transformers/examples/research_projects/codeparrot
#137
dependabot[bot]
closed
11 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/pplm
#136
dependabot[bot]
closed
11 months ago
1
Bump transformers from 3.5.1 to 4.36.0 in /minillm/transformers/examples/research_projects/bertology
#135
dependabot[bot]
closed
11 months ago
1