fe1ixxu
/
ALMA
State-of-the-art LLM-based translation models.
MIT License
348
stars
26
forks
issues
predict problem
#51
leee-SeungHyeon
opened
1 week ago
0
Fail to load ALMA-13B
#50
wygao8
opened
1 week ago
1
A shout out to SimPO
#49
fe1ixxu
closed
1 week ago
0
citation for CPO paper
#48
kashif
opened
3 weeks ago
1
Questions about Inference
#47
kira-lin
closed
3 weeks ago
1
preprocess_cpo_data
#46
martimfasantos
closed
3 weeks ago
1
Question about ALMA(R)
#45
mru4913
closed
3 weeks ago
2
CPO question
#44
gongye19
opened
1 month ago
2
OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory
#43
gongye19
opened
1 month ago
2
OOM problem; the GPU is an A00 40G
#42
gongye19
opened
1 month ago
4
Constructing the monolingual dataset
#41
ywlq
closed
2 months ago
0
[Question] Exception during parallel data finetuning
#40
aiyubx
closed
2 months ago
1
Error on running the evaluation command
#39
Amrit-Bhaskar-abhask10
opened
3 months ago
2
No such file or directory
#38
hxue3
opened
3 months ago
0
Training metrics currently not logged?
#37
SirRob1997
opened
3 months ago
2
Question on cpo loss
#36
vince62s
closed
3 months ago
1
[Bug/Feature] The dataset isn't reading the same cache_dir
#35
alvations
opened
3 months ago
2
[Question] Replicating ALMA by training from scratch
#34
alvations
closed
3 months ago
2
Got Errors when pretraining LLaMA-2 on Monolingual Dataset
#33
vhientran
closed
3 months ago
4
Pretraining inquiry.
#32
gyupro
closed
3 months ago
3
Running parallel_ft_lora.sh
#31
zhengkid
closed
4 months ago
2
Loading ALMA-7B-R (LORA merged) through huggingface downloads Pretrained + LORA
#30
tranvaj
closed
3 months ago
1
A couple of questions for your theory
#29
gyupro
closed
3 months ago
2
Using custom monolingual data instead of OSCAR dataset
#28
learnercat
closed
4 months ago
2
How much parallel data?
#27
zidsi
closed
3 months ago
1
How much CPO data set is expected to be needed when creating a one-to-one machine translator?
#26
qwopqwop200
closed
4 months ago
3
running `runs/parallel_ft_lora.sh` gives overflow
#25
hndrstwn
closed
4 months ago
2
Incomplete Translation from English to Chinese although `max_tokens` is enough
#24
DeyangKong
closed
4 months ago
3
Fine-tuning on longer contexts for better performance?
#23
NilanEkanayake
closed
5 months ago
1
</s> or eos needed for other base models?
#22
zidsi
closed
5 months ago
2
DPODataCollatorWithPadding( TypeError: __init__() got an unexpected keyword argument 'max_length'
#21
sahsaeedi
closed
5 months ago
2
The English-Chinese translation is incomplete.
#20
detectRecog
closed
5 months ago
2
Updates data processing in utils.py [fix bug]
#19
sweta20
closed
5 months ago
1
How to fix error to access huggingface?
#18
vhientran
closed
5 months ago
1
Regarding the memory usage of full-weight fine-tuning
#17
Franciscus-Carolus
closed
6 months ago
2
Suggestion on foundation model
#16
cmp-nct
closed
6 months ago
4
NotImplementedError: all_exhausted stopping strategy in `interleave_datasets` is not implemented yet with a list of <class 'datasets.iterable_dataset.IterableDataset'>.
#15
zwhe99
closed
6 months ago
2
Release `Random` and `Filtered` parallel corpora
#14
zwhe99
closed
6 months ago
2
Polite form selection
#13
cmp-nct
closed
6 months ago
5
About how to specify pairs in `Parallel_ft.sh`.
#12
kyoto7250
closed
7 months ago
2
Added tokenizer training and wechsel
#11
nvassilyev
closed
7 months ago
0
Embedding init
#10
nvassilyev
closed
7 months ago
0
questions about reproduce the results of paper
#9
ZeroneBo
closed
7 months ago
4
What do i need to add a new language ?
#8
MohamedAliRashad
closed
9 months ago
1
7B or 13B ?
#7
geronimi73
closed
9 months ago
1
[Question] Suggested machine and GPUs to run the training
#6
alvations
closed
9 months ago
2
[Question] Is "this" needed in the general prompt?
#5
alvations
closed
9 months ago
1
About the interleave probability selections
#4
Aniruddha-JU
closed
9 months ago
2
Update README.md
#3
eltociear
closed
9 months ago
0
Checkpoints for other languages
#2
kristaller486
closed
9 months ago
1