fe1ixxu ALMA issues - Githubissues

fe1ixxu / ALMA

State-of-the-art LLM-based translation models.

MIT License

439 stars 35 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Installation issue

#70 Bostoncake opened 1 week ago
0
代码问题

#69 ccjjs opened 1 week ago
0
Ko -> En Bug

#68 sailfish009 opened 3 weeks ago
2
Some questions about alma.

#67 yuanzhiyong1999 opened 3 weeks ago
6
torch.cuda.OutOfMemoryError: CUDA out of memory

#66 risotoonero closed 2 weeks ago
1
CPO 复现，模型重复输出

#65 XDeepAzure opened 3 weeks ago
1
Oscar data structure does not match "utils.py" code and datasets module

#64 risotoonero closed 3 weeks ago
3
Problem with interleave_datasets() for monolingual data fine-tuning

#63 aiyubx closed 1 month ago
1
finetuning for a specific language

#62 jeff11-1-1 closed 2 weeks ago
1
Release X-ALMA

#61 fe1ixxu closed 1 month ago
0
Multi Language Translation for More than specified 5 Languages

#60 NavKumarGit opened 1 month ago
0
Reproduction Issues

#59 soneeee22000 opened 2 months ago
0
Unable to Reproduce ALMA-7b-LoRA Performance, Seeking Assistance

#58 liangyingshao opened 2 months ago
6
error on pretraining

#57 tsbiosky opened 2 months ago
1
Issues with Translation Quality Using ALMA/ALMA-R Models on Multi-Domain Dataset

#56 cocaer closed 3 months ago
2
GPUs used during parallel data fine-tuning

#55 liangyingshao closed 3 months ago
1
i change the sh for evaluate my data ,but met error

#54 DengNingyuan opened 4 months ago
0
torch版本问题

#53 DengNingyuan opened 4 months ago
0
Unknown source language

#52 noahdasanaike opened 4 months ago
0
predict problem

#51 leee-SeungHyeon opened 5 months ago
0
Fail to load ALMA-13B

#50 wygao8 closed 2 months ago
1
A shout out to SimPO

#49 fe1ixxu closed 5 months ago
0
citation for CPO paper

#48 kashif closed 3 months ago
1
Questions about Inference

#47 kira-lin closed 5 months ago
1
preprocess_cpo_data

#46 martimfasantos closed 5 months ago
1
Question about ALMA(R)

#45 mru4913 closed 5 months ago
2
CPO question

#44 gongye19 opened 6 months ago
2
OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory

#43 gongye19 opened 6 months ago
2
OOM 问题, 显卡是A00 40G

#42 gongye19 opened 6 months ago
5
单语数据集的构建

#41 ywlq closed 7 months ago
2
[Question] Exception during parallel data finetuning

#40 aiyubx closed 7 months ago
1
Error on running the evaluation command

#39 Amrit-Bhaskar-abhask10 opened 7 months ago
2
No such file or directory

#38 hxue3 opened 8 months ago
0
Training metrics currently not logged?

#37 SirRob1997 opened 8 months ago
2
Question on cpo loss

#36 vince62s closed 7 months ago
1
[Bug/Feature] The dataset isn't reading the same cache_dir

#35 alvations closed 3 months ago
2
[Question] Replicating ALMA by training from scratch

#34 alvations closed 8 months ago
2
Got Errors when pretraining LLaMA-2 on Monolingual Dataset

#33 vhientran closed 8 months ago
4
Pretraining inquiry.

#32 gyupro closed 8 months ago
3
Runing parallel_ft_lora.sh

#31 zhengkid closed 9 months ago
2
Loading ALMA-7B-R (LORA merged) through huggingface downloads Pretrained + LORA

#30 tranvaj closed 8 months ago
1
A couple of questions for your theory

#29 gyupro closed 8 months ago
2
Using custom monolingual data instead of OSCAR dataset

#28 learnercat closed 9 months ago
3
How much parallel data?

#27 zidsi closed 8 months ago
1
How much CPO data set is expected to be needed when creating a one-to-one machine translator?

#26 qwopqwop200 closed 9 months ago
3
running `runs/parallel_ft_lora.sh`gives overflow

#25 hndrstwn closed 9 months ago
3
Incomplete Translation from English to Chinese although `max_tokens` is enough

#24 DeyangKong closed 9 months ago
3
Fine-tuning on longer contexts for better performance?

#23 NilanEkanayake closed 9 months ago
1
</s> or eos needed for other base models?

#22 zidsi closed 9 months ago
2
DPODataCollatorWithPadding( TypeError: __init__() got an unexpected keyword argument 'max_length'

#21 sahsaeedi closed 9 months ago
2