Open andreyvelich opened 4 months ago
i am interested to contribute on this just i ping on this thread if any help is required /assign
what type of transformer we are looking . i have look for given below transformer model Data Collator can be used
Thank you for your interest @live2awesome! It would be nice if you could let us know what changes we need to make to our HF LLM Trainer to support Data Collators for other Transformers. Also, we should discuss if we should add Data Collator by default to all supported transformers.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
/remove lifecycle/stale
/lifecycle frozen
More context: https://github.com/kubeflow/training-operator/pull/2031#discussion_r1526533371. Currently, we apply HuggingFace Data Collator only for
AutoModelForCausalLM
Transformer in HF LLM Trainer.We need to investigate if we should apply it for other Transformers for language modelling models.