georgian-io / Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
https://multimodal-toolkit.readthedocs.io
Apache License 2.0
587 stars 84 forks source link

Standardize output formats to match transformers #35

Open akashsaravanan-georgian opened 1 year ago

akashsaravanan-georgian commented 1 year ago

At present all models return return loss, logits, classifier_layer_outputs.

Transformer models generally return loss, logits, hidden_states, attentions. Maybe we realign with this in the future?

Ref: https://github.com/huggingface/transformers/blob/main/src/transformers/modeling_outputs.py#L665-L699