georgian-io / Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
https://multimodal-toolkit.readthedocs.io
Apache License 2.0

Fix TorchTabularTextDataset.__len__ method #62

Closed DougTrajano closed 9 months ago

DougTrajano commented 9 months ago

TorchTabularTextDataset accepts labels=None, but its __len__() method uses self.labels to compute the length, so it fails when no labels are passed.

This PR replaces that with the length of self.encodings.
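
A minimal sketch of the idea, not the library's actual code: TabularTextDatasetSketch is a hypothetical stand-in for TorchTabularTextDataset, and it assumes encodings is dict-like with an "input_ids" entry holding one sequence per sample. For a dict-like object, len() of the object itself would count keys rather than samples, so the sketch measures the "input_ids" entry; that choice is an assumption and is not confirmed by this PR.

```python
class TabularTextDatasetSketch:
    """Hypothetical stand-in for TorchTabularTextDataset (illustration only)."""

    def __init__(self, encodings, labels=None):
        # Assumed dict-like tokenizer output: {"input_ids": [...], ...}
        self.encodings = encodings
        # labels may be None, e.g. when the dataset is used for inference
        self.labels = labels

    def __len__(self):
        # len(self.labels) would raise TypeError when labels is None,
        # so the sample count is taken from the encodings instead.
        return len(self.encodings["input_ids"])


# Two toy samples with made-up token ids
encodings = {
    "input_ids": [[101, 7592, 102], [101, 2088, 102]],
    "attention_mask": [[1, 1, 1], [1, 1, 1]],
}
unlabeled = TabularTextDatasetSketch(encodings)
labeled = TabularTextDatasetSketch(encodings, labels=[0, 1])
```

With this change the length is the same whether or not labels are supplied, which is what a DataLoader needs when running predictions on unlabeled data.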

akashsaravanan-georgian commented 9 months ago

Thanks for all the effort, Doug! I've merged both PRs.