pytorch / text

Models, data loaders and abstractions for language processing, powered by PyTorch
https://pytorch.org/text
BSD 3-Clause "New" or "Revised" License
3.49k stars 813 forks source link

labels in IMBD #2162

Closed AlexanderTrg closed 1 year ago

AlexanderTrg commented 1 year ago

🐛 Bug

all labels in train ds from IMBD are 1

from torchtext.datasets import IMDB train_iter = IMDB(split='train') labels = [label for label, text in train_iter] print(sum(labels)/len(labels)) 1.0

pytorch 2.0.0+cu118 torchtext 0.15.1+cpu

Nayef211 commented 1 year ago

Closing because this is a duplicate of https://github.com/pytorch/text/issues/2041. Let's track the issue there