Unable to use flow_from_dataframe - y_col must be str,list,tuple

keras-team / keras-preprocessing

Utilities for working with image data, text data, and sequence data.


Hi,

I am trying to train a multi-task CNN using flow_from_dataframe. The columns in the dataframe already hold str values, but dtypes shows "object" no matter how I try to convert them to strings. It seems pandas reports the object dtype even for str columns.

The dataframe has these columns:

Image       PFRType  FuelType
image1.jpg  1-3      NG

Image       object
PFRType     object
FuelType    object
dir         object
dtype: object

And I get this error: If class_mode="sparse", y_col="['PFRType', 'FuelType']" column values must be strings.

Here is the code for the generator:

trainGen = ImageDataGenerator()
trainGenDf = trainGen.flow_from_dataframe(trainLabel,
                                         directory = '../MTLData/train/',
                                         x_col = "Image",y_col=['PFRType', 'FuelType'],
                                         class_mode='sparse',
                                         target_size=(224,224),
                                         batch_size=32)

I am using Keras version 2.3.1. Can someone please help?
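On the dtype question: in pandas releases from the Keras 2.3.1 era, a column of Python str values is reported as object, so the dtype alone does not show whether every value is a string. A minimal sketch of a per-value check follows. The helper name column_is_all_str and the second sample row are made up for illustration.

```python
import pandas as pd

def column_is_all_str(df, col):
    """Return True if every value in df[col] is a Python str."""
    return bool(df[col].map(lambda v: isinstance(v, str)).all())

# Hypothetical sample rows modelled on the dataframe in the question.
sample = pd.DataFrame({
    "Image": ["image1.jpg", "image2.jpg"],
    "PFRType": ["1-3", "4-6"],
    "FuelType": ["NG", "NG"],
})

# Print the reported dtype next to the per-value check for each label column.
for col in ["PFRType", "FuelType"]:
    print(col, sample[col].dtype, column_is_all_str(sample, col))
```

If this check passes for both label columns, the string conversion is probably not the cause, and the list passed as y_col together with class_mode='sparse' is the more likely trigger.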

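# Select both columns with a list; a tuple would be looked up as a single column name.
dataframe['combined_classes'] = dataframe[['PFRType', 'FuelType']].apply(lambda x: x.tolist(), axis=1)

trainGen = ImageDataGenerator()
trainGenDf = trainGen.flow_from_dataframe(dataframe,
                                         directory = '../MTLData/train/',
                                         x_col = "Image", y_col='combined_classes',
                                         # list-valued labels need 'categorical' (multi-hot output);
                                         # 'sparse' accepts only string labels
                                         class_mode='categorical',
                                         target_size=(224,224),
                                         batch_size=32)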
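If the goal is one output per task rather than one combined label, a possible alternative is to integer-encode each label column and pass the encoded columns with class_mode='multi_output'. This is only a sketch under assumptions: the installed keras-preprocessing must support 'multi_output', the helper names encode_labels and make_generator and the "_id" column suffix are invented here, and the values "4-6" and "Oil" are placeholder labels that do not come from the original data.

```python
import pandas as pd

def encode_labels(df, label_cols):
    """Return a copy of df with an integer '<col>_id' column per label column,
    plus a dict mapping each label column to its sorted list of classes."""
    out = df.copy()
    classes = {}
    for col in label_cols:
        codes, uniques = pd.factorize(out[col], sort=True)
        out[col + "_id"] = codes
        classes[col] = list(uniques)
    return out, classes

def make_generator(df, id_cols, directory):
    """Build a generator that yields one label array per column in id_cols."""
    # Imported lazily so the encoding step can be tried without Keras installed.
    from keras.preprocessing.image import ImageDataGenerator
    return ImageDataGenerator().flow_from_dataframe(
        df,
        directory=directory,
        x_col="Image",
        y_col=id_cols,
        class_mode="multi_output",
        target_size=(224, 224),
        batch_size=32,
    )

# Hypothetical rows; only "1-3" and "NG" appear in the question.
sample = pd.DataFrame({
    "Image": ["image1.jpg", "image2.jpg", "image3.jpg"],
    "PFRType": ["1-3", "4-6", "1-3"],
    "FuelType": ["NG", "Oil", "NG"],
})
encoded, classes = encode_labels(sample, ["PFRType", "FuelType"])
print(classes)
```

With the real dataframe, the call would look like make_generator(encoded, ["PFRType_id", "FuelType_id"], '../MTLData/train/'). The model would then need two outputs, each trained with a sparse categorical loss, since the generator yields integer class indices instead of one-hot vectors.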

Unable to use flow_from_dataframe - y_col must be str,list,tuple #286