Building-ML-Pipelines / building-machine-learning-pipelines

Code repository for the O'Reilly publication "Building Machine Learning Pipelines" by Hannes Hapke & Catherine Nelson
MIT License

Data Preprocessing - Handling Multilabel Data #41

Closed: albertnanda closed this issue 3 years ago

albertnanda commented 3 years ago

How do we handle multilabel classification? I need an example for multilabel classification; I am unable to convert the target to a one-hot encoding.

hanneshapke commented 3 years ago

Hi @albertnanda, you could generate the one-hot encoding for each label and then sum the individual vectors. This will give you a multi-hot vector, with a 1 at the position of every label that applies to the example. You can perform this process with TensorFlow Transform (TFT).
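
For illustration, here is a minimal sketch of the "one-hot each label, then sum" idea in plain Python. It is not code from the book's repository: the label vocabulary and the `one_hot` and `multi_hot` helpers are hypothetical names made up for this example. Inside a TFT `preprocessing_fn`, the same steps could presumably be expressed with `tf.one_hot` followed by `tf.reduce_sum` over the label axis, but that variant is not shown or tested here.

```python
from typing import List, Sequence


def one_hot(index: int, depth: int) -> List[int]:
    """Return a vector of length `depth` with a single 1 at `index`."""
    vec = [0] * depth
    vec[index] = 1
    return vec


def multi_hot(labels: Sequence[str], vocabulary: Sequence[str]) -> List[int]:
    """Sum the one-hot vectors of all labels attached to one example."""
    index_of = {label: i for i, label in enumerate(vocabulary)}
    result = [0] * len(vocabulary)
    # Deduplicate first so a repeated label cannot push an entry above 1.
    for label in set(labels):
        encoded = one_hot(index_of[label], len(vocabulary))
        result = [r + e for r, e in zip(result, encoded)]
    return result


# Hypothetical label vocabulary, assumed for this sketch only.
VOCABULARY = ["billing", "fraud", "shipping", "support"]

print(multi_hot(["fraud", "support"], VOCABULARY))  # [0, 1, 0, 1]
```

The resulting multi-hot vector is typically paired with a sigmoid output layer and a binary cross-entropy loss, so that each label is predicted independently.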