huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0
19.1k stars 2.66k forks source link

Remove columns in interable dataset #5842

Closed surya-narayanan closed 1 year ago

surya-narayanan commented 1 year ago

Feature request

Right now, remove_columns() produces a NotImplementedError for iterable style datasets

Motivation

It would be great to have the same functionality irrespective of whether one is using an iterable or a map-style dataset

Your contribution

hope and courage.

amyeroberts commented 1 year ago

Transferring this issue as it's related to the 🤗 Datasets library

Bearnardd commented 1 year ago

Hi @surya-narayanan! Could you provide some code snippet?

mariosasko commented 1 year ago

This method has been recently added to the IterableDataset, so you need to update the datasets' installation (pip install -U datasets) to use it.