Add with_rank to Dataset.from_generator similar to Dataset.map and Dataset.filter.
Motivation
As for Dataset.map and Dataset.filter, this is useful when creating cache files using multi-GPU, where the rank can be used to select GPU IDs. For now, rank can be added in the gen_kwars argument; however, this, in turn, includes the rank when computing the fingerprint.
Your contribution
Added #7199 which passes rank based on the job_id set by num_proc.
Feature request
Add
with_rank
toDataset.from_generator
similar toDataset.map
andDataset.filter
.Motivation
As for
Dataset.map
andDataset.filter
, this is useful when creating cache files using multi-GPU, where the rank can be used to select GPU IDs. For now, rank can be added in thegen_kwars
argument; however, this, in turn, includes the rank when computing the fingerprint.Your contribution
Added #7199 which passes rank based on the
job_id
set bynum_proc
.