Open zhujiem opened 6 months ago
Hugging Face Datasets:
dataset = load_dataset("parquet", data_files={split: data_blocks}, split=split)
super().__init__(dataset=dataset, num_workers=8, batch_size=self.batch_size)