NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start
Other
10.13k stars 2.28k forks source link

Fix Bug: Configuring Datasets with train-data-path, valid-data-path, test-data-path #840

Open Eisenhower opened 4 months ago

Eisenhower commented 4 months ago

Fixed the bug that prevents configuring datasets using train-data-path, valid-data-path, and test-data-path.

When the --split parameter is not configured, the --split parameter will be set to the default value 969, 30, 1. In the blended_megatron_dataset_config.py file, within the __post_init__ function, the following code will raise an error when configuring datasets using train-data-path, valid-data-path, and test-data-path because the split parameter is not None:

if self.blend_per_split is not None and any(self.blend_per_split): assert self.blend is None, "blend and blend_per_split are incompatible" assert self.split is None, "split and blend_per_split are incompatible"

github-actions[bot] commented 2 months ago

Marking as stale. No activity in 60 days.