Open timpal0l opened 8 months ago
good catch, thanks for reporting! The three flags --input_base_uri
, --output_base_uri
and --max_docs
are actually set in the config file: https://github.com/togethercomputer/RedPajama-Data/blob/bb594b01a92b7e6fcf70cf3b6659851ce17edcce/configs/rp_v2.0.conf#L4-L6
You can just drop them in the call to the apptainer script.
Invalid option: ---input_base_uri Usage: apptainer_run_quality_signals.sh [ -c | --config ] [ -d | --dump_id ]