Open bogdanscode opened 3 months ago
Convert .md files to parquet files so that they can be processed by data prep pipeline This is the preferred input for InstructLab
178
Also, in the future can you sign your commits?
Oh and the code2parquet transform is in transforms/code/code2parquet
Why are these changes needed?
Convert .md files to parquet files so that they can be processed by data prep pipeline This is the preferred input for InstructLab
Related issue number (if any).
178