eBay / tsv-utils

eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
https://ebay.github.io/tsv-utils/
Boost Software License 1.0
1.43k stars 80 forks source link

Directory cat #277

Closed jondegenhardt closed 4 years ago

jondegenhardt commented 4 years ago

Adds a new build/test tool, dircat that concatenates the files in a directory into a single output stream. Each file's contents is preceded by the filename. This is needed for tsv-split command line testing.

It is similar to using tail -n +1 on a directory, which is what tsv-split command line tests did before. However, this had problems with inconsistent file listing orders on different platforms, causing trouble for CI testing. Having a standalone tool addresses this problem.

This PR also contains updates to tsv-split command line testing, including a few tests that were dropped earlier due to platform consistency issues.