eBay / tsv-utils

eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
https://ebay.github.io/tsv-utils/
Boost Software License 1.0
1.43k stars 80 forks source link

tsv-select: Restrict max excluded field number. #269

Closed jondegenhardt closed 4 years ago

jondegenhardt commented 4 years ago

An update the recently added tsv-select --exclude operation. The implementation allocates memory up to the maximum excluded field. Users have no way to know this, but might casually enter arbitrarily large numbers if they want to ensure trimming the all fields beyond a certain number.

This PR limits the max excluded field to 1048576 (more than 1 million fields). This is certainly a large number of fields for TSV files, and should ensure the common case works well. The limit can be increased if use-cases arise.