Open jterry64 opened 4 years ago
Actually, it seems this can just happen occasionally when repartitioning with Spark. There are ways to get rid of empty partitions, but since they seem to appear at random, I think I'll just make the datapump more resilient to them.
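One way the "more resilient" approach could look is to drop zero-byte part files before anything is uploaded. This is only a minimal sketch, not the datapump's actual code: the function name `non_empty_parts`, the `(name, size)` listing format, and the example file names are all hypothetical, and it assumes the sizes come from whatever storage listing is already available.

```python
from typing import Iterable, List, Tuple


def non_empty_parts(parts: Iterable[Tuple[str, int]]) -> List[str]:
    """Return the names of part files whose size is greater than zero bytes.

    `parts` is a hypothetical listing of (file name, size in bytes) pairs.
    """
    return [name for name, size in parts if size > 0]


# Hypothetical listing of Spark output: the middle part file is empty.
listing = [
    ("part-00000.csv", 1024),
    ("part-00001.csv", 0),
    ("part-00002.csv", 87),
]

# Only these names would be passed on to the upload step.
uploadable = non_empty_parts(listing)
```

Filtering on size keeps the check independent of how Spark happened to partition the data, so a stray empty partition would be skipped rather than failing the upload.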
Whitelist is occasionally generating empty part files. When trying to upload them to the GFW API, this causes an empty-file error that might put the dataset into a failed state. It's easy to work around, but it seems better to just stop it from happening in the first place.