Open ctjlewis opened 3 years ago
This data is extremely poorly-formatted. A huge CSV would be ideal, rather than grouping everything into arbitrary 200kB buckets and then cutting off at > 1MB.
Could anyone provide the actual source code used to generate this data so it can be replicated as well?
This data is extremely poorly-formatted. A huge CSV would be ideal, rather than grouping everything into arbitrary 200kB buckets and then cutting off at > 1MB.
Could anyone provide the actual source code used to generate this data so it can be replicated as well?