datablist / sample-csv-files

70 stars 24 forks source link

Sample CSV files

This repository contains sample Comma Separated Value (CSV) files. CSV is a generic flat file format used to store structured data. Datasets are split in 3 categories: Customers, Users and Organizations. For each, sample CSV files range from 100 to 2 millions records. Those CSV files can be used for testing purpose. They can be open by any application compatible with CSV files or with a CSV editor.

The datasets are generated using random values. Mosly using Python Faker package.

Customers CSV Sample

Customer Schema

People CSV Samples

People Schema

Organizations CSV Samples

Organization Schema

Local Set up to generate files

Python Environments

Create a Python virtual env:

python3 -m venv venv/sample-csv

Activate it

source venv/sample-csv/bin/activate

So you can install dependencies:

pip install -r requirements.txt

Run script

python src/main.py