tecosaur / DataToolkit.jl

Reproducible, flexible, and convenient data management

#+title: [[file:Main/docs/src/assets/logotype.svg]]
#+author: tecosaur



DataToolkit is a batteries-included family of packages for robustly managing data. The particular package(s) you want to use will depend on the project.

For now, this set of packages is around the beta stage of development. No major changes to the core functionality or structure are anticipated, but small expansions in the data-CLI functionality and in the set of transformers and plugins provided by /DataToolkitCommon/ are expected prior to the 1.0 release, and larger changes may occur if there is good reason for them.

#+begin_src toml
[[iris]]
uuid = "3f3d7714-22aa-4555-a950-78f43b74b81c"
description = "Fisher's famous Iris flower measurements"

    [[iris.storage]]
    driver = "web"
    checksum = "k12:cfb9a6a302f58e5a9b0c815bb7e8efb4"
    url = "https://raw.githubusercontent.com/scikit-learn/scikit-learn/1.0/sklearn/datasets/data/iris.csv"

    [[iris.loader]]
    driver = "csv"
    args.header = ["sepal_length", "sepal_width", "petal_length", "petal_width", "species_class"]
    args.skipto = 2
#+end_src
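
As a rough illustration of how such an entry is structured, the following sketch parses it with Julia's standard-library TOML parser. It does not use DataToolkit itself. The =[[iris.storage]]= and =[[iris.loader]]= sub-table headers and the variable names are assumptions made for this illustration.

#+begin_src julia
using TOML  # Julia standard library, not part of DataToolkit

# The dataset entry as a string. The [[iris.storage]] and [[iris.loader]]
# headers are assumed here; they keep the two `driver` keys in separate tables.
spec = """
[[iris]]
uuid = "3f3d7714-22aa-4555-a950-78f43b74b81c"
description = "Fisher's famous Iris flower measurements"

    [[iris.storage]]
    driver = "web"
    checksum = "k12:cfb9a6a302f58e5a9b0c815bb7e8efb4"
    url = "https://raw.githubusercontent.com/scikit-learn/scikit-learn/1.0/sklearn/datasets/data/iris.csv"

    [[iris.loader]]
    driver = "csv"
    args.header = ["sepal_length", "sepal_width", "petal_length", "petal_width", "species_class"]
    args.skipto = 2
"""

parsed = TOML.parse(spec)

# Double brackets denote an array of tables, so every level is a list.
iris = only(parsed["iris"])
storage = only(iris["storage"])
loader = only(iris["loader"])

# Show which driver fetches the data and which one parses it.
println("storage driver: ", storage["driver"])
println("loader driver:  ", loader["driver"])
println("column names:   ", join(loader["args"]["header"], ", "))
#+end_src

Because =[[...]]= denotes an array of tables in TOML, each of =iris=, =storage=, and =loader= is parsed as a list, which is why the sketch unwraps them with =only=.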


** Similar Packages

** Relevant Links