tensorchiefs / data

data
0 stars 0 forks source link

Handling of large data #1

Open oduerr opened 4 months ago

oduerr commented 4 months ago

Handling of large data. It should be possible to load data from repositories like dropbox. They should be CSV files or gz.csv.

oduerr commented 4 months ago

Description

We aim to create a flexible and platform-independent caching system for CSV files, supporting both R and Python. This system will enable easy caching, loading, and documentation of datasets. The main components for each dataset are:

Tasks

Data Management and Caching:

Separation of Large and Small Files:

Documentation: