graykode / matorage

Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)
https://matorage.readthedocs.io
Other
73 stars 8 forks source link

Feature for storing not only 'numpy arrays' but also file format required for training #7

Closed graykode closed 4 years ago

graykode commented 4 years ago

Many deep learning tasks require file as well as numpy arrays to evaluate the trained model. These formats are applicable :

Therefore, it is implemented so that it can be used as above through data saver :

data_config = DataConfig(
        endpoint="127.0.0.1:9000",
        access_key="minio",
        secret_key="miniosecretkey",
        dataset_name="mnist",
        attributes=[
            ("input_ids", "int64", (384)),
       ],
    )
data_saver = DataSaver(config=data_config, refresh=True)

# general data saver for saving numpy array
data_saver({
      "input_ids": batch[0],
}, filetype=False)
data_saver.disconnect()

# file data saver
data_saver({
      "key": `file path`,
}, filetype=True)
data_saver.disconnect()

ToDo