Open flipdazed opened 7 years ago
HDF5?
tested hdf5
against cPickle
import h5py
import numpy as np
import timeit
import cPickle as pkl
a = np.random.random((10000,10000))
t1 = timeit.default_timer()
with open('data.pkl', 'wb') as f:
pkl.dump(a,f)
t1 = timeit.default_timer() - t1
t2 = timeit.default_timer()
h5f = h5py.File('data.hdf5','w')
h5f.create_dataset('dataset_1', data=a)
h5f.close()
t2 = timeit.default_timer() - t2
print 'cPickle: {} secs'.format(t1)
print 'hdf5: {} secs'.format(t2)
## -- End pasted text --
cPickle: 57.8580451012 secs
hdf5: 1.47937297821 secs
Can set up an HDF5 server: https://github.com/HDFGroup/h5serv
Aim Need a method of pull live data from the cloud in Amazon EC2