import dask.dataframe as dd
from dask import delayed
tcx1 = delayed(tcxtools.TCXPandas('workout_1.tcx').parse()) # Delay these calculations
tcx2 = delayed(tcxtools.TCXPandas('workout_2.tcx').parse()) # Use as many as needed
total = dd.from_delayed([tc1, tc2]) # However many files you need
total.visualize() # Visualize the task graph
total.compute() # Compute it
# This returns a dataframe with all the files
PS: tcx1 and tcx2 are wrong in this example on line total = dd.from_delayed([tc1, tc2]) # However many files you need
I'm getting the error:
TypeError: Series.name must be a hashable type
Anybody can help what should I do here to fix this error?
I'm doing exactly like the example on README:
PS:
tcx1
andtcx2
are wrong in this example on linetotal = dd.from_delayed([tc1, tc2]) # However many files you need
I'm getting the error:
Anybody can help what should I do here to fix this error?
Thanks in advance.