triskadecaepyon / pyworkout-toolkit

Python tools to process workout data and telemetry for metrics and machine learning
BSD 3-Clause "New" or "Revised" License
18 stars 3 forks source link

Error on reading many TCX using DASK #11

Open albertoivo opened 3 years ago

albertoivo commented 3 years ago

I'm doing exactly like the example on README:

import dask.dataframe as dd
from dask import delayed

tcx1 = delayed(tcxtools.TCXPandas('workout_1.tcx').parse()) # Delay these calculations
tcx2 = delayed(tcxtools.TCXPandas('workout_2.tcx').parse()) # Use as many as needed

total = dd.from_delayed([tc1, tc2]) # However many files you need
total.visualize() # Visualize the task graph
total.compute() # Compute it
# This returns a dataframe with all the files

PS: tcx1 and tcx2 are wrong in this example on line total = dd.from_delayed([tc1, tc2]) # However many files you need

I'm getting the error:

TypeError: Series.name must be a hashable type

Anybody can help what should I do here to fix this error?

Thanks in advance.

triskadecaepyon commented 3 years ago

Sorry about the delay. I need to track down and see where the error is coming from. A lot of dask has changed so my examples will need to change too.