HK3-Lab-Team / pytrousse

PyTrousse collects into one toolbox a set of data wrangling procedures tailored for composing reproducible analytics pipelines.
Apache License 2.0
0 stars 1 forks source link

Include a property that finds datetime columns #63

Open lorenz-gorini opened 4 years ago

lorenz-gorini commented 4 years ago

Datetime columns are really hard to be identified by pandas and they are assigned with dtype="object". Since pytrousse package wants to avoid the manual analysis of each column with dtype="object", it could be useful to include a property that returns the possible columns containing datetime values.

My proposal to do so it to try to convert to datetime values the columns with dtype="object" (by using pd.to_datetime() function) and a check can be performed on how many values are lost on those columns. This information can be shared with the user that can decide which columns should really be converted or not.