hosseinmoein / DataFrame

C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
https://hosseinmoein.github.io/DataFrame/
BSD 3-Clause "New" or "Revised" License
2.53k stars 313 forks source link

Remove Duplicates (just by Index). #309

Closed TheBlackPlague closed 4 months ago

TheBlackPlague commented 4 months ago

This is super helpful if one has a data frame indexed by date.

Also, as a statistical thing, it would be nice to count the number of duplicates in a Data Frame too.

hosseinmoein commented 4 months ago

They are on my to do list

hosseinmoein commented 4 months ago

I actually already had the code for remove_duplicates() with index column by itself. I merged it to master. For the count look at value_counts(). See if that satisfies your requirements.