hosseinmoein / DataFrame

C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
https://hosseinmoein.github.io/DataFrame/
BSD 3-Clause "New" or "Revised" License
2.38k stars 298 forks source link

Remove Duplicates (just by Index). #309

Closed TheBlackPlague closed 1 week ago

TheBlackPlague commented 2 weeks ago

This is super helpful if one has a data frame indexed by date.

Also, as a statistical thing, it would be nice to count the number of duplicates in a Data Frame too.

hosseinmoein commented 2 weeks ago

They are on my to do list

hosseinmoein commented 2 weeks ago

I actually already had the code for remove_duplicates() with index column by itself. I merged it to master. For the count look at value_counts(). See if that satisfies your requirements.