dcomtois / summarytools

R Package to Quickly and Neatly Summarize Data
522 stars 78 forks source link

Suggestion: how often does ID values exist #43

Closed paulfeitsma closed 5 years ago

paulfeitsma commented 6 years ago

When analyzing a data set with e.g. client ID's it is very usefull to know how often unique ID's appear in the dataset. e.g. 90% appears once, 5% appears twice, etc.. (data frame summary)

dcomtois commented 6 years ago

Not sure how this is different from what is already in place, i.e. frequencies of most common strings / factors...?

dcomtois commented 5 years ago

After some thought, I find this very difficult to implement, as there is no way of knowing if a numeric column is in fact an ID... If users turn it into a factor however, then they'll get the frequencies.