scicloj / tablecloth

Dataset manipulation library built on top of tech.ml.dataset
https://scicloj.github.io/tablecloth
MIT License

Improve support for missing values for column api #101

Closed ezmiller closed 1 year ago

ezmiller commented 1 year ago

Goal

Add basic missing-value support to the column API, paralleling that in TC's main dataset API.

Solution

This PR adds the following fns:

The only genuinely new function here is count-missing, which is just a convenience. replace-missing and drop-missing delegate to tech.v3.dataset's functions: they pack the column into a single-column dataset, call those functions, and then extract the column from the result.
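The pack, call, extract pattern described above can be sketched roughly as follows. This is an illustration only, not the code in this PR: the helper `with-column-as-dataset`, the `-sketch` function names, and the temporary column name `:col` are all hypothetical, and it assumes tech.v3.dataset's `->dataset`, `drop-missing`, `replace-missing`, and `row-count`.

```clojure
(require '[tech.v3.dataset :as ds])

;; Hypothetical helper: wrap a column (or any sequence) in a one-column
;; dataset, apply a dataset-level function f, then pull the column back out.
(defn with-column-as-dataset
  [col f]
  (-> (ds/->dataset {:col col})
      f
      :col))

;; Drop missing entries by delegating to the dataset-level function.
(defn drop-missing-sketch
  [col]
  (with-column-as-dataset col ds/drop-missing))

;; Replace missing entries with a fixed value, again via the dataset API.
(defn replace-missing-sketch
  [col value]
  (with-column-as-dataset col #(ds/replace-missing % [:col] :value value)))

;; Count missing entries as the number of rows that drop-missing removes.
;; (An assumption for illustration; the PR's count-missing may differ.)
(defn count-missing-sketch
  [col]
  (let [dataset (ds/->dataset {:col col})]
    (- (ds/row-count dataset)
       (ds/row-count (ds/drop-missing dataset)))))
```

For example, `(drop-missing-sketch [1 nil 3])` would return a column without the nil entry, and `(count-missing-sketch [1 nil 3])` would report one missing value. Routing through a dataset keeps the column functions consistent with the dataset API, at the cost of constructing a temporary dataset on each call.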