Closed laserkelvin closed 1 month ago
So window size is used by the running average, so as you're iterating through the dataset it will do (by default) a running average of properties based on 10 of the last samples. It's different from just capping the number of samples to go through, because you might want to sweep through the data and look for outliers.
This PR adds a big QoL oriented CLI, which provides some high level functionality for inspecting LMDB datasets.
matsciml.datasets.lmdb_cli
module, which houses aclick
-based interface with multiple commands that perform various LMDB inspection taskspyproject.toml
to installlmdb_cli
as a "script", which allows you to access the CLI after installingmatsciml
simply by runninglmdb_cli
in the command line.