datamole-ai / edvart

An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal coding.
https://datamole-ai.github.io/edvart/
MIT License
48 stars 7 forks source link

fix: fix default number of histogram bins being extremely high #167

Closed mbelak-dtml closed 1 year ago

mbelak-dtml commented 1 year ago

The high number of bins caused extreme loss of performance for some specific data. For example, for the column mass (g) of the edvart.example_datasets.dataset_meteorite_landings(), the number of inferred bins is over 5M, even though the dataset contains under 50k rows.