gershonc / octopus-ml

A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.
MIT License
21 stars 5 forks source link

showcase pandas-summary breaks #2

Open frankbolton opened 3 years ago

frankbolton commented 3 years ago

When running the example code the following snippet breaks

from pandas_summary import DataFrameSummary dfs = DataFrameSummary(XY_df) dfs.summary()

The Conda environment in use was set up specifically for evaluation of the octopus-ml:

octopus-ml 0.1.9 pypi_0 pypi pandas 1.2.4 py38h60cbd38_0 conda-forge pandas-summary 0.0.41 py_1 conda-forge

Error message:

AttributeError Traceback (most recent call last)

in 1 import missingno as msno 2 from pandas_summary import DataFrameSummary ----> 3 dfs = DataFrameSummary(XY_df) 4 # dfs.summary() ~\Miniconda3\envs\octopus\lib\site-packages\pandas_summary\__init__.py in __init__(self, df) 25 self.df = df 26 self.length = len(df) ---> 27 self.columns_stats = self._get_stats() 28 self.corr = df.corr() 29 ~\Miniconda3\envs\octopus\lib\site-packages\pandas_summary\__init__.py in _get_stats(self) 87 # settings types 88 stats['types'] = '' ---> 89 columns_info = self._get_columns_info(stats) 90 for ctype, columns in columns_info.items(): 91 stats.ix[columns, 'types'] = ctype ~\Miniconda3\envs\octopus\lib\site-packages\pandas_summary\__init__.py in _get_columns_info(self, stats) 109 self.EXCLUDE, 110 column_info['constant'].union(column_info['bool'])) --> 111 column_info[self.TYPE_NUMERIC] = pd.Index([c for c in rest_columns 112 if common.is_numeric_dtype(self.df[c])]) 113 rest_columns = self.get_columns(self.df[rest_columns], self.EXCLUDE, column_info['numeric']) ~\Miniconda3\envs\octopus\lib\site-packages\pandas_summary\__init__.py in (.0) 110 column_info['constant'].union(column_info['bool'])) 111 column_info[self.TYPE_NUMERIC] = pd.Index([c for c in rest_columns --> 112 if common.is_numeric_dtype(self.df[c])]) 113 rest_columns = self.get_columns(self.df[rest_columns], self.EXCLUDE, column_info['numeric']) 114 column_info[self.TYPE_DATE] = pd.Index([c for c in rest_columns AttributeError: module 'pandas.core.common' has no attribute 'is_numeric_dtype'
gershonc commented 2 years ago

Yeah, it was an Pandas_summary issue, it solved now