Use pd.Categorical for index columns

When the input is a dict of fit results (keys are the categories, values are fit results) the "index" or "key" column in the returned DataFrame contains strings. This column should be converted to categorical, as it is naturally a category.

Conversely, when the input is a list of fit results the "index" is an integer. Initially I though to convert also this column to categorical type, since it can save some space (index are int64 while categorical use the smallest int for internal codes). However this approach causes problems with seaborn (see https://github.com/mwaskom/seaborn/issues/997). Briefly, seaborn always builds FacetGrid plots looking at all the categories in the given column. So when selecting fit results (let's say index < 6, as in the example notebook), seaborn will plot all the empty axes for the empty categories. The solution would be the remove the unused categories after selection, but this requires longer and more convoluted pandas commands.

There may be other subtle issue in using categorical for integer columns, with no clear benefit. Therefore I think is better to leave the integer columns added by pybroom (indicating index in a list of fit results) as integer type.

tritemio / pybroom

Use pd.Categorical for index columns #3