Coefplot: Plotting the same regression equation using different dataframes

py-econometrics / pyfixest

Fast High-Dimensional Fixed Effects Regression in Python following fixest-syntax

MIT License

Thanks! The error arises because coefplot() loops over all models and labels each one using its _model_name attribute. That attribute is built from the formula alone, so models that differ only in their estimation sample get identical names, which causes the strange behavior above.
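As a toy illustration of that collision (this is not pyfixest's plotting code, and the dictionaries and estimates below are arbitrary placeholders), grouping results by a shared label merges what should be two separate series:

```python
from collections import defaultdict

# Toy stand-ins for two fits of "Y ~ X1" on different samples.
# The estimates are arbitrary placeholders, not real pyfixest output.
fits = [
    {"_model_name": "Y~X1", "sample": "f1 = 1", "X1": 0.5},
    {"_model_name": "Y~X1", "sample": "f1 = 2", "X1": 0.7},
]

# Group estimates by label, roughly as a plot legend would.
by_label = defaultdict(list)
for fit in fits:
    by_label[fit["_model_name"]].append(fit["X1"])

print(dict(by_label))  # {'Y~X1': [0.5, 0.7]}
```

Both estimates end up under the single label 'Y~X1', so nothing in the grouped data tells the two samples apart.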

I see two options:

Option 1: You could work with the split argument:

import pyfixest as pf

df = pf.get_data()
# Estimate one model per level of f1, restricted to f1 in {1, 2}.
fit = pf.feols("Y ~ X1", split="f1", data=df[df.f1.isin([1, 2])])
fit.coefplot(coord_flip=False, keep="X1")

Option 2: Alternatively, you could overwrite the _model_name attribute by hand:

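# Fit the same formula on two different subsamples.
fit1 = pf.feols("Y ~ X1", data=df[df.f1.isin([1])])
fit2 = pf.feols("Y ~ X1", data=df[df.f1.isin([2])])

# Both models carry the same name, which is what confuses coefplot().
fit1._model_name
# 'Y~X1'
fit2._model_name
# 'Y~X1'

# Append the sample to each name so the labels are unique.
fit1._model_name += ", sample f1 = 1"
fit2._model_name += ", sample f1 = 2"
pf.coefplot([fit1, fit2], coord_flip=False, keep="X1")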

which also produces the intended plot, with a separately labeled estimate for each sample.

Maybe it would be convenient to add a model_name argument to pf.coefplot() that would allow users to pass custom model names to avoid duplicates? Maybe we should even throw an error in case of duplicate model names?
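A minimal sketch of how both ideas could fit together, assuming nothing about pyfixest's internals: resolve_model_names is a hypothetical helper, its model_names parameter stands in for the proposed argument, and SimpleNamespace objects stand in for fitted models. Only the _model_name attribute comes from the examples above.

```python
from collections import Counter
from types import SimpleNamespace


def resolve_model_names(models, model_names=None):
    """Return one plot label per model and raise on duplicates.

    Hypothetical helper, not part of pyfixest. `model_names` mimics the
    proposed argument; `_model_name` is the attribute used above.
    """
    if model_names is None:
        # Fall back to the names the models already carry.
        names = [m._model_name for m in models]
    else:
        if len(model_names) != len(models):
            raise ValueError("model_names must have one entry per model")
        names = list(model_names)

    # Refuse to continue if any label occurs more than once.
    duplicates = sorted(n for n, k in Counter(names).items() if k > 1)
    if duplicates:
        raise ValueError(f"Duplicate model names: {duplicates}")
    return names


# Stand-ins for two fits of the same formula on different samples.
fit1 = SimpleNamespace(_model_name="Y~X1")
fit2 = SimpleNamespace(_model_name="Y~X1")

labels = resolve_model_names([fit1, fit2], ["f1 = 1", "f1 = 2"])
print(labels)  # ['f1 = 1', 'f1 = 2']
```

Calling resolve_model_names([fit1, fit2]) without custom names raises a ValueError, which would surface the duplicate-name problem up front instead of producing a confusing plot.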

Coefplot: Plotting the same regression equation using different dataframes #720