A missing value should not be used in calculations of entropy or conditional entropies, and should be only used as "last resort" in the feature selection model (Series.mode have a flag to ignore nans)
a "none" level, if considered an acceptable value, should be used in entropy calculations and in the calculation of mode in the feature selection model.
Two different concepts:
Series.mode
have a flag to ignore nans)