Make parameter estimation before fitting more transparent

AUTProgram commented 1 year ago

The current way of initialising parameters is described correctly in the docstrings for ModelParameter.initialise and ModelParameter.get_initial_value, but may feel unintuitive for users. I introduced a bug in one of my scripts because I assumed the wrong initial parameter value.

For example, if one does

param.fixed_to = 0.9
param.initialise(1.0)

then param.initialised_to is 0.9 afterwards. So the argument passed to initialise does not necessarily affect the value to which the parameter is initialised. Rather, it is just a fallback in case no other value takes precedence. I think it's worth thinking about whether we can improve this part of the codebase.
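To make the surprise concrete, here is a toy stand-in that reproduces only the precedence described above. LegacyParameter is a hypothetical name and this is not the actual ModelParameter implementation, which may consider other sources of values as well. It only assumes that a fixed value wins over the argument.

```python
from typing import Optional


class LegacyParameter:
    """Hypothetical stand-in, NOT the real ModelParameter."""

    def __init__(self) -> None:
        self.fixed_to: Optional[float] = None
        self.initialised_to: Optional[float] = None

    def initialise(self, fallback: float) -> None:
        # Assumed precedence: a fixed value wins, the argument is only a fallback.
        if self.fixed_to is not None:
            self.initialised_to = self.fixed_to
        else:
            self.initialised_to = fallback


param = LegacyParameter()
param.fixed_to = 0.9
param.initialise(1.0)
print(param.initialised_to)  # 0.9, not the 1.0 that was passed in
```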

Firstly, I find the word initialise already a bit ambiguous because I would argue that a parameter that is fixed during a fit does not need to be assigned an initial value in the sense of "initial parameter estimate for curve fitting". So for parameters that are fixed during a fit (for which fixed_to is not None), it's questionable whether param.initialise() has the same meaning as for parameters that are floated. As far as I can tell, for fixed parameters the attribute initialised_to is not actually used by the Fitter._fit method anyway.

I believe conceptually there are only three states a parameter can be in:

- fixed_to a certain value, in which case it does not need an initial value
- not fixed, and an initial value has been specified by the user
- not fixed, and no initial value has been specified by the user, in which case it still needs to be given an initial estimate by the model routine before fitting
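The three states can be written down as a small classification. This is only a sketch. ParameterState and classify are hypothetical names that do not exist in the codebase, and the function looks only at the two attributes discussed here.

```python
from enum import Enum
from typing import Optional


class ParameterState(Enum):
    FIXED = "fixed"                    # held constant, no initial value needed
    INITIALISED = "initialised"        # floated, initial value already given
    NEEDS_ESTIMATE = "needs estimate"  # floated, model must supply an estimate


def classify(fixed_to: Optional[float],
             initialised_to: Optional[float]) -> ParameterState:
    # A fixed value is checked first because it makes the initial value irrelevant.
    if fixed_to is not None:
        return ParameterState.FIXED
    if initialised_to is not None:
        return ParameterState.INITIALISED
    return ParameterState.NEEDS_ESTIMATE
```

Note that the checks use "is not None" rather than truthiness, so a value of 0.0 still counts as set.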

I think one way of reflecting these options would be (leaving out the attributes and methods not relevant here):

from typing import Optional

class ModelParameter:
    fixed_to: Optional[float] = None
    initialised_to: Optional[float] = None

    def needs_initialising(self) -> bool:
        # True only for a floated parameter that has no initial value yet.
        return self.fixed_to is None and self.initialised_to is None

    def initialise(self, value: float) -> None:
        if self.fixed_to is not None:
            raise RuntimeError("Parameter is fixed, no initial value needed.")
        self.initialised_to = value

class Model:
    def estimate_parameters(self, x, y, model_parameters):
        if model_parameters["param"].needs_initialising():
            # some_value is a placeholder for the model's own estimate from x and y.
            model_parameters["param"].initialise(some_value)

This would mean that if the user has set param.fixed_to or param.initialised_to before fitting, those values would just be used directly. If initialise is called on a parameter, the user can be certain that the value passed will be the value that initialised_to is set to afterwards. Note that a fixed parameter would have self.initialised_to = None, while a floated parameter would have self.fixed_to = None. However, a downside of this approach is that the if model_parameters["param"].needs_initialising() statement has to be repeated for every parameter, which I don't like.
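One way to avoid repeating the check might be a small helper that loops over the parameters and only initialises those that still need it. This is just a sketch: initialise_missing and the estimators mapping are hypothetical names, not part of the existing codebase, and ModelParameter is reduced to the two attributes discussed here. The estimates are passed as callables so that nothing is computed for parameters that are fixed or already initialised.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class ModelParameter:
    fixed_to: Optional[float] = None
    initialised_to: Optional[float] = None

    def needs_initialising(self) -> bool:
        return self.fixed_to is None and self.initialised_to is None

    def initialise(self, value: float) -> None:
        if self.fixed_to is not None:
            raise RuntimeError("Parameter is fixed, no initial value needed.")
        self.initialised_to = value


def initialise_missing(
    model_parameters: Dict[str, ModelParameter],
    estimators: Dict[str, Callable[[], float]],
) -> None:
    """Hypothetical helper: apply an estimate only where one is still needed."""
    for name, estimate in estimators.items():
        param = model_parameters[name]
        if param.needs_initialising():
            param.initialise(estimate())


params = {
    "a": ModelParameter(fixed_to=0.9),        # fixed: left untouched
    "b": ModelParameter(initialised_to=2.0),  # set by the user: left untouched
    "c": ModelParameter(),                    # floated and unset: gets the estimate
}
initialise_missing(params, {name: (lambda: 1.0) for name in params})
```

With something like this, estimate_parameters would build the mapping of estimators once and call the helper, instead of wrapping every parameter in its own if statement.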

Not completely sure about the right way of doing things here, but I thought I'd put it out there to see what other people think.