Jgmedina95 opened this issue 2 years ago
Hi @Jgmedina95,
Thanks for raising this! This is because `@inbounds` is used throughout the evaluation for faster computation. I make an in-bounds assumption since trees created during the search should never reference features that are not available in the dataset. However, as you point out, I don't think it would hurt to check before running the evaluation, so maybe we should do that. (It could be that a user saves their search state, then re-runs with a different dataset, and is surprised when trees access undefined memory?)
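To illustrate the difference (a toy example, not the actual evaluation code):

```julia
X = rand(Float32, 3, 100)   # a dataset with 3 features

f = 7                       # a feature index that does not exist in X

checked(X, f, i)   = X[f, i]             # bounds-checked: throws a BoundsError
unchecked(X, f, i) = @inbounds X[f, i]   # check elided: may silently read invalid memory
```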
I just looked at the code, and it seems that `@inbounds` is actually not used in some places, such as:
https://github.com/MilesCranmer/SymbolicRegression.jl/blob/6075f13c3e8fbb8b16686a6f7c1157f0235174ee/src/EvaluateEquation.jl#L163
Whereas `@inbounds` is used in the functions that fuse operators together, like here:
https://github.com/MilesCranmer/SymbolicRegression.jl/blob/6075f13c3e8fbb8b16686a6f7c1157f0235174ee/src/EvaluateEquation.jl#L189
So for some trees this would raise an error, while for others it would not. I will think more about whether we should add a bounds check.
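Something along these lines is what I have in mind (a rough sketch, assuming the usual `degree`/`constant`/`feature`/`l`/`r` fields on `Node`; the function names are illustrative):

```julia
# Walk the tree and return the largest feature index it references.
function max_feature(tree)
    if tree.degree == 0
        return tree.constant ? 0 : tree.feature
    elseif tree.degree == 1
        return max_feature(tree.l)
    else
        return max(max_feature(tree.l), max_feature(tree.r))
    end
end

# One cheap check before evaluation, instead of per-element bounds checks:
function assert_features_in_bounds(tree, X::AbstractMatrix)
    m = max_feature(tree)
    m <= size(X, 1) || throw(ArgumentError(
        "tree references feature $m, but X only has $(size(X, 1)) features"))
    return nothing
end
```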
Best, Miles
Hi, I'm working on a process that uses the equations returned in the Pareto frontier. I was playing around with it and found the following: setting up a tree and evaluating it on a dataset with 6 features gives no problem, as expected.
But if I create a tree that uses more features than the original dataset has, I expected an error, yet `eval_tree_array` still gave an output with `completion == true`. I've realized this won't impact what I'm doing, since trees made by the program will never (right?) use more features than the dataset, but I suppose it's an interesting enough bug to share.
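A minimal sketch of the two cases (the `Node` constructors and `Options` arguments here are illustrative assumptions, not the exact code I ran):

```julia
using SymbolicRegression

options = Options(binary_operators=(+, *), unary_operators=(cos,))

X = randn(Float32, 6, 100)   # 6 features, 100 rows

# Tree that only uses existing features (assumed constructors: Node(; feature=i)
# for a variable node, Node(op, l, r) for a binary operator node):
ok_tree = Node(1, Node(; feature=1), Node(; feature=6))    # roughly x1 + x6
out, completed = eval_tree_array(ok_tree, X, options)
# completed == true, as expected

# Tree that references feature 9, which X does not have:
bad_tree = Node(1, Node(; feature=1), Node(; feature=9))   # roughly x1 + x9
out, completed = eval_tree_array(bad_tree, X, options)
# also "succeeds": completed == true, no error raised
```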