In each batch, I firstly compute one independent E{n} for that batch, but it doesn't work at all; but according to your code, E{n} is global, there exists only one value for entire mini-batch GD optimization process, it works evidently. I want to know the reason?
In each batch, I firstly compute one independent E{n} for that batch, but it doesn't work at all; but according to your code, E{n} is global, there exists only one value for entire mini-batch GD optimization process, it works evidently. I want to know the reason?