I no longer work on this project.
To sum things up, in order for this to work better:
Okay, here's a gif showing the current state of the model.
It ain't much but it's honest work ¯\_(ツ)_/¯
In progress. Current state:
The loss of detection part started to flatten,
so I experimented with the relative weights of
losses from both branches when computing grads.
This results in shifting the trade-off between
what will the model learn more: detect or recognize.
That is why the sudden drop in detection loss.
The problem is the code is still very messy and poorly structured.
Given that the main project pipeline is working, next steps are:
Once above is finished: