RaidasGrisk / tf2-fots

FOTS text detection and recognition

tf2-fots

I no longer work on this project.
To sum things up, for this to work better it would need:

  1. A bigger model; MobileNet does not seem to suffice.
  2. Better training data, with more varied text angles, sizes and lengths.
  3. Better-designed loss functions, and better ways of combining the losses of the different model branches.

Okay, here's a gif showing the current state of the model.
It ain't much but it's honest work ¯\_(ツ)_/¯

Gif

In progress. Current state:

Loss

The detection loss started to flatten,
so I experimented with the relative weights of the
losses from the two branches when computing gradients.

This shifts the trade-off between what the model
learns more: detection or recognition.
That explains the sudden drop in detection loss.
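
To make the trade-off concrete, here is a toy sketch in plain Python. It is not the training code from this repo: the two quadratic losses, the single shared parameter `w`, and the names `det_weight`, `rec_weight` and `train` are all assumptions made up for illustration. The only point is that when two branches share parameters, changing the relative loss weights moves the solution toward whichever branch is weighted more.

```python
# Toy stand-ins for the two branches (hypothetical, not the real FOTS losses).
# Both depend on one shared parameter w and pull it in opposite directions.
def det_loss(w):
    return (w - 1.0) ** 2   # "detection" is happiest at w = +1

def rec_loss(w):
    return (w + 1.0) ** 2   # "recognition" is happiest at w = -1

def weighted_grad(w, det_weight, rec_weight):
    # Gradient of det_weight * det_loss + rec_weight * rec_loss w.r.t. w.
    det_grad = 2.0 * (w - 1.0)
    rec_grad = 2.0 * (w + 1.0)
    return det_weight * det_grad + rec_weight * rec_grad

def train(det_weight, rec_weight, steps=200, lr=0.05, w=0.0):
    # Plain gradient descent on the weighted sum of the two losses.
    for _ in range(steps):
        w -= lr * weighted_grad(w, det_weight, rec_weight)
    return w

w_equal = train(det_weight=1.0, rec_weight=1.0)
w_det_heavy = train(det_weight=3.0, rec_weight=1.0)

# Up-weighting detection lowers its loss and raises the recognition loss.
print("equal weights:   det", det_loss(w_equal), "rec", rec_loss(w_equal))
print("detection heavy: det", det_loss(w_det_heavy), "rec", rec_loss(w_det_heavy))
```

In this toy setup the weighted optimum sits at `(det_weight - rec_weight) / (det_weight + rec_weight)`, so raising the detection weight drops the detection loss at the cost of the recognition loss, which is the same kind of shift seen in the loss plot.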

TODO

The problem is that the code is still very messy and poorly structured.
Given that the main project pipeline is working, the next steps are:

Once above is finished: