lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.
https://lukas-blecher.github.io/LaTeX-OCR/
MIT License
12.59k stars 1.03k forks source link

Model feedback implementation? #317

Open alex-crr opened 1 year ago

alex-crr commented 1 year ago

I get this "$|\cdot||x|||=\sum{i=1}^{n}|x{i}$" instead of "$1\cdot||x|||=\sum{i=1}^{n}|x{i}$" no matter the temperature value, every time I retry. Would it be possible to add a way to "correct" the model, or give it feedback by modifying the text it outputs?

lukas-blecher commented 1 year ago

Are you suggesting a online database with user uploaded image, latex pairs?

I'd rather not mess with all the privacy related issues.

alex-crr commented 1 year ago

That actually would be one of the possible implementations, but your concern about privacy are fair. I was also thinking of a local implementation, but that wouldn't really help the general model, and I could still use the training feature you provided in this case.

exaCORE42 commented 1 year ago

Just a question from an inexperienced novice: Would it be possible (or even useful at all) to randomly produce training data for the model?

alex-crr commented 1 year ago

DISCLAIMER: I'm far from being proficient with machine learning or data manipulation You could use some AI models to expand your database, but that wouldn't really be useful on OCR. Training data can be separated on two types for this model: