Open alex-crr opened 1 year ago
Are you suggesting an online database of user-uploaded (image, LaTeX) pairs?
I'd rather not mess with all the privacy-related issues.
That would actually be one of the possible implementations, but your concerns about privacy are fair. I was also thinking of a local implementation, but that wouldn't really help the general model, and in that case I could just use the training feature you already provide.
Just a question from an inexperienced novice: Would it be possible (or even useful at all) to randomly produce training data for the model?
DISCLAIMER: I'm far from being proficient with machine learning or data manipulation. You could use some AI models to expand your database, but that wouldn't really be useful for OCR. Training data can be separated into two types for this model:
I get this "$|\cdot||x|||=\sum{i=1}^{n}|x{i}$" instead of "$1\cdot||x|||=\sum{i=1}^{n}|x{i}$" no matter the temperature value, every time I retry. Would it be possible to add a way to "correct" the model, or give it feedback by modifying the text it outputs?
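For what it's worth, here is a rough sketch of how the local variant of this could look: every time you fix an output by hand, the (image, corrected LaTeX) pair is appended to a file on your own machine, so nothing gets uploaded. Everything in it is an assumption on my side: `record_correction`, `load_corrections`, the `corrections.jsonl` file name and the JSON fields are made up for illustration and are not part of this project, and the pairs would still need converting to whatever format the training feature actually expects.

```python
import json
from pathlib import Path


def record_correction(image_path, predicted, corrected, store="corrections.jsonl"):
    """Append one hand-corrected sample to a local JSONL file (hypothetical format)."""
    entry = {
        "image": str(image_path),   # path to the screenshot that was recognised
        "predicted": predicted,     # what the model produced
        "latex": corrected,         # what the user says it should have been
    }
    with Path(store).open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
    return entry


def load_corrections(store="corrections.jsonl"):
    """Read all stored corrections back; returns an empty list if none exist yet."""
    path = Path(store)
    if not path.exists():
        return []
    with path.open(encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]
```

Calling `record_correction("shot.png", model_output, fixed_output)` after each manual fix would slowly build up a private set of hard examples that `load_corrections()` can hand to a fine-tuning run later.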