Thank you for your work, the results are pretty cool!
I am trying to reproduce your work for a personal project, but it seems for dataset generation many times in the src downloaded from Arxiv there are many .tex files, I can always merge into a single .tex files maybe using a simple script, so my question is do you expect only a single tex file converted to HTML using LATExml for each pdf?
Hey,
Thank you for your work, the results are pretty cool!
I am trying to reproduce your work for a personal project, but it seems for dataset generation many times in the src downloaded from Arxiv there are many .tex files, I can always merge into a single .tex files maybe using a simple script, so my question is do you expect only a single tex file converted to HTML using LATExml for each pdf?
Best, Saksham