Better e2e parsing - Githubissues

Before this PR, all the visual equation detection results are only saved as images but not in the structure.csv file. We fix this by pairing the text-based equation token classification results and the visual equation detections in a way similar to figure and caption matching and save it in the structure.csv file.

We also add the vila paper to the tests for #33.

allenai / vila

Better e2e parsing #34