KWARC / llamapun

common language and mathematics processing algorithms, in Rust
https://kwarc.info/systems/llamapun/
GNU General Public License v3.0
25 stars 6 forks source link

Sample scanner for pre-words to `.ltx_ref` elements #27

Closed dginev closed 5 years ago

dginev commented 5 years ago

Requested by Bruce at https://github.com/brucemiller/LaTeXML/issues/1043 , pretty simple to put together.

Downside is that it take 90 s for 1000 documents, so maybe should be sped up before merging...

dginev commented 5 years ago

Took just about 2 days over arXiv 08.2018, and the result is at: https://gist.github.com/dginev/c83d239524e1380f7b0e5e92a24a5eb2

Will merge here, and revisit if we ever get the thread-safe libxml wrapper operational.