Closed markmacgillivray closed 8 years ago
The licence check running on this has to do massive amounts of regexing
That is a webpage that is about 37MB in size. Trying to regex check on it locks the machine for a long time. Resetting processes leaves the previous machine stuck processing it, and allocates it onto another machine too. Enough resets and all machines are stuck trying to run 234 regexes over 37MB of text :(
Adding a catch to academic licence to not run if the content size is greater than 10m bytes for the time being.
Can we add something to the provenance record for such items, explaining what's happened, so the user knows to check manually?
Done in dev. To push to live soon.
http://www.tandfonline.com/doi/full/10.1080/15548627.2015.1100356