Closed titaniumbones closed 6 years ago
How would this fit into existing workflows? I don't think we want to rebuild Versionista, right?
It looks like it would be a great help to be able to filter out some diffs that are repeated many times over (mostly header & footer). In order to do that ,we would need to read & diff the files somehow. Obviously this requires an extension first.
On 01/28/2017 05:13 PM, James McKinney wrote:
How would this fit into existing workflows? I don't think we want to rebuild Versionista, right?
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/edgi-govdata-archiving/version-tracking-ui/issues/9#issuecomment-275878456, or mute the thread https://github.com/notifications/unsubscribe-auth/AAWPNEVaGR48Fq8Am1u8_ABq7Kdb4e9Eks5rW72mgaJpZM4LwnsU.
In that case, we're just using Versionista to collect versions, and then performing all the diffing ourselves?
well, sorta. We would need to perform the diff, see if it passes a test, and if not mark it as irrelevant. We could sitll use the versionista UI for viewing diffs, we'd just perform an initial low-bar-of-acceptance filter against known trivial changes. It's a dramatic expansion of the tool, I guess -- but eventually we're going to need it, I guess the monitoring team is finding it a full-time job for about 5 people already.
On 01/28/2017 06:39 PM, James McKinney wrote:
In that case, we're just using Versionista to collect versions, and then performing all the diffing ourselves?
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/edgi-govdata-archiving/version-tracking-ui/issues/9#issuecomment-275882738, or mute the thread https://github.com/notifications/unsubscribe-auth/AAWPNLzU1jG45Pd9RVYrNiWa5PA8cZlFks5rW9GkgaJpZM4LwnsU.
This is happening (so exciting!) in a few different repos, see https://github.com/edgi-govdata-archiving/web-monitoring for progress. Closing as appropriate issues are ticketed there
jsdifflib-node seems to be relatively actively maintained, but is, obviously, designed for node; here's the parent library, difflibjs, which seems to have some outstanding bugs.
Depends on #3.