edgi-govdata-archiving / version-tracking-ui

ARCHIVED--Bookmarklet to modify UI for Versionista website monitoring
MIT License
3 stars 1 forks source link

Find or develop an html diffing tool in js #9

Closed titaniumbones closed 6 years ago

titaniumbones commented 7 years ago

jsdifflib-node seems to be relatively actively maintained, but is, obviously, designed for node; here's the parent library, difflibjs, which seems to have some outstanding bugs.

Depends on #3.

jpmckinney commented 7 years ago

How would this fit into existing workflows? I don't think we want to rebuild Versionista, right?

titaniumbones commented 7 years ago

It looks like it would be a great help to be able to filter out some diffs that are repeated many times over (mostly header & footer). In order to do that ,we would need to read & diff the files somehow. Obviously this requires an extension first.

On 01/28/2017 05:13 PM, James McKinney wrote:

How would this fit into existing workflows? I don't think we want to rebuild Versionista, right?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/edgi-govdata-archiving/version-tracking-ui/issues/9#issuecomment-275878456, or mute the thread https://github.com/notifications/unsubscribe-auth/AAWPNEVaGR48Fq8Am1u8_ABq7Kdb4e9Eks5rW72mgaJpZM4LwnsU.

jpmckinney commented 7 years ago

In that case, we're just using Versionista to collect versions, and then performing all the diffing ourselves?

titaniumbones commented 7 years ago

well, sorta. We would need to perform the diff, see if it passes a test, and if not mark it as irrelevant. We could sitll use the versionista UI for viewing diffs, we'd just perform an initial low-bar-of-acceptance filter against known trivial changes. It's a dramatic expansion of the tool, I guess -- but eventually we're going to need it, I guess the monitoring team is finding it a full-time job for about 5 people already.

On 01/28/2017 06:39 PM, James McKinney wrote:

In that case, we're just using Versionista to collect versions, and then performing all the diffing ourselves?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/edgi-govdata-archiving/version-tracking-ui/issues/9#issuecomment-275882738, or mute the thread https://github.com/notifications/unsubscribe-auth/AAWPNLzU1jG45Pd9RVYrNiWa5PA8cZlFks5rW9GkgaJpZM4LwnsU.

dcwalk commented 6 years ago

This is happening (so exciting!) in a few different repos, see https://github.com/edgi-govdata-archiving/web-monitoring for progress. Closing as appropriate issues are ticketed there