mediacloud / story-indexer

The core pipeline used to ingest online news stories in the Media Cloud archive.
https://mediacloud.org
Apache License 2.0

approaches to future proofing / testing Story object? #215

Open rahulbot opened 10 months ago

rahulbot commented 10 months ago

The Story object is the key data object passed around throughout our system. As we encounter edge cases and make more changes, it feels like it is becoming harder to validate that it does the right thing in all cases. I've created this issue to capture ideas for how we might address this so it doesn't become a long-term system risk.

Some initial ideas (from @philbudne and me):