rivernews / review-scraper-java-development-environment

An environment to develop review scraper
0 stars 1 forks source link

Data Integrity #26

Closed rivernews closed 4 years ago

rivernews commented 4 years ago

When working on healthcrowd work, we got unresolvable collision.

After inspection, it is because helpCount can change over time. Because of this, reviews could change overtime.

rivernews commented 4 years ago

Discussion

While we can do more compact data storing, they are quite complicate to maintain. We'll rather keep it simple. But we still need to make trade off on this. Storing an entire object again - indeed is not reasonable.

Some decisions made

rivernews commented 4 years ago

Approach

rivernews commented 4 years ago

Merged by #27. Store helpfulCount into seperate place from other "stable" fields. AKA, storing varying and stable data separately. so we have better storage efficiency.