kaogeek / fact-finder

A platform that collects facts (data and evidence) and presents them as they are, in a useful way. No prejudice!
https://factfinder.app/
MIT License

Using web archiving tools for storing local copy of the evidence #7

Open itpcc opened 2 years ago

itpcc commented 2 years ago

Currently, there is so much damaging evidence that law enforcement officers might force the owners to delete the footage or remove it from the site. In my opinion, archiving those pages locally should be seriously considered, since relying on archive.is or archive.org might not work with the Thai court system.

One solution is to use the Webrecorder tools. They can store almost all of the information on a webpage in a single Web Archive (WARC) file and replay it exactly as it was originally displayed. I personally use these tools to archive news pages about the vaccine situation, and even after a page was deleted from the site, the archive still shows it exactly as I saw it then. This might help convince the court that the evidence is real.
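To make the "single WARC file" idea a bit more concrete, here is a minimal sketch of what one WARC/1.0 `response` record looks like when built by hand. This is not how Webrecorder is implemented; the function name, the sample response bytes, and the `https://example.org/news/123` URL are all made up for illustration, and a real capture would also include request and metadata records.

```python
import base64
import hashlib
import uuid
from datetime import datetime, timezone


def build_warc_response_record(target_uri: str, http_response: bytes) -> bytes:
    """Wrap raw HTTP response bytes (status line, headers, body) in one WARC/1.0 record."""
    # WARC digests are conventionally written as "sha1:" followed by Base32.
    digest = base64.b32encode(hashlib.sha1(http_response).digest()).decode("ascii")
    headers = [
        "WARC/1.0",
        "WARC-Type: response",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"WARC-Date: {datetime.now(timezone.utc).strftime('%Y-%m-%dT%H:%M:%SZ')}",
        f"WARC-Target-URI: {target_uri}",
        f"WARC-Block-Digest: sha1:{digest}",
        "Content-Type: application/http; msgtype=response",
        f"Content-Length: {len(http_response)}",
    ]
    header_block = "\r\n".join(headers).encode("utf-8") + b"\r\n\r\n"
    # Every record is terminated by two CRLFs after its content block.
    return header_block + http_response + b"\r\n\r\n"


# Hypothetical captured response; a real tool would record this from the network.
SAMPLE_RESPONSE = (
    b"HTTP/1.1 200 OK\r\n"
    b"Content-Type: text/html\r\n"
    b"\r\n"
    b"<html><body>evidence</body></html>"
)

record = build_warc_response_record("https://example.org/news/123", SAMPLE_RESPONSE)
```

Appending records like this one to a file gives a `.warc` archive. The capture date and the digest stored next to the exact bytes are what make such a record useful as evidence later on.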

Moreover, it can also archive to IPFS, so the information is stored on a distributed network as well.

If you are considering this approach or would like to discuss it, please reply to this issue. I'd be more than glad to see it happen.

Finally, thank you all for your hard work. You have proved to the people of Thailand that politics is relatable and within reach of all of us.

tee4cute commented 2 years ago

@itpcc Thanks for reaching out! I was thrilled by your ideas, especially the use of IPFS. Could you please explore this idea further by investigating whether the indexing data (JSON format) can be kept and queried in the IPFS network without a performance tradeoff?

Storing public and open data in a decentralized system will make this project more trusted and transparent!

tee4cute commented 2 years ago

I've played with BigchainDB, a decentralized database that relies on blockchain technologies. With BigchainDB, we can store and query data on a blockchain network in a conventional way, as with a centralized database, without sacrificing app performance.

Hope it helps with your exploration of the idea ;)

itpcc commented 2 years ago

"Could you please explore this idea further by investigating whether the indexing data (JSON format) can be kept and queried in the IPFS network without a performance tradeoff?"

In short, IPFS is a decentralized, content-addressed file system. Instead of a location-based system, where we have to say where the file is, we ask the network to send us a file using an ID derived from a hash of the file's content. This not only protects the file from tampering but also allows multiple sources to store the same file, which should make it faster to retrieve.
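A toy sketch of content addressing, in case it helps. This is only an illustration of the idea, not IPFS itself: real IPFS splits files into chunks and encodes its IDs (CIDs) differently, and the `ContentStore` class below is a made-up name that uses a plain SHA-256 hex digest as the ID.

```python
import hashlib


class ContentStore:
    """Toy content-addressed store: each blob's key is the SHA-256 of its bytes."""

    def __init__(self) -> None:
        self._blobs: dict[str, bytes] = {}

    def put(self, data: bytes) -> str:
        content_id = hashlib.sha256(data).hexdigest()
        self._blobs[content_id] = data
        return content_id

    def get(self, content_id: str) -> bytes:
        data = self._blobs[content_id]
        # Re-hash on read: a tampered copy no longer matches the ID it was requested by.
        if hashlib.sha256(data).hexdigest() != content_id:
            raise ValueError("content does not match its ID")
        return data


evidence = b"<html><body>archived news page</body></html>"

# Two independent "nodes" storing the same bytes arrive at the same ID,
# so either of them can serve a request for it.
node_a, node_b = ContentStore(), ContentStore()
id_a = node_a.put(evidence)
id_b = node_b.put(evidence)

# Simulate tampering on node_b by reaching into its storage directly.
node_b._blobs[id_b] = b"<html><body>edited page</body></html>"
try:
    node_b.get(id_b)
    tamper_detected = False
except ValueError:
    tamper_detected = True
```

Because the ID is computed from the bytes, nobody has to trust the node that served the file: the receiver can recompute the hash and compare.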

The differences from BitTorrent's DHT system are the use of one global swarm (rather than a separate swarm per file), support for versioning, and the ability to integrate with a blockchain (though not natively).

Regarding your question, it might not work well for querying files, since it is based on a file system architecture. You might be able to browse (using data links), but complex queries such as keyword search might not be possible. Still, as I said, it's a (kinda) good system for storing evidence, since each address is based on the content hash.
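Here is a rough sketch of why browsing is cheap and keyword search is not. It assumes the index is stored as JSON blocks that link to each other by content hash; the `blocks` dictionary stands in for the network, and the block layout (`links`, `text`) and sample records are invented for this example rather than taken from IPFS or from this project.

```python
import hashlib
import json

blocks: dict[str, bytes] = {}  # stand-in for the network: content ID -> block bytes


def put_json(obj) -> str:
    data = json.dumps(obj, sort_keys=True).encode("utf-8")
    cid = hashlib.sha256(data).hexdigest()
    blocks[cid] = data
    return cid


def get_json(cid: str):
    return json.loads(blocks[cid])


def resolve(root_cid: str, path: list[str]):
    """Browse: follow one named link per path segment, fetching one block per step."""
    node = get_json(root_cid)
    for name in path:
        node = get_json(node["links"][name])
    return node


def keyword_search(root_cid: str, keyword: str) -> list[str]:
    """Search: with no index, every reachable block has to be fetched and scanned."""
    hits, stack = [], [root_cid]
    while stack:
        cid = stack.pop()
        node = get_json(cid)
        if keyword in node.get("text", ""):
            hits.append(cid)
        stack.extend(node.get("links", {}).values())
    return hits


# Hypothetical records, grouped under a per-day block and a root block.
clip1 = put_json({"text": "footage from the protest site"})
clip2 = put_json({"text": "news report on vaccine delivery"})
day = put_json({"links": {"clip1": clip1, "clip2": clip2}})
root = put_json({"links": {"2021-08-01": day}})

found = resolve(root, ["2021-08-01", "clip2"])
matches = keyword_search(root, "vaccine")
```

`resolve` touches one block per path segment, while `keyword_search` has to walk the whole tree. For real keyword queries you would probably keep a separate search index and store only the evidence itself by content hash.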

itpcc commented 2 years ago

I've found this Stanford Seminar video on IPFS, which explains in detail how IPFS works. Hope this helps.

vtno commented 2 years ago

I'm quite interested in this as well. Let me do a little research and I'll post some implementation details here.