Open kouloumos opened 1 week ago
@elraphty another thing to keep in mind: in the current scraper, the thread_url for answers, as assigned here:
https://github.com/bitcoinsearch/scraper/blob/ad894da9f891801ace8bfb5d1aaa5d3c30e2bb6f/bitcoin.stackexchange.com/main.py#L90-L91
is not correct. The tail "#" + post.attrib.get("Id") shouldn't be part of thread_url. The correct assignment is:
"thread_url": "https://bitcoin.stackexchange.com/questions/" + post.attrib.get("ParentId"),
Please keep that in mind for the new implementation.
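
A minimal sketch of the corrected assignment, for illustration only. The helper name thread_url_for and the sample row are hypothetical and not taken from the scraper. It assumes each post is an XML row carrying the Id, ParentId and PostTypeId attributes used in the Stack Exchange data dump format, where PostTypeId "2" marks an answer. Using the post's own Id as the thread for non-answer posts is also an assumption.

```python
import xml.etree.ElementTree as ET

# Base URL taken from the corrected assignment quoted above.
QUESTIONS_BASE_URL = "https://bitcoin.stackexchange.com/questions/"


def thread_url_for(post):
    """Return the thread URL for a post row (hypothetical helper).

    Answers point at their parent question, with no "#<Id>" tail.
    Assumption: any other post type is treated as its own thread.
    """
    if post.attrib.get("PostTypeId") == "2":  # "2" marks an answer
        return QUESTIONS_BASE_URL + post.attrib.get("ParentId")
    return QUESTIONS_BASE_URL + post.attrib.get("Id")


# Made-up sample row: answer 200 belonging to question 100.
answer = ET.fromstring('<row Id="200" PostTypeId="2" ParentId="100" />')
print(thread_url_for(answer))
```

With this shape, every answer to the same question shares one thread_url, which is what makes it usable as a thread-level key.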
Our scraper is designed to keep track of content from multiple Bitcoin-related sources, including the Bitcoin Stack Exchange. Currently, however, we haven’t received new data from the Bitcoin Stack Exchange in over seven months. This is because the source we were using for periodic data dumps is no longer being updated.
Problem:
Proposed Solution:
Tasks: