radiolarian / AO3Scraper

A Python scraper for getting fan fiction content and metadata from Archive of Our Own.
175 stars 56 forks source link

Scraping comments (and threads) #10

Closed bianchi-dy closed 3 years ago

bianchi-dy commented 4 years ago

I'm not a CS major by training so I'm not sure what data structure might suit storing these best or if it might get corrupted by the text being stored in a CSV. Any ideas?

ssterman commented 3 years ago

Sorry for the delay -- if you are still interested in this, some thoughts:

bianchi-dy commented 3 years ago

I'm actively working on this again! Not sure why the scraping script ao3_get_fanfics.py is slow for me today but my plan is to follow @ssterman's suggestion: