disinfoRG / FbScraper

MIT License
3 stars 2 forks source link

Better db management #20

Closed andreawwenyi closed 4 years ago

andreawwenyi commented 4 years ago
  1. add sitesAirtable scrapy spider for updating Site table
  2. use pugsql to execute db queries. Put all sql queries in queries folder. see issue #18
  3. fix issue #15
  4. simplify _crawler.py, _spider.py and _pipeline.py files. Right now, for both Page and Post, fb_handler.py directly calls _crawler.py, and _crawler.py no longer needs _pipeline.py. (i.e. we can get rid of _spider.py and _pipeline.py) This makes the code base easier to navigate.