thehitchhikersguideto / bookworms

1 stars 4 forks source link

SCRAPER BUG -> Datatime object sometimes seem to be taking in the publisher #38

Open vtwoptwo opened 1 year ago

vtwoptwo commented 1 year ago

Image

ZaneReda commented 1 year ago

I think this may stem from the way you are loading the data, the MongoDB shows no evidence of this.

ZaneReda commented 1 year ago

If you are referring to the date/time that the entry was scraped then it probably has to do with how the data is being collected from the MongoDB if you are referring to the date of publish then yes I would expect that some entries are incorrect as the format isnt standardized on good reads. I can either change the scraper to return the whole thing so it can be parsed externally or leave it as is. Let me know which you prefer.