Closed cstrouse closed 4 years ago
The reason is that the website uses different style tag values in different pages. Adding an option to ignore styling may be good.
A new feature has been added to the new version (v1.1.7) which you can use as a workaround for this problem:
scraper.get_result_similar('https://www.weedsta.com/strains/banana-kush', attr_fuzz_ratio=0.8)
@alirezamika Works great. Thanks a bunch!
This site is consistent and well-structured with easily located selectors but
autoscraper
struggles with scraping the data. I trained it with a few examples which found the data successfully but subsequent attempts to scrape other pages yields missing data even though the markup is the same for these pages as the training pages.Here's an example where you can see that the percentages are not returned.