C10-Brazilian-e-commerce-modeling-team / brazilian-e-commerce


chore: Design the webscraping strategy to extract the information. #17

Closed alexrods closed 2 years ago

alexrods commented 2 years ago

Summary

Choose the tools for data extraction through web scraping.

Acceptance Criteria

GabyGO2108 commented 2 years ago

For this issue we decided to go with our favorite language, Python. Our reasons:

  1. Its simplicity
  2. It has a large number of libraries
  3. It saves time
  4. The syntax is easy to understand
  5. Familiarity: the entire team already knows it

As for the libraries, our first choice is Scrapy and lxml, for their power and simplicity. We'll see whether they are enough or whether changes will be needed.
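
To give an idea of what the parsing side could look like with lxml, here is a minimal sketch. The HTML snippet, the class names (`product`, `title`, `price`) and the helper names `parse_price` and `extract_products` are all made up for illustration; the real selectors will depend on the site we end up scraping, and fetching the pages (for example from a Scrapy spider) is left out.

```python
from lxml import html

# Hypothetical product-listing markup; a real page would be fetched over HTTP.
SAMPLE_HTML = """
<html><body>
  <div class="product"><h2 class="title">Notebook</h2><span class="price">R$ 12,50</span></div>
  <div class="product"><h2 class="title">Backpack</h2><span class="price">R$ 89,90</span></div>
</body></html>
"""


def parse_price(text):
    """Convert a Brazilian-formatted price such as 'R$ 1.234,50' to a float."""
    cleaned = text.replace("R$", "").strip()
    # Drop thousands separators, then turn the decimal comma into a dot.
    cleaned = cleaned.replace(".", "").replace(",", ".")
    return float(cleaned)


def extract_products(page_source):
    """Return a list of {'title', 'price'} dicts found in the page source."""
    tree = html.fromstring(page_source)
    products = []
    # One XPath query per product block, then relative queries inside it.
    for node in tree.xpath('//div[@class="product"]'):
        title = node.xpath('.//h2[@class="title"]/text()')[0].strip()
        price_text = node.xpath('.//span[@class="price"]/text()')[0]
        products.append({"title": title, "price": parse_price(price_text)})
    return products


products = extract_products(SAMPLE_HTML)
for product in products:
    print(product["title"], product["price"])
```

The same XPath expressions should carry over to Scrapy selectors with little change, which is one reason the two libraries pair well.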

alexrods commented 2 years ago