section-engineering-education / engineering-education

“Section's Engineering Education (EngEd) Program is dedicated to offering a unique quality community experience for computer science university students."
Apache License 2.0
363 stars 889 forks source link

Getting started with web scraping using Python #455

Closed ahmadmardeni1 closed 3 years ago

ahmadmardeni1 commented 3 years ago

Brief Summary:

Web scraping is a technique in which a computer program extracts data from human-readable output coming from a website.

In this tutorial, I will be using python with beautiful soup library to make the reader understand what is web scraping, how he can use it to gather important information in a good format, and we will create two programs step by step, one of them will be a Covid19 Desktop Notifier which makes a live update on COVID-19 Cases and Deaths for your country using a popup notification on desktop.

Key Takeaways:

References:

N/A

ninjaginja commented 3 years ago

@ahmadmardeni1 We actively sell products that aim to prevent web scraping. I see that one of your takeaways talks about ensuring permissions around web scraping, but I just want to be sure that this is front and center with the article. Is there a way to reframe the article as 'Building a COVID-19 Desktop Notifier with Python', rather than framing it within the context of web scraping?

ahmadmardeni1 commented 3 years ago

@ninjaginja my aim was to focus on legal issues in order to prevent new learners from doing web scraping on random websites, so I wrote about 2 methods in order to know if the website is allowing us or not. Also, I can change the tutorial just to "Build a COVID-19 Desktop Notifier". let me know if you gave me permission to write about web scraping or not, I think I can give the reader a good value because I'm an expert with it.

ninjaginja commented 3 years ago

Yes, the topic is okay as long as you lead with the legal issues.