issues
knox-academy/webscraping
Best Python feed-reading module
#58
knox-academy
opened
1 year ago
1
SQLite in a container
#57
knox-academy
opened
1 year ago
5
Basic Flask Website
#56
knox-academy
opened
1 year ago
3
Create a Python script for storing RSS feed data in a SQLite DB
#55
knox-academy
opened
1 year ago
3
Create a Python script for storing RSS feed data in a SQLite DB
#54
knox-academy
opened
1 year ago
1
Research automated Python email parsing
#53
knox-academy
opened
1 year ago
1
More Full Stories
#52
knox-academy
opened
1 year ago
1
Basic Flask website
#51
knox-academy
closed
1 year ago
0
Basic Flask website
#50
knox-academy
closed
1 year ago
1
Python port scanner
#49
knox-academy
opened
1 year ago
1
Automate the scraping and updating process using a continuous integration tool like Jenkins.
#48
knox-academy
closed
1 year ago
0
Set up an S3 bucket named knox-academy-rssfeed-data to store the JSON files.
#47
knox-academy
closed
1 year ago
0
Implement a system to store the 10 most recent articles from both feeds combined.
#46
knox-academy
closed
1 year ago
0
Test the JSON file to ensure it is storing the correct data.
#45
knox-academy
closed
1 year ago
0
Create a JSON file to store the article information.
#44
knox-academy
closed
1 year ago
0
Parse the scraped data to extract the article information.
#43
knox-academy
closed
1 year ago
0
Test the script to ensure it is scraping the correct data.
#42
knox-academy
closed
1 year ago
0
Create a script to scrape the RSS feeds.
#41
knox-academy
closed
1 year ago
0
Use Beautiful Soup to scrape the RSS feeds.
#40
knox-academy
closed
1 year ago
0
Deploy the web scraper to a production environment.
#39
knox-academy
closed
1 year ago
0
Document the web scraper's functionality and usage instructions for future reference.
#38
knox-academy
closed
1 year ago
0
Test the web scraper to ensure it is functioning correctly and storing data in the S3 bucket.
#37
knox-academy
closed
1 year ago
0
Optimize the web scraper's performance and efficiency to ensure timely and accurate data retrieval.
#36
knox-academy
closed
1 year ago
0
Set up an S3 bucket named knox-academy-rssfeed-data.
#35
knox-academy
closed
1 year ago
0
Store the article data in a JSON file.
#34
knox-academy
closed
1 year ago
0
Parse the data from the RSS feeds and extract the 5 most recent articles.
#33
knox-academy
closed
1 year ago
0
Create a script to read the RSS feeds from bleepingcomputer.com.
#32
knox-academy
closed
1 year ago
0
Set up a development environment with the selected tool and necessary dependencies.
#31
knox-academy
closed
1 year ago
0
Research and select a web scraping tool that can read RSS feeds and store data in a JSON file.
#30
knox-academy
closed
1 year ago
0
Issue 8: Develop a user guide for the Python script, taking into account factors such as audience, language, and format.
#29
knox-academy
closed
1 year ago
0
Issue 7: Establish documentation standards for the Python script, including factors such as readability, completeness, and version control.
#28
knox-academy
closed
1 year ago
0
Issue 6: Develop testing criteria for the Python script, including edge cases, error handling, and performance.
#27
knox-academy
closed
1 year ago
0
Issue 5: Establish specific requirements for the S3 bucket, including factors such as security, accessibility, and cost.
#26
knox-academy
closed
1 year ago
0
Issue 4: Determine a schedule for running the script, taking into account factors such as server load and peak usage times.
#25
knox-academy
closed
1 year ago
0
Issue 3: Establish criteria for determining what constitutes duplicate data and implement a method for identifying and removing duplicates.
#24
knox-academy
closed
1 year ago
0
Issue 2: Determine the specific requirements for the format of the JSON data, including which fields to include from the Hacker News website.
#23
knox-academy
closed
1 year ago
0
Issue 1: Research and select appropriate libraries for the Python script based on factors such as popularity, ease of use, and compatibility with other tools we are using.
#22
knox-academy
closed
1 year ago
0
Need to create a Python script to scrape Hacker News daily
#21
knox-academy
closed
1 year ago
3
Document the project and provide instructions for future maintenance.
#20
knox-academy
closed
1 year ago
0
Test the web scraper in the production environment.
#19
knox-academy
closed
1 year ago
0
Deploy the web scraper to a production environment.
#18
knox-academy
closed
1 year ago
0
Test the JSON file storage and upload code.
#17
knox-academy
closed
1 year ago
0
Break down into two smaller issues: "Write code to store the extracted article data in a JSON file" and "Write code to upload the JSON files to the S3 bucket."
#16
knox-academy
closed
1 year ago
0
Write code to filter the extracted data to the 5 most recent articles.
#15
knox-academy
closed
1 year ago
0
Test the data extraction code.
#14
knox-academy
closed
1 year ago
0
Write code to extract data from the RSS feeds.
#13
knox-academy
closed
1 year ago
0
Set up a development environment for the project.
#12
knox-academy
closed
1 year ago
0
Consult with the dev team to determine the best tool for the job.
#11
knox-academy
closed
1 year ago
0
Create a web scraper to read the RSS feed from bleepingcomputer.com
#10
knox-academy
closed
1 year ago
3
Dev: Set up AWS S3 bucket for storing scraped data
#9
knox-academy
closed
1 year ago
0
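Several of the issues above (for example #33, #34 and #15) describe the same core task: parse an RSS feed, keep the 5 most recent articles, and store them as JSON. The following is only a minimal sketch of that idea using the Python standard library, not the project's actual code. The sample feed, the field names (`title`, `link`, `published`) and the function names are all assumptions made for illustration; downloading the feed and uploading to S3 are left out.

```python
import json
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical sample feed standing in for a downloaded RSS document.
SAMPLE_RSS = """<rss version="2.0"><channel>
<title>Example feed</title>
<item><title>Oldest</title><link>https://example.com/a</link>
<pubDate>Mon, 01 Jan 2024 10:00:00 +0000</pubDate></item>
<item><title>Newest</title><link>https://example.com/c</link>
<pubDate>Wed, 03 Jan 2024 10:00:00 +0000</pubDate></item>
<item><title>Middle</title><link>https://example.com/b</link>
<pubDate>Tue, 02 Jan 2024 10:00:00 +0000</pubDate></item>
</channel></rss>"""


def parse_items(rss_text):
    """Return one dict per <item>, with pubDate parsed into a datetime."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        pub_date = item.findtext("pubDate")
        if pub_date is None:
            continue  # skip entries that cannot be ordered by date
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": parsedate_to_datetime(pub_date),
        })
    return items


def most_recent(items, limit=5):
    """Sort newest first and keep at most `limit` articles."""
    return sorted(items, key=lambda a: a["published"], reverse=True)[:limit]


def to_json(items):
    """Serialize articles, converting datetimes to ISO 8601 strings."""
    serializable = [{**a, "published": a["published"].isoformat()} for a in items]
    return json.dumps(serializable, indent=2)


if __name__ == "__main__":
    print(to_json(most_recent(parse_items(SAMPLE_RSS), limit=5)))
```

Sorting on the parsed datetime rather than the raw `pubDate` string avoids ordering mistakes when feeds use different time zones. The sketch assumes every `pubDate` carries an explicit offset; mixing offset-less and offset-aware dates would need extra handling.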