espin086 / NewsWaveMetrics

is a powerful tool for analyzing news sentiment on both national and local stories, allowing users to correlate these stories with their own uploaded metrics, starting with stock market price data. Stay ahead of the curve and make informed decisions with SentimentSync.
MIT License
1 stars 0 forks source link

Creation of Python Module for Gathering Historical News Data using RSS Feeds #17

Open espin086 opened 4 months ago

espin086 commented 4 months ago

This task requires the creation of a Python module for gathering historical news data pertaining to specific topics like a company. The data source for this module should be RSS Feeds, as the aim is to find a free alternative to RapidAPI.

Scope

  1. Investigate: Conduct research on existing solutions and identify potential RSS feeds that can provide reputable news stories.
  2. Design: Plan the architecture of the Python module, including key functions and interfaces.
  3. Develop: Create the Python module that fetches, processes, and formats data from chosen RSS feeds. The module should be able to scrape and download the text from the source.
  4. Test: Test the module against the acceptance criteria outlined below.

Acceptance Criteria

  1. Cost: The Python module does not incur any additional API-related costs.
  2. Content Quality: The selected sources publish reputable news stories.
  3. Functionality: The Python module has the ability to download all the text from the source.
  4. Historical data: The Python module can pull historical data related to a specific topic or company.

Blockers

Checklist

Additional Notes While the focus is to use RSS feeds, this should not limit the exploration of other potential free sources of news data. Consideration could be given to APIs that provide free-tier access or open-source libraries that might facilitate the task.