D-Mielewczyk / euro-temperature-trend-stats

MIT License
0 stars 0 forks source link

Detect 5 Areas with the Biggest Temperature Changes Using PySpark #4

Closed D-Mielewczyk closed 4 days ago

D-Mielewczyk commented 3 weeks ago

Develop a PySpark script to analyze the cleaned climate data and identify the 5 areas with the most significant temperature changes. The script should be saved with an appropriate name reflecting its functionality.

Requirements:

  1. Load the cleaned data:

    • Use PySpark to load the preprocessed and cleaned climate data from the specified location.
  2. Data Analysis:

    • Analyze the temperature data to detect the 5 areas with the largest temperature changes over the specified time period.
    • Ensure the analysis is accurate and considers all relevant factors to identify the areas with the most significant changes.
  3. Output:

    • Save the results of the analysis, including the names of the 5 areas and their respective temperature changes, to a specified location.
  4. Script Naming:

    • Save the PySpark script with a fitting name such as detect_top5_temperature_changes.py.

Details:

Acceptance Criteria:

Additional Notes:

D-Mielewczyk commented 3 weeks ago

Blocked by #2