-
Review the necessary and desirable requirements of the tutorial participants are aligned with section 2.5 (Technical approach and methodology) of the proposal.
-
Description: Develop a real-time streaming word count application leveraging Apache Spark Streaming's DStream API. Utilize Python to ingest streaming text data from a chosen source, tokenize the wor…
-
Real-time Data Transformation with Apache Flink
Difficulty: 2
Technology: Apache Flink
Description: Develop an advanced real-time data transformation application utilizing Apache Flink's DataSt…
-
### Duplicates
- [X] I have searched the existing issues
### Summary 💡
I'd like to use Auto-GPT with a Jupyter Notebook. My proposal is that Auto-GPT should be able to:
- run the notebook …
-
I want to create histograms and be able to access their sum of weights squared. When using WeightedMean storage sum_of_weights_squared just returns the number of entries, not the sum of weights square…
-
### Overview
We need to add lat/long coordinates to the data set we get from the city so that a higher % of parking violations and summary statistics display on the map.
### Details
Around 15% of…
-
# Investigate routes with `ratio > 1`
There are some routes that have a ratio of actual trips to scheduled trips greater than one, and it would be good to know why.
## Access the data
### Jupyter Not…
-
Understand bike thefts in 2017 by month, location, outcome using Python for plotting and summary stats
Notebook: [data analysis](bicycle-theft-cambridgeshire/notebooks/Data-analysis-and-mapping.ipy…
-
The data is from Kaggle:
https://www.kaggle.com/competitions/instacart-market-basket-analysis/data
A video of the field person describing the video to Pranay:
[ Gen AI Demo-20230522_103358-Meeting R…
-
## Diagnostic Data
- Python version (& distribution if applicable, e.g., Anaconda): 3.11.5 (Anaconda 2023.07)
- Type of virtual environment used (e.g., conda, venv, virtualenv, etc.): conda
- O…