airscholar / e2e-data-engineering

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
https://www.youtube.com/watch?v=GqAcTrqKcrY
204 stars, 91 forks

Update spark_stream.py #1

Closed: aviravipati closed this 1 year ago

aviravipati commented 1 year ago

I think the missing spark-sql package issue could be due to this.