Spotify's songs & Netflix's movies positivity over Covid incidence
The software uses Apache Kafka to download and elaborate data from Spotify CSV charts, Netflix FlixPatrol.com charts (scraped) and Apple-Google mobility. These data is updated every week for 45 countries. These data is then integrated with the help of IMDb API and Spotify API. Finally, everything is stored in a MongoDB database.
Before setting up the project, these softwares need to be installed:
Start the aforementioned software processes, and then create 3 Kafka topics ('netflix', 'spotify', 'covid'), with the following command:
bin/kafka-topics.sh --zookeeper localhost:2181 --create --topic topic_name --partitions 1 --replication-factor 1 --config max.message.bytes=5242880
After that, install these python libraries through pip install
:
Textblob need also the package corpora: python -m textblob.download_corpora
To start the project, just type this simple command:
python main.py
It will start 6 threads: