This project aims to analyze video game data to identify trends and insights. The data is extracted from RAWG using Python and stored in Google Cloud Storage (GCP). The data is then transformed using dbt Cloud and Mage. The entire process is orchestrated using Mage.
Data is extracted from the RAWG Video Games Database API. The API provides information on video games, including game title, genre, platform, release date, and more. The API documentation can be found here.
For the data dictionary, refer to the dbt documentation (Click Sources
) here
Python Script using Mage
End to end pipeline to extract, transform, and load video game data
dbt docs to view the data model and documentation
./keys/gcp-creds.json
[!IMPORTANT] Do not commit the service account key to the repository.
terraform init terraform plan terraform apply
10. Prepare config files and create directories:
```bash
bash script/00_repo_initial_setup.sh
Prepare the files .env
Start the docker containers:
docker-compose up -d
Open the mage application
Run the pipeline end_to_end_pipeline
Check the data in BigQuery
Clone the dashboard in Looker Studio and connect the data source to BigQuery