Ensure the data generated from the python version of colab's state script is expected.
It's using correct sources.
Once the above are ensured, uncomment the git commit lines in the YAML file.
Have used the idea to directly add R library but had to create a specific library for the GitHub instance, in case someone has way without using it, we can add it apart from downloading it on github instance(it will increase the job time by 20 min).
When to run script is yet to be decided, should it be based on whenever there is a data update on the main source( A functionality of hashing would need to be added to ensure that) or else if it's a daily job, just add a cron job for that else make a cron job to regularly check the hash of the main file.