JustinGOSSES / awesome-list-visual-explorer-template

Change single config file & then run scripts that will build a website that visualizes the implicit community of a list of related GitHub repos, especially Awesome Lists
https://justingosses.github.io/awesome-list-visual-explorer-template/
MIT License
3 stars 0 forks source link
awesome awesome-list open-source visual-explorer visualization

awesome-list-visual-explorer-template

The awesome-list-visual-explorer-template is a repository designed to be used as a template repository. If the name of the code repository is notawesome-list-visual-explorer-templateyou are looking at a code repository made with the template.

This repository includes a config file, _config.yml that users can edit to point at an Awesome List, a GitHub org, or any other group of GitHub repositories. The user can then run a few pre-built scripts that harvest GitHub metadata about each repository, transform that metadata into visualizations, and generate a website with those interactive visualizations that is deployed as a GitHub pages page.

The visualizations show:

The goal of this project is to surface the characteristics, trends, connections, relationships, etc. that can describe the implicit community of developers and repositories in an Awesome List, or other type of list, and make them available in a visual form to the same people who get value from an Awesome List in list form. The speed of insights possible from visualizations in one place should be greater than what could occur by reading each repository individually.

The idea behind its creation is by being able to quickly understand visually the trends and relationships in a community of related code repositories, it might nudge developers who are a part of that implicit community how think about those projects, where they might contribute, and who is working on the things they are also interested in. For instance, they might be more likely to contribute to a project they can see others in their community depend on rather than starting their similar project from scratch.

Is this the awesome-list-visual-explorer-template TEMPLATE or a repository made from the template?

NOTE: If the name of the code repository is not awesome-list-visual-explorer-template, you are looking at a code repository made with the template, https://github.com/JustinGOSSES/awesome-list-visual-explorer-template.

If this repository is not the template itself, suggest you move the section below up to the top & move the first paragraph to below this line.

Description of this code repository if not the template:

WRITE YOUR PURPOSE OF THE REPOSITORY HERE

Key Pages That The Template Builds

Front Page

This is a catalog of all the code repositories. The organizational structure is based on topics tags on the github repositories themselves and categories that organize those tags set in category/category_info.json.

About Page

This is an about page for the template itself.

Explore Page

A series of visualizations that give a high level overview of how the community of Awesome List GitHub code repositories has changed over time, including:

Dependencies Page

An interactive graph network of dependencies and organizational connections between code repositories.

Popular Repositories Page

Visualization of the most popular repositories including:

Additional Information About The awesome-list-visual-explorer-template Template

Webpages Built With This Template

Presentations on What's trying to be accomplished with this repository

Prior Work

What is it based off of? This site was created by taking a fork of the Lawrence Livermore National Laboratory's open source software catalog and changing a bunch of stuff to make it useful for visualizing Software Underground's Awesome-Open-Geoscience list.

It was initially created as part of the Transform 21' hackathon put on by The Software Underground or SWUNG.

Original Project Plan: https://github.com/softwareunderground/transform-2021-hackathon/discussions/14

The project plan has now been moved to issues: https://github.com/softwareunderground/open_geosciene_code_projects_viz/projects/1

The code is still largely that of the original Lawrence Livermore National Laboratory's open source software catalog.

Completed Changes From Original Project & Possible Future Changes Roadmap: https://github.com/softwareunderground/open_geosciene_code_projects_viz/blob/main/changes_needed.md

Overview of How awesome-list-visual-explorer-template Template Works

At a very high level, there a variety of bash and Python scripts that grab github repository URLs from an Awesome List README location you supply as configuration, get details about those repositories from the GitHub API, and then rebuild the webpages with that information and information from a configuration file.

A step-by-step high level overview of how the template is used:

Prerequisites

Before you begin, make sure you have working installs of Git, Ruby, and Bundler https://bundler.io/ You will need these tools for development.

Getting Started

First, use the template repository as your template. Got to https://github.com/JustinGOSSES/awesome-list-visual-explorer-template/ and click the big green button that say "Use This Template".

You'll probably want to give it a name that somewhat aligns with the Awesome List repository that you'll use for your data.

Next, to work locally, clone your repository:

git clone https://github.com/JustinGOSSES/awesome-list-visual-explorer-template/   <--- or your name of the repository!

Make sure you are in the directory you just created by running cd nameOfRepositoryHere Then you can use bundler to install the Ruby dependencies (see the Jekyll installation docs for step-by-step guides to setting this up):

bundle install

Running this will install everything in your Gemfile (including Jekyll).

After all the data is updated and the pages are built fresh, you'll want to the development web server with:

bundle exec jekyll serve

Followed by opening http://127.0.0.1:4000/open_geosciene_code_projects_viz/ in a web browser.

You probably want to visualize details of an Awesome list specific to you though, so we'll get to that now.

Go to the _config.yml file and change some of the details. The important ones to change are:

The python scripts call the GitHub API, which means the environment you run your code in requires a GitHub API Token as an environmental variable. The token only needs READ access to public repositories. You can read about how to get a GitHub API token here and here. If running locally in a terminal, you probably want to get it into your environment by running something like export GITHUB_API_TOKEN digitsOfYourTokenGoHere. If running in the GitHubActions, the .github/workflows/update.yml file already has you covered.

Next, we'll work with the scripts that pull in data and rebuild the pages.

Change directory to _explore/scripts and read the README there for instructions on how to start a virtual environment and install the dependencies in requirements.txt.

Once you've done that, you can proceed to run the first bash script to gather GitHub code repository URLs from the Awesome List URL you added to _config.yml. This done by running bash grabNewRepos.sh.

Next step is to run bash BUILD.sh. This is the main build script for the repository and runs a bunch of python files in the scripts folder and also calls the GitHub API to get information like stars and contributors from each GitHub code repository listed in input_lists.json. It also replaces the name of the repository used in the template with the name of the new repository listed in _config.yml.

NOTE: the build.sh script does many calls to the GitHub API, which means it can sometimes take 15-30 minutes to complete its full run!

Lastly, change directory back to the root of the directory by running cd ../../ and then run bundle exec jekyll serve to start up a server that will show a local version of the webpage at http://127.0.0.1:4000/nameOfYourRepositoryThatWasSetInConfigYamlFile.

GithubActions

The steps to gather data from the Awesome List defined in the _config.yml file and rebuild the page is also built into a GitHubActions configuration file at .github/workflows/update.yml.

Currently, this is set to only run on manual trigger or what's called workflow dispatch. Once changes are stabled, the GitHubActions scripts could be used to pull in new data from the Awesome list on some predetermined timing or upon pull request to keep the page visualizations up to date with the Awesome List it pulls from.

Tips

The gems in your sourcefile get updated frequently. It is a good idea to occasionally run bundle update from within your project's root directory to make sure the software on your computer is up to date.

Sometimes there can be dependency conflicts if your local version of Ruby is different from this repo or github pages deployment settings. You can find the version number of each of GitHub Page's current dependency's here. You can often avoid dependency issues if you use the same versions, including for Ruby.

As an example, the default version of Ruby used to deploy GitHub Pages on github.com as of 2021-04-08 was Ruby 2.7.1. If you tried running Ruby version 3.0.0 locally on macOS, you'll need to do some extra steps to correctly install the dependencies for this repository. You'd need to run bundle add webrick as it is no longer a prepackaged dependency with Ruby in 3.0.0. You may also need to run gem install eventmachine -- --with-openssl-dir=/usr/local/opt/openssl@1.1 as MacOS >10.14 doesn't use openssl from the same path as is still assumed to be in by eventmachine.

How to get additional code repositories tracked not in the awesome list in the _config.yml file?

------------ IN PROGESS ------------

Status

This is still very much an experiment. Please jump in via issues or introduce yourself and your interest.

Contributing to the Template

Please find instructions on contributing feedback, issues, pull requests, discussion, thoughts, etc. at https://github.com/JustinGOSSES/awesome-list-visual-explorer-template/blob/main/CONTRIBUTING.md

Before contributing, please read the Template's code of conduct</>.

Contributing to this repository if not the template

------------ INFORMATION WILL GO HERE ------------

You might want to mention things like whether github actions is being used to update the metadata pull or how frequently the data is pulled.

How to get changes from the template after your repository is already built?

This is still in progress, but the goal is to have things built such that a configuration file could be moved elsewhere, the rest of the repository updated to the latest from the template, the configuration file moved back in, the scripts would be re-run, and the resulting websites and visualizations would be updated with additional visualizations built into the template.