Organizing and cataloging datasets and data access methods used in OHW projects.
Here are my thoughts on this project: It would be cool to have a list of all the datasets OHW participants have used over the years, along with some code examples of how to use them. Data access and wrangling are valuable skills to have and to struggle with, but finding the right dataset or navigating server requests can be a barrier to success during the condensed timeline of OHW. I figure if we can organize the projects from past OHW events and make them easily navigable on the website, it'll help future participants get started more quickly with the data they're looking for.
The general plan is to complete and document the full workflow for one project to start with. The workflow might be something like:
We already have organized lists of projects from each year of OHW on the various websites:
OHW_project_list.md is where I'm throwing every project from over the years. Each project's title links to the corresponding GitHub repository, and as many linked datasets as can be found will be listed with each project.
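As a rough sketch of how the project list could be turned into something searchable, the snippet below parses a markdown excerpt into projects and their linked datasets. The layout it expects (a `## ` heading with a linked title, followed by `- ` bullets with linked datasets) is an assumption, not the confirmed structure of OHW_project_list.md, and the sample titles and `example.org` URLs are placeholders.

```python
import re

# Hypothetical excerpt: the real layout of OHW_project_list.md may differ.
# All titles and URLs below are placeholders, not real OHW projects.
SAMPLE = """\
## [Example Project A](https://example.org/repo-a)
- [Example Dataset 1](https://example.org/data-1)
- [Example Dataset 2](https://example.org/data-2)

## [Example Project B](https://example.org/repo-b)
"""

# Matches a markdown link of the form [text](url).
LINK = re.compile(r"\[([^\]]+)\]\(([^)]+)\)")


def parse_project_list(text):
    """Return a list of dicts with keys: title, repo, datasets."""
    projects = []
    for line in text.splitlines():
        match = LINK.search(line)
        if match is None:
            continue  # skip blank lines and lines without a link
        name, url = match.groups()
        if line.startswith("## "):
            # Assumed convention: a heading link is a project title and repo.
            projects.append({"title": name, "repo": url, "datasets": []})
        elif line.lstrip().startswith("- ") and projects:
            # Assumed convention: a bullet link is a dataset of the last project.
            projects[-1]["datasets"].append((name, url))
    return projects


projects = parse_project_list(SAMPLE)
for project in projects:
    print(project["title"], "->", len(project["datasets"]), "dataset(s)")
```

If the real file uses a different layout (a table, for example), only the two branches inside the loop would need to change; the resulting list of dictionaries could then feed a website index or a search tool.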
An AI chatbot to assist OceanHackWeek participants with questions about projects, datasets, and methods.