Since we do not have access to the database that stores the publications for the original site, it makes sense to use a web crawler to scrape the data from https://ptolemy.berkeley.edu/projects/icyphy/ rather than entering it manually. The crawler first looks at the publications collection page for each year, then follows each publication link to retrieve its details.
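The first step, collecting publication links from a yearly collection page, can be sketched as below. This is a minimal illustration using only the standard library; the `pubs/` URL prefix and the sample HTML are assumptions for demonstration, not the real page structure.

```python
from html.parser import HTMLParser

class PubLinkParser(HTMLParser):
    """Collects href values of anchor tags that point at publication pages.

    The "pubs/" prefix is a hypothetical URL layout used for illustration.
    """
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href", "")
            if href.startswith("pubs/"):
                self.links.append(href)

# Hypothetical fragment of a yearly publications collection page.
sample = ('<ul><li><a href="pubs/123.html">Paper A</a></li>'
          '<li><a href="pubs/456.html">Paper B</a></li></ul>')

parser = PubLinkParser()
parser.feed(sample)
print(parser.links)  # → ['pubs/123.html', 'pubs/456.html']
```

In the actual crawler, the HTML would be fetched per year (e.g. with `urllib.request`) and each collected link visited in turn.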
The idea is to use a general schema to store each publication (title, journal, year, etc.) and generate the citation styles on the fly, instead of storing three citation styles directly in the publication entry, which reduces the burden on data entry. To achieve this, the crawler looks at the BibTeX of each publication, which is the closest available representation of such a general schema, and parses it with a BibTeX parser. It then generates .md files in the _publications directory.
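The BibTeX-to-Markdown step can be sketched roughly as follows. The regex-based field extraction is a simplification standing in for a real BibTeX parser, and the front-matter keys and sample entry are hypothetical; the point is only to show the general-schema idea.

```python
import re

def parse_bibtex(entry: str) -> dict:
    """Extract simple key = {value} fields from one BibTeX entry.

    A deliberately minimal stand-in for a full BibTeX parser: it does not
    handle nested braces, quoted values, or string concatenation.
    """
    fields = dict(re.findall(r'(\w+)\s*=\s*\{([^}]*)\}', entry))
    head = re.match(r'\s*@(\w+)\s*\{\s*([^,\s]+)\s*,', entry)
    if head:
        fields["entrytype"], fields["key"] = head.group(1), head.group(2)
    return fields

def to_markdown(fields: dict) -> str:
    """Render the general schema as Jekyll-style front matter for a
    file in the _publications directory (field names are assumptions)."""
    lines = ["---"]
    for k in ("title", "journal", "year"):
        if k in fields:
            lines.append(f"{k}: {fields[k]}")
    lines.append("---")
    return "\n".join(lines)

# Hypothetical BibTeX entry for demonstration.
entry = """@article{sample2020,
  title = {A Sample Paper},
  journal = {A Sample Journal},
  year = {2020}
}"""

print(to_markdown(parse_bibtex(entry)))
```

Because the .md file stores only the schema fields, each citation style can be rendered from the same front matter at site-build time instead of being typed three times.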