emergencyindex / emergencyindex.com

AN ANNUAL DOCUMENT OF PERFORMANCE PRACTICE
http://emergencyindex.com
GNU General Public License v3.0

improve tags scrape'r'n #52

Closed edwardsharp closed 4 years ago

edwardsharp commented 4 years ago

some improvements to the tags scrape'r

the main thing here: when a term has a base tag, write the base term and the sub-term as individual tags instead of concatenating the base term onto the sub-term. for example, given the index entry "communication 25, 53, 111, 127, 185, 207, 209, 225, 229, 265, 359, 377, 433, 463, 465, 561; tele-communication 431", the scrape'r previously wrote project 431 with a single tag "communication tele-communication"; it now writes two tags: "communication" and "tele-communication".

other examples of concatenated tags that were repetitive and maybe didn't really make sense: "love lover", "religion Catholicism", "voice voice over". see the changes to the 2018 project tags from doing it this way over here: https://github.com/emergencyindex/projects-2018/commit/99f5e67ddbe084f2d8b96e2c0728249fa06e5074
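a minimal sketch of the splitting described above, in python. this is not the scrape'r's actual code: the function names (`parse_index_line`, `tags_by_page`) are hypothetical, and it assumes an index line is a list of "term page, page, ..." entries separated by semicolons, with the first entry as the base term and the numbers identifying projects.

```python
import re
from collections import defaultdict

# Assumed entry shape: a term followed by comma-separated page numbers,
# e.g. "tele-communication 431" or "communication 25, 53, 111".
ENTRY_RE = re.compile(r"^(?P<term>.*?)\s+(?P<pages>\d+(?:\s*,\s*\d+)*)$")


def parse_index_line(line):
    """Split 'base 1, 2; sub 3' into [(term, [pages]), ...]."""
    entries = []
    for chunk in line.split(";"):
        match = ENTRY_RE.match(chunk.strip())
        if match is None:
            continue  # skip anything that does not look like "term 1, 2, 3"
        pages = [int(p) for p in match.group("pages").split(",")]
        entries.append((match.group("term").strip(), pages))
    return entries


def tags_by_page(line):
    """Map each page to its tags, writing base term and sub-term separately."""
    entries = parse_index_line(line)
    if not entries:
        return {}
    base_term = entries[0][0]  # assumption: the first entry is the base term
    tags = defaultdict(list)
    for term, pages in entries:
        for page in pages:
            # A sub-term page gets the base term as its own tag,
            # not glued onto the sub-term.
            if term != base_term and base_term not in tags[page]:
                tags[page].append(base_term)
            if term not in tags[page]:
                tags[page].append(term)
    return dict(tags)
```

with the entry from the description, `tags_by_page("communication 25, 53; tele-communication 431")[431]` gives `["communication", "tele-communication"]`, while pages 25 and 53 only get `["communication"]`.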

this PR also improves (or mostly refactors) the terms scrape'r. the idea is to use a dry-run (which will still output files), but otherwise to write the terms in one command, i.e. without needing to first write to a .json file and then read that file again to write the project .md files.
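a rough sketch of that one-command flow, again in python and again hypothetical. `write_terms`, the `terms.json` file name and the front matter layout are all assumptions for illustration; in particular it guesses that the .json dump is the file a dry-run still outputs, which the description does not spell out.

```python
import json
from pathlib import Path


def write_terms(tags_by_project, out_dir, dry_run=False):
    """Write scraped tags in one pass and return the paths written."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []

    # Assumption: the .json dump is the output a dry run still produces.
    json_path = out_dir / "terms.json"
    json_path.write_text(json.dumps(tags_by_project, indent=2, sort_keys=True))
    written.append(json_path)

    if dry_run:
        return written  # stop before touching any project .md files

    # The tags come straight from memory; the .json file is never read back.
    for project, tags in sorted(tags_by_project.items()):
        md_path = out_dir / f"{project}.md"
        # Assumed layout: a YAML front matter block holding a list of tags.
        lines = ["---", "tags:"] + [f"  - {tag}" for tag in tags] + ["---", ""]
        md_path.write_text("\n".join(lines))
        written.append(md_path)
    return written
```

calling `write_terms(tags, "out", dry_run=True)` only writes the .json file; dropping `dry_run` writes the .json file and the project .md files in the same call.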

still #TODO: