arceli / charter

Organizational document for the Archival of Informal Astronomy Communications
Other
6 stars 6 forks source link

Blog post capture as PDF/A #22

Open libcce opened 8 years ago

libcce commented 8 years ago

@jonathansick @lnielsen locally, we've discussed WARC https://en.wikipedia.org/wiki/Web_ARChive for arceli but seems unsuitable for the project (large file sizes, possible issues with rendering). In several of the conversations we've had, from the folks at PressForward to locally, it seems like everyone falls back to PDF/A. There seem to be several python PDF packages available, any thoughts on the best package to use (ahead of the AAS hack day)? @lnielsen anything form BlogForever that may be of use to this project?

On the side

kelle commented 6 years ago

Right now, all we're sending to Zenodo is a JATS file, which I think is the most flexible format needed for an archive. But the PDF is the current most human readable version and ADS also has infrastructure setup to link to PDFs. I found this http://pdfmyurl.com/html-to-pdf-api which we might be able to use to create the PDF at the same time as the JATS.

Thoughts? @AramZS