NYCPlanning / data-engineering

Primary repository for NYC DCP's Data Engineering team

Dataset READMEs and Data Dictionaries from metadata #561

Open alexrichey opened 9 months ago

alexrichey commented 9 months ago
damonmcc commented 8 months ago

Found an example of our code converting a Markdown file to a PDF here

alexrichey commented 2 months ago

@damonmcc Is this the correct issue, or is there another?

damonmcc commented 1 month ago

Looking into using WeasyPrint (already the PDF engine we use with pandoc) directly to do HTML + CSS = PDF
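
A minimal sketch of the HTML + CSS = PDF idea, assuming WeasyPrint is installed. The helper names (`build_html`, `write_pdf`) and the sample stylesheet are hypothetical and not from the repo; the only library calls used are `weasyprint.HTML(string=...).write_pdf(...)` and `weasyprint.CSS(string=...)`.

```python
from html import escape

# Hypothetical stylesheet for illustration; real styling would likely live in a .css file.
SAMPLE_CSS = "body { font-family: sans-serif; } h1 { color: #333; }"


def build_html(title: str, paragraphs: list[str]) -> str:
    """Build a small standalone HTML document from plain-text inputs."""
    body = "\n".join(f"<p>{escape(p)}</p>" for p in paragraphs)
    return (
        "<!DOCTYPE html>\n"
        f'<html><head><meta charset="utf-8"><title>{escape(title)}</title></head>\n'
        f"<body><h1>{escape(title)}</h1>\n{body}\n</body></html>"
    )


def write_pdf(html: str, css: str, output_path: str) -> None:
    """Render an HTML string plus a CSS string to a PDF file with WeasyPrint."""
    # Imported lazily so build_html stays usable without WeasyPrint installed.
    from weasyprint import CSS, HTML

    HTML(string=html).write_pdf(output_path, stylesheets=[CSS(string=css)])
```

Calling `write_pdf(build_html("Example Dataset", ["Generated from metadata."]), SAMPLE_CSS, "example.pdf")` would then write the PDF; the file name is a placeholder.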

WeasyPrint samples here

alexrichey commented 1 month ago

@damonmcc Here's how I'd go about the remainder of this task. Let's chat when you have a minute. In terms of timeline, the next check-in with Amanda+Matt+myself is Thursday next week, and I'd like to show progress on this item. POC/WIP is fine 🙂

Data Dictionary (All fields come from metadata)
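
One way the data dictionary could be built purely from metadata is sketched below. The metadata shape (`name`, `columns`, and the per-column keys) is an assumption for illustration, since the actual schema is not shown in this thread, and `data_dictionary_html` is a hypothetical helper.

```python
from html import escape

# Hypothetical metadata shape; the real schema is not shown in this thread.
EXAMPLE_METADATA = {
    "name": "example_dataset",
    "columns": [
        {"name": "bbl", "data_type": "text", "description": "Borough, block, and lot"},
        {"name": "year_built", "data_type": "integer", "description": "Year of construction"},
    ],
}

# Column attributes shown in the data dictionary, in display order (assumed).
FIELDS = ("name", "data_type", "description")


def data_dictionary_html(metadata: dict) -> str:
    """Render one HTML table row per column described in the metadata."""
    header = "<tr>" + "".join(f"<th>{escape(f)}</th>" for f in FIELDS) + "</tr>"
    rows = [
        "<tr>"
        + "".join(f"<td>{escape(str(col.get(f, '')))}</td>" for f in FIELDS)
        + "</tr>"
        for col in metadata["columns"]
    ]
    table = "\n".join([header, *rows])
    return f"<h1>{escape(metadata['name'])}</h1>\n<table>\n{table}\n</table>"
```

Because every value comes from the metadata dict, regenerating the document after a metadata change needs no manual edits.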

README

Most fields come from metadata, except the changelog. Potentially other non-metadata fields too (TBD), e.g. rich text with images.
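
A rough sketch of that split: metadata-driven fields are passed in one argument, and the changelog, which does not live in metadata, is passed separately. The keys `display_name` and `description` and the function `render_readme` are hypothetical.

```python
def render_readme(metadata: dict, changelog: list[str]) -> str:
    """Build README text in Markdown from metadata plus a separately supplied changelog."""
    lines = [
        f"# {metadata['display_name']}",  # assumed metadata key
        "",
        metadata["description"],  # assumed metadata key
        "",
        "## Changelog",
        "",
    ]
    # Changelog entries come from outside the metadata, per the plan above.
    lines += [f"- {entry}" for entry in changelog]
    return "\n".join(lines) + "\n"
```

Other non-metadata content, such as rich text with images, could be added as further arguments in the same way.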

For inlining, potentially use something like this: https://pypi.org/project/css-inline/