Open dcwalk opened 7 years ago
tools for metadata extraction:
command-line: pdfinfo
Word docs: the document author is visible within Word, but there is also another CLI tool
this resource: http://www.forensicswiki.org/wiki/Document_Metadata_Extraction
exiftool (another command line thing) claims to be able to read metadata from Word DOCX files: http://www.sno.phy.queensu.ca/~phil/exiftool/TagNames/OOXML.html
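If exiftool does read DOCX metadata as it claims, a rough sketch like this could collect the tags for a few files at once. It assumes exiftool is installed and on the PATH and relies on its `-json` output option; the helper names (`parse_exiftool_json`, `exiftool_metadata`) are made up for illustration, and which tags actually come back for a given Word file is untested here.

```python
import json
import subprocess
import sys


def parse_exiftool_json(text):
    """Map each file name to its tags, given the JSON array exiftool prints."""
    records = json.loads(text)
    by_file = {}
    for record in records:
        # exiftool puts the path of each file under the "SourceFile" key.
        tags = {k: v for k, v in record.items() if k != "SourceFile"}
        by_file[record.get("SourceFile", "")] = tags
    return by_file


def exiftool_metadata(paths):
    """Run exiftool once over several files (assumes exiftool is on PATH)."""
    result = subprocess.run(
        ["exiftool", "-json", *[str(p) for p in paths]],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_exiftool_json(result.stdout)


if __name__ == "__main__":
    # Usage (hypothetical file names): python docx_meta.py report1.docx report2.docx
    for name, tags in exiftool_metadata(sys.argv[1:]).items():
        print(name)
        for key, value in tags.items():
            print(f"  {key}: {value}")
```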
From April 18 work sesh: "The 'What Climate Change Means to [STATE]' series are 50 PDF simple fact sheets with pretty good metadata already present. I'm manually using pdfinfo, which can't be right"
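One way to avoid running pdfinfo by hand 50 times might be a small wrapper that loops over a folder and dumps everything into a CSV. This is only a sketch: it assumes pdfinfo (from poppler or xpdf) is on the PATH and that its output is the usual one "Key: value" pair per line. The function names and the CSV layout are made up for illustration, not an agreed format.

```python
import csv
import subprocess
import sys
from pathlib import Path


def parse_pdfinfo_output(text):
    """Turn pdfinfo's 'Key: value' lines into a dict."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, since values such as dates contain colons.
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def pdfinfo(path):
    """Run pdfinfo on one file (assumes pdfinfo is on PATH)."""
    result = subprocess.run(
        ["pdfinfo", str(path)], capture_output=True, text=True, check=True
    )
    return parse_pdfinfo_output(result.stdout)


def write_metadata_csv(pdf_dir, out_file):
    """Write one CSV row per PDF in pdf_dir, one column per metadata field seen."""
    rows = []
    for pdf in sorted(Path(pdf_dir).glob("*.pdf")):
        row = {"File": pdf.name}
        row.update(pdfinfo(pdf))
        rows.append(row)
    # Collect the union of fields, keeping first-seen order.
    fields = []
    for row in rows:
        for key in row:
            if key not in fields:
                fields.append(key)
    writer = csv.DictWriter(out_file, fieldnames=fields, restval="")
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Usage (hypothetical paths): python pdf_meta.py ./fact-sheets > metadata.csv
    write_metadata_csv(sys.argv[1], sys.stdout)
```

Fields missing from a given PDF are left blank in its row, so the sheets can be compared side by side.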