Closed metasj closed 5 years ago
Titles and filenames are the source of the most feature requests from users of the current system. We look for "Title" field metadata in PDFs and HTML, but often the implicit title is stored in some other way -- the first line of text; the first H1; &c. Many uploads are exports from Word or PowerPoint, which don't always have clean metadata.
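To make the fallback order above concrete for HTML uploads, here is a minimal sketch using only the Python standard library. It is not the current system's code: `guess_title` and the helper class are hypothetical names, and the order (Title metadata, then first H1, then first line of visible text) is simply taken from the list in this comment.

```python
from html.parser import HTMLParser


class _TitleCandidates(HTMLParser):
    """Collects <title> text, the first <h1> text, and the first visible text line."""

    def __init__(self):
        super().__init__()
        self._stack = []        # open tags we track: title, h1, script, style
        self.title = ""
        self.first_h1 = ""
        self.first_text = ""
        self._h1_done = False   # set once the first <h1> has closed

    def handle_starttag(self, tag, attrs):
        if tag in ("title", "h1", "script", "style"):
            self._stack.append(tag)

    def handle_endtag(self, tag):
        if self._stack and self._stack[-1] == tag:
            self._stack.pop()
            if tag == "h1":
                self._h1_done = True

    def handle_data(self, data):
        current = self._stack[-1] if self._stack else None
        if current in ("script", "style"):
            return  # never treat code or CSS as a title candidate
        if current == "title":
            self.title += data
            return
        if current == "h1" and not self._h1_done:
            self.first_h1 += data
        text = data.strip()
        if text and not self.first_text:
            self.first_text = text.splitlines()[0].strip()


def guess_title(html):
    """Return the best title guess for an HTML string, or None if nothing usable."""
    parser = _TitleCandidates()
    parser.feed(html)
    parser.close()
    # Assumed priority: explicit metadata first, then implicit titles.
    for candidate in (parser.title, parser.first_h1, parser.first_text):
        cleaned = " ".join(candidate.split())  # collapse runs of whitespace
        if cleaned:
            return cleaned
    return None
```

A guess like this would only be a default; the uploader should still be able to override it.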
I think (@isTravis ?) that we also want to let users edit/set the publication date of each document. This often isn’t found in metadata and can be lied about anyway.
> Titles and filenames are the source of the most feature requests from users of the current system.
(Emphasis mine.) Editing the title makes sense, but by filename, do you mean folks want to be able to edit the filename that appears here: And if so, two questions:
zb2rhbMrTKVRJ6GwfDUGExo7RDi6kdzumSio5VPVDyQsBTtyy.pdf
). Would we want to ensure the downloaded filename matched whatever the user had set?

I feel like the filename is a property of an individual file, which should not be editable. Sensitive information may accidentally end up in a filename, so maybe we let users hide it, but IMHO that's not a priority.
Titles and publication dates are properties of the document (which span multiple files/versions) and should be user-editable (we just try to guess at good ones from the metadata).
This is relevant to a recent change in the database model where I moved fileName from the Documents table to the Assertions table.
I've heard two use cases regarding filenames - and I'm not sure I'm sympathetic to either of them.
Regardless of whether either of these is a process we want to support, neither requires renaming the file, so I think we can pass on that ability.
Regarding publication date: yes, we do want to allow them to set that. It is a value scraped from the doc (so it can't be verified anyway), and it is important that the uploader be able to assert the publication date accurately. We store a separate uploadDate (well... just createdAt, right?) that is not editable.
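As a rough illustration of the split discussed so far (title and publication date editable on the document, createdAt and the per-file filename not), here is a small Python sketch. The class shapes, the `apply_edit` helper, and the `EDITABLE_FIELDS` set are assumptions made for the example; they are not the actual Documents or Assertions schema.

```python
from dataclasses import dataclass, field
from datetime import date, datetime, timezone
from typing import Optional

# Assumed whitelist: only these document fields may be changed by the uploader.
EDITABLE_FIELDS = {"title", "publication_date"}


@dataclass
class Document:
    title: str
    publication_date: Optional[date] = None  # asserted by the uploader
    created_at: datetime = field(            # set once at upload time
        default_factory=lambda: datetime.now(timezone.utc)
    )


@dataclass(frozen=True)
class Assertion:
    """Hypothetical per-file record; frozen so file_name cannot be reassigned."""
    document_id: int
    file_name: str


def apply_edit(doc, field_name, value):
    """Apply a user edit, rejecting any field outside the whitelist."""
    if field_name not in EDITABLE_FIELDS:
        raise PermissionError(f"{field_name} is not user-editable")
    setattr(doc, field_name, value)
    return doc
```

For instance, `apply_edit(doc, "publication_date", date(2019, 1, 1))` succeeds, while `apply_edit(doc, "created_at", ...)` raises `PermissionError`.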
Uploaders would like to preserve their original filename, and to be able to edit the title.
Allow uploaders to edit the titles of docs they upload, for instance at the end of the upload process. We will also want to be able to bulk update some titles with scripts, for instance those with blank titles and HTML pages with long, unhelpful ones.
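The bulk update could look something like the sketch below. It assumes documents are available as plain dicts with `id` and `title` keys, and the 120-character cutoff for a "long, unhelpful" title is a made-up threshold; `needs_new_title` and `bulk_retitle` are hypothetical names. The `propose` callback stands in for whatever title-guessing logic is used.

```python
from typing import Callable, Iterable, List, Optional, Tuple

MAX_TITLE_LENGTH = 120  # hypothetical cutoff for "long, unhelpful" titles


def needs_new_title(title: Optional[str], max_length: int = MAX_TITLE_LENGTH) -> bool:
    """True for missing, blank, or overly long titles."""
    if title is None or not title.strip():
        return True
    return len(title.strip()) > max_length


def bulk_retitle(
    docs: Iterable[dict],
    propose: Callable[[dict], Optional[str]],
) -> List[Tuple[int, Optional[str], str]]:
    """Retitle flagged docs in place; return (id, old_title, new_title) per change."""
    changes = []
    for doc in docs:
        old_title = doc.get("title")
        if not needs_new_title(old_title):
            continue  # leave acceptable titles alone
        new_title = propose(doc)
        # Only accept a proposal that is itself a usable title.
        if new_title and not needs_new_title(new_title):
            changes.append((doc["id"], old_title, new_title))
            doc["title"] = new_title
    return changes
```

Returning the list of changes makes it easy to review or log what a script did before writing anything back to the database.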