jonaswinkler / paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents
https://paperless-ng.readthedocs.io/en/latest/
GNU General Public License v3.0

[BUG] Markdown no longer recognized #1378

Open PaulWoitaschek opened 2 years ago

PaulWoitaschek commented 2 years ago

Using Paperless 1.5.0, Markdown files (.md) are no longer picked up. I assume it's due to this change: https://github.com/jonaswinkler/paperless-ng/commit/f51207fc32df05d04f6f48a9bdd60de988fb1481#diff-d9b2f6919a66c2c55977a823c2d38697c3a41a91af224f8e3562c0dcb5d93ed0 In the old paperless, they were indexed as plain text files.

It would be great if Markdown support came back, as I organize all my letters using Markdown files and pandoc templates.

amenk commented 2 years ago

Probably it would be enough to add

"text/markdown": "*.md"

here:

https://github.com/jonaswinkler/paperless-ng/commit/f51207fc32df05d04f6f48a9bdd60de988fb1481#diff-d9b2f6919a66c2c55977a823c2d38697c3a41a91af224f8e3562c0dcb5d93ed0R10

The question is whether there are other text file types that should be supported. For example, *.rst (reStructuredText) is a candidate that people might expect to be indexed. And if we support *.csv, then *.tsv might also make sense.

But this should be pretty much it.
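To make the suggestion concrete, here is a minimal, self-contained sketch of a MIME-type-to-pattern mapping extended with the entries discussed above. It is not the actual paperless-ng code: the names `TEXT_MIME_TYPES` and `guess_text_mime_type` are hypothetical, the `text/plain` entry is assumed, and `text/x-rst` and `text/tab-separated-values` are my guesses at suitable MIME types for *.rst and *.tsv.

```python
from fnmatch import fnmatch
from typing import Optional

# Hypothetical mapping of MIME type -> filename pattern, mirroring the
# '"text/markdown": "*.md"' style of entry proposed above. This is an
# illustration only, not the mapping used by paperless-ng.
TEXT_MIME_TYPES = {
    "text/plain": "*.txt",                 # assumed to be supported already
    "text/csv": "*.csv",                   # mentioned as supported in this thread
    "text/markdown": "*.md",               # the proposed addition
    "text/x-rst": "*.rst",                 # assumption: a MIME type often used for reStructuredText
    "text/tab-separated-values": "*.tsv",  # assumption: MIME type for TSV files
}


def guess_text_mime_type(filename: str) -> Optional[str]:
    """Return the MIME type whose pattern matches the filename, or None."""
    name = filename.lower()  # match extensions case-insensitively
    for mime_type, pattern in TEXT_MIME_TYPES.items():
        if fnmatch(name, pattern):
            return mime_type
    return None
```

With a mapping like this, `guess_text_mime_type("letter.md")` would resolve to `text/markdown`, while a file such as `scan.pdf` would not be treated as a text document. Whether the real consumer matches by filename pattern or by detected content type is something to check against the linked commit.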