huridocs / uwazi

Uwazi is a web-based, open-source solution for building and sharing document collections
http://www.uwazi.io
MIT License
240 stars 79 forks source link

Add the ability to tag parts of images #925

Open kjantin opened 7 years ago

kjantin commented 7 years ago

For many orgs in the MENA region, OCR scanning doesn't work well on written Arabic documents, so many of these documents are images (in PDF format) of text that can't be read by a computer. Would it be possible to add the functionality to tag parts of an image, so that the user can create a table of contents, create references and connections, etc within the PDF?

txau commented 7 years ago

@kjantin it is possible. And this functionality has been requested before, so I guess we should consider it. Still I wouldn't give any priority to this until we have a real use case.

whyfrycek commented 7 years ago

We had a discussion with the organisation that is interested in this today and they asked about it again. They are getting ready with their instance, so would that be the "real use case"?

danicatalan commented 7 years ago

We can get inspiration on how Presentator tackles this problem. Presentator is an app than allows users make annotations on design screens with two approaches: drawing squares and dropping dots.

screen shot 2017-06-22 at 09 36 58 screen shot 2017-06-22 at 09 35 54

They render the information above the pins in a cool way, but in our case we can render this info on the sidepanel.

image

We also need to consider which approach do we want to implement. I think squares are cooler but maybe dots are easier to implement and solves our problem right now.