Open ghost opened 5 years ago
Nice one. The grouping is per matrix. So if a run contains more text than you need, the matrix for the part you want is the run's original matrix translated by the width of the text that comes before it, and its end is that start translated further by the width of the text itself. You could probably tinker with the text extraction code so that it also returns the individual character boxes as an array, and then you can do the math.
Can you help me modify the extraction code?
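For anyone wanting to see "the math" spelled out, here is a minimal sketch in Python. It is not the library's extraction code. It assumes you have already obtained the run's matrix as `[a, b, c, d, e, f]` and one advance width per character in the same units the matrix applies to. `word_box`, `char_widths` and `height` are hypothetical names, and the box is measured up from the baseline, so descenders are ignored.

```python
from itertools import accumulate


def transform(matrix, x, y):
    """Apply a PDF-style matrix [a, b, c, d, e, f] to the point (x, y)."""
    a, b, c, d, e, f = matrix
    return (a * x + c * y + e, b * x + d * y + f)


def word_box(matrix, char_widths, start, end, height):
    """Return (left, bottom, right, top) for characters start..end-1 of a run.

    Assumption: char_widths[i] is the advance of character i, and the run
    is laid out left to right along its own x axis starting at x = 0.
    """
    # offsets[i] is the x position (in run space) where character i begins.
    offsets = [0.0, *accumulate(char_widths)]
    x0, x1 = offsets[start], offsets[end]
    # Transform all four corners so rotated or skewed runs are still covered.
    corners = [transform(matrix, x, y) for x in (x0, x1) for y in (0.0, height)]
    xs = [p[0] for p in corners]
    ys = [p[1] for p in corners]
    return (min(xs), min(ys), max(xs), max(ys))


# Hypothetical run "foo bar" placed at (100, 700), every glyph 6 units wide.
run_matrix = [1, 0, 0, 1, 100, 700]
widths = [6.0] * 7
box = word_box(run_matrix, widths, 4, 7, 10.0)  # characters 4..6 are "bar"
print(box)
```

The rectangle for "bar" starts 24 units into the run (four characters of 6 units each), which is the "translation by the width of text before it" described above.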
@Ingokoepp did you ever manage to do this?
I am looking to find specific words within a PDF and then redact them.....
I have the same situation. Did you find a solution?
Alas no, not to date anyway.
It has been de-prioritised for our use case for now (it will come back), but all our research suggested that, basically, it's a bitch to do!
Hi, I'm trying to find a way to get the matrix of a specific word or regex match. I used the text extraction example to get the row where my word is.
I don't know how I could use this to get the matrix of the word, so that I can draw the rectangle over just the word instead of over the whole row.
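One possible way to narrow a row down to a word or regex match, sketched in Python under stated assumptions: you have the row's text plus one advance width per character (which the stock extraction example may not give you without modification), and string indices line up one-to-one with glyphs. `match_extents` and `char_widths` are hypothetical names, not part of any library.

```python
import re
from itertools import accumulate


def match_extents(text, char_widths, pattern):
    """Return (matched_text, x_start, x_end) for every regex match in a row.

    x_start and x_end are horizontal offsets from the start of the row,
    in the same units as char_widths (assumed: one width per character).
    """
    # offsets[i] is the distance from the row start to character i.
    offsets = [0.0, *accumulate(char_widths)]
    return [
        (m.group(), offsets[m.start()], offsets[m.end()])
        for m in re.finditer(pattern, text)
    ]


# Hypothetical row where every glyph is 5 units wide.
row = "Name: Jane Doe, Ref 12345"
widths = [5.0] * len(row)
hits = match_extents(row, widths, r"\d+")
print(hits)
```

Each pair of offsets can then be used to translate the row's matrix to the start of the match and to size the rectangle, rather than covering the whole row.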