Different result compared to when extracting directly with pdftotext

Don't know if it's an update issue or what but bytes aren't the only problem. textract:

91. Registration number is the official set of numbers and letters shown on the front and back of vehicle on the
__________________________________.
а)
b)
c)
d)

Licence plate.
Number board.
Register table.
Number place.

It's like this in the bytes too: xd0\xb0)\nb)\nc)\nd)\n\nLicence plate.\nNumber board.\nRegister table.\nNumber place. pdftotext:

91. Registration number is the official set of numbers and letters shown on the front and back of vehicle on the
    __________________________________.
      а)   Licence plate.
     b)    Number board.
      c)   Register table.
     d)    Number place.

Here, try it yourself:

Python program that saves the results of converting files using pdftotext and textract into different files: https://github.com/Filip98/congenial-bassoon/blob/master/a.py Sample file: https://nissrednjastrucna.edu.rs/data/documents/Opsti-deo.pdf

deanmalmgren / textract

Different result compared to when extracting directly with pdftotext #330