Text cut every 80 characters in .doc files

I am trying to read a .doc file using textract still every 80 characters, a \n is inserted when the document is read while it is not in the document.

To Reproduce file = "dev/test_textract_80.doc" # path to file text = textract.process(file).decode('utf-8')

where the .doc file contains 0_1_2_3_4_..._98_99_ (every number from 0 to 99 separated with an underscore)

Expected behavior Expected : text -> 0_1_2_3_4_5_6_7_8_9_10_11_12_13_14_15_16_17_18_19_20_21_22_23_24_25_26_27_28_29_30...

Current output : text -> 0_1_2_3_4_5_6_7_8_9_10_11_12_13_14_15_16_17_18_19_20_21_22_23_24_25_26_27_28\n_29_30... (notice the \n before _29)

Desktop

OS: Ubuntu 18.04]
Textract version 1.6.3
Python version 3.8
Virtual environment (yes) - conda

Additional context The only workaround I found was to edit /textract/parsers/doc_parser.py as mentionned here

deanmalmgren / textract

Text cut every 80 characters in .doc files #367