CenterForOpenScience / pydocx

An extendable docx file format parser and converter

Bug when parsing #263

Open pmeyer-git opened 1 year ago

pmeyer-git commented 1 year ago

Hello,

The attached document makes the parser crash.

    ...
      File "C:\Users\xxxxx\Anaconda3\lib\site-packages\pydocx\export\base.py", line 524, in export_simple_field
        parsed_instr = simple_field.parse_instr()
      File "C:\Users\xxxxx\Anaconda3\lib\site-packages\pydocx\openxml\wordprocessing\simple_field.py", line 41, in parse_instr
        m = self._parse_instr_into_field_type_and_arg_string()
      File "C:\Users\xxxxx\Anaconda3\lib\site-packages\pydocx\openxml\wordprocessing\simple_field.py", line 35, in _parse_instr_into_field_type_and_arg_string
        return re.match('^\s([^\s]+)\s(.*)$', self.instr)
      File "C:\Users\xxxxx\Anaconda3\lib\re.py", line 191, in match
        return _compile(pattern, flags).match(string)
    TypeError: expected string or bytes-like object

I isolated the element that triggers the bug in this document: Pydocx bug.docx

Thanks. Pat
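
For anyone hitting the same traceback: the error is what `re.match` raises when its subject is not a string. Here is a minimal sketch that reproduces it outside pydocx, assuming `self.instr` is `None` for the offending field (the traceback does not confirm this). The helper `parse_instr_safely` is hypothetical and only illustrates a possible guard; it is not the fix applied in pydocx. The pattern is copied as it appears in the traceback.

```python
import re

# Pattern copied verbatim from the traceback.
PATTERN = r'^\s([^\s]+)\s(.*)$'


def parse_instr_safely(instr):
    """Hypothetical guard (not pydocx's actual fix).

    Returns None instead of raising when instr is not a string.
    """
    if not isinstance(instr, str):
        return None
    return re.match(PATTERN, instr)


# Reproduce the failure: re.match rejects a None subject.
# Assumption: instr is None for the field that crashes the parser.
try:
    re.match(PATTERN, None)
    error_message = None
except TypeError as exc:
    error_message = str(exc)

# With the guard, a missing instruction yields None rather than an exception.
guarded = parse_instr_safely(None)

# A well-formed instruction string still parses as before.
parsed = parse_instr_safely(' HYPERLINK "http://example.com"')
```

With a guard like this, a caller would need to treat a `None` result as "no field instruction" and skip the field rather than unpack the match.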

pmeyer-git commented 1 year ago

Version used: pydocx 0.9.10

pmeyer-git commented 1 year ago

This seems to be a duplicate of a previous issue. It has already been fixed, but the fix has not been published yet.