File "/opt/airflow/dags/iop/iop_process_file.py", line 21, in iop_parse_file
parsed = parser.parse(xml)
File "/opt/airflow/dags/common/parsing/parser.py", line 99, in parse
return self._generic_parsing(publisher_parsed_article)
File "/opt/airflow/dags/common/parsing/parser.py", line 60, in _generic_parsing
"title": clean_whitespace_characters(get("title")),
File "/opt/airflow/dags/common/cleanup.py", line 9, in clean_whitespace_characters
return " ".join(input.split())
AttributeError: 'list' object has no attribute 'split'
Solution: test the corresponding article in QA, and mark the title as the required field.
How it was solved: marked the title as required filed (it has to be according to the schema), parsed the titles with all tags inside, in order not to lose data
The error in the workflow:
Solution: test the corresponding article in QA, and mark the title as the required field.
How it was solved: marked the title as required filed (it has to be according to the schema), parsed the titles with all tags inside, in order not to lose data