Humorloos / IE683


Escape Special Characters in XML files #49

Closed Humorloos closed 2 years ago

Humorloos commented 2 years ago

Has dependency: #37

Deadline: 11-08

ashishrana160796 commented 2 years ago

I have updated the Colab notebook so that it replaces the "&" character with "and". After much thought, I am leaving any other special characters in the data as they are. The Python parsing below was failing earlier and now works after the "&" replacement.

# Check that the document parses with the default UTF-8 encoding.
# MOVIES_DATA_DIR is assumed to be a pathlib.Path defined earlier in the notebook.
import xml.etree.ElementTree as ET
root = ET.parse(MOVIES_DATA_DIR.joinpath('netflix.xml')).getroot()
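For reference, the replacement step described above could look roughly like the sketch below. It is only an illustration: the sample XML string and the helper names (`replace_ampersands`, `escape_bare_ampersands`, `parses`) are hypothetical and not taken from the notebook. The second helper is an alternative that was not used here; it escapes a bare "&" as `&amp;` instead of rewriting it to "and".

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample; the real netflix.xml content is not shown in this thread.
RAW_XML = "<movies><movie><title>Love & Other Drugs</title></movie></movies>"


def replace_ampersands(xml_text: str) -> str:
    """Replace every '&' with 'and', as described in the comment above."""
    return xml_text.replace("&", "and")


def escape_bare_ampersands(xml_text: str) -> str:
    """Alternative (an assumption, not what the notebook does):
    escape '&' as '&amp;' unless it already starts an entity reference."""
    pattern = r"&(?!(?:amp|lt|gt|quot|apos|#\d+|#x[0-9A-Fa-f]+);)"
    return re.sub(pattern, "&amp;", xml_text)


def parses(xml_text: str) -> bool:
    """Return True if ElementTree can parse the text, False otherwise."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# The raw text contains a bare '&', so it is not well-formed XML.
raw_ok = parses(RAW_XML)

# Both cleaned variants parse; only the title text differs.
replaced_root = ET.fromstring(replace_ampersands(RAW_XML))
escaped_root = ET.fromstring(escape_bare_ampersands(RAW_XML))
```

Note that a plain `str.replace` also rewrites any entity references that are already valid (for example `&amp;` would become `andamp;`), so it is only safe if the data contains no such entities.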

Waiting for feedback from @subashp93 on the Java XMLReader code execution; after that I can update the repository and close this issue as well. Thanks!

ashishrana160796 commented 2 years ago

Issue closed with commit 833224d; basic XML loading testing is done.

Sorry, I accidentally pushed directly instead of opening a PR. Thanks!