Closed MylanaBulat closed 1 year ago
"The soup getter has been swapped back to xml as beautiful soup allows for replacing tags with text, making the fades easy to deal with. Currently can print out the titles of acts in a given year"
Some xml files have the description in uncommon places. Currently writing a script to search for the descriptions and return them in method get_description().
Completed as at 16:52 17/02/23. Removed method get_description() and now code is called directly from fetch_acts(). Script currently scrapes the names + descriptions of the acts, fixes Irish fada errors, and prints them to the console.
use beautiful soup to extract data from the website,