Open 1jamesthompson1 opened 2 weeks ago
ATSB will be straight forward as they use the same naming structure as TAIC. Therefore it is predictable and will result in minimal wasted page loading.
However TSBs naming structure is a bit more complex which could result in quite a large serach space with lots of wasted webpage loading.
I am currently at the point where I have both working theory rtheory except for two problems:
Currently the webscraping works for TAIC only. It does this by using a template and loops through looking at each webpage and seeing if it has report pdf etc.
This technique can be extended for both #254 and #252. There could be a new class built that gives it the template as well as how to actually scrape the report webpage for report pdf and information.