The goal is to find the HTML page that links to the PDF found on BankTrack's website. One approach is to search (e.g. with Google) for the PDF name, take the first n results, and check each of those pages for a link to the PDF. Another approach would be to scrape the entire bank website, but BankTrack sometimes does not save the “correct” PDF name, so matching on the file name alone may miss it; keywords could be used to create good heuristics.
Best to start with the first approach, or a similarly simple one. This may be a tough one!
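The "check each page for a link to the PDF" step of the first approach could look roughly like the sketch below. It is only an illustration under stated assumptions: the candidate pages have already been downloaded (the search step itself is left out), the match is on the file name only and ignores case, and the function names and the bank.example URLs are hypothetical.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import unquote, urljoin, urlparse


class _HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def pdf_links_in_page(html, page_url, pdf_name):
    """Return absolute link URLs in `html` whose file name equals `pdf_name`.

    Assumption: comparing the last path segment, case-insensitively and
    after URL-decoding, is a good enough match for a first attempt.
    """
    collector = _HrefCollector()
    collector.feed(html)
    wanted = pdf_name.lower()
    matches = []
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        file_name = posixpath.basename(unquote(urlparse(absolute).path))
        if file_name.lower() == wanted:
            matches.append(absolute)
    return matches


def pages_linking_to_pdf(candidate_pages, pdf_name):
    """Filter a {page_url: html} mapping down to pages that link to the PDF."""
    return [
        url
        for url, html in candidate_pages.items()
        if pdf_links_in_page(html, url, pdf_name)
    ]


if __name__ == "__main__":
    # Hypothetical candidate pages, standing in for the first n search results.
    pages = {
        "https://bank.example/sustainability":
            '<a href="/docs/Policy%202020.pdf">Policy</a>',
        "https://bank.example/news": '<a href="/about">About</a>',
    }
    print(pages_linking_to_pdf(pages, "policy 2020.pdf"))
```

Because BankTrack may store the PDF under a different name, an exact file-name match like this will miss some cases; a keyword-based comparison could replace the equality check in `pdf_links_in_page` later.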