Based on previous training we found out 9 out of 30 attributes are important. With limited resources available, we extracted 7 attributes either by using regex or web crawling on Dataset 2.
Some websites in Dataset 2 are down so webcrawling was not possible and that's why not extracted.
Based on Target availability, we didn't extract data from "OTHER" target group.
Data Extraction:
Extracted data can be found here: