Closed ansjin closed 7 years ago
Wow this one's huge! And it looks interesting
Added the header parsing logic of a website to extract entity from it for date Event extraction.
For date event extraction currently consider only wikipedia page(for the time) and extract entity information from the website header. (can be extended to other pages also but for getting good results currently considering wikipedia page) This entity information then can be used to store the events against the already existing entities by group 1.
PS: Currently tested on local Db and its working but not the linking of entities to date event extraction as my local db doesn't have the entities in it. Now this branch can be merged as all earlier things are working fine with some new additions also tested. So please complete the review,
Let's get this in as quickly as possible, so we can move forward. There are more important topics than refactoring at the moment.
Code Refactoring
PS: Don't merge, still testing
Update 21:04:2017, 16:35
Done testing