Hello,
you can see in my fork the algorithm I have used for retrieving the datetime from the news. It is based on this package. I have also sorted the collected news in descending order.
A new test script shows how the algorithm handles rubbish from the HTML (usually unparsable words ahead of the date).
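To give a rough idea of the approach, here is a minimal sketch using only the standard library. It is not the code from the fork and does not use the package mentioned above: the function names, the `Mar 5, 2021` date shape, and the (title, raw date) input format are all assumptions for illustration, and "descending" is taken to mean newest first.

```python
import re
from datetime import datetime

# Assumed date shape: "Mar 5, 2021" (English month abbreviation, day, year).
# Real feeds may use other shapes; this pattern is only for illustration.
DATE_PATTERN = re.compile(r"[A-Z][a-z]{2} \d{1,2}, \d{4}")


def extract_datetime(raw):
    """Skip any words ahead of the date and parse the first date found.

    Returns None when no date can be parsed from the string.
    """
    match = DATE_PATTERN.search(raw)
    if match is None:
        return None
    try:
        # %b depends on the locale; an English/C locale is assumed here.
        return datetime.strptime(match.group(0), "%b %d, %Y")
    except ValueError:
        # The text had the right shape but is not a real date (e.g. Feb 31).
        return None


def sort_news_descending(items):
    """Take (title, raw_date) pairs and return (title, datetime) pairs,
    newest first, with undated items at the end."""
    dated = [(title, extract_datetime(raw)) for title, raw in items]
    return sorted(
        dated,
        key=lambda pair: (pair[1] is not None, pair[1] or datetime.min),
        reverse=True,
    )


if __name__ == "__main__":
    news = [
        ("first", "Reuters - Mar 5, 2021"),
        ("second", "By staff Jan 12, 2022"),
        ("third", "no date here"),
    ]
    for title, when in sort_news_descending(news):
        print(title, when)
```

Items whose date cannot be parsed are kept rather than dropped, so they can still be inspected at the end of the list.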
P.S. It can be improved further by removing duplicate copies of the same papers.