Open mmagaldi-eng opened 3 years ago
The script is ready!
Let's have a look on Monday, just to check which services are involved and to make sure that we have considered all data and features.
It has been over a month since the script was ready. What is the status of this?
As far as I know, there are two open issues: 1) the creation of authors (@tavitto16, please update us on this); 2) the extraction of the publication date (since we're not using crawlers). @vittorianovancini is working on extracting this data and providing it in a new field of the annotations file.
@tavitto16, why can't you create an author instance if Camila passes you the author's and publisher's names?
What else do you need?
End users need to use annotated data in their pilot use cases. Manually annotated data will therefore first be used to train the NLP models, and will then be ingested and processed to predict all scores.
@macagari will implement a solution that reads data from the end users' spreadsheet and processes it as if it had been crawled and preprocessed from the web (i.e. as compliant JSON documents).
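To make the spreadsheet-to-JSON step concrete, here is a rough sketch of what it could look like. It is not the actual implementation: it assumes the spreadsheet is exported as CSV, and the column headers, the JSON field names (`COLUMN_TO_FIELD`) and the function name `rows_to_documents` are all hypothetical placeholders, since the real document schema is not described in this thread.

```python
import csv
import io
import json

# Hypothetical mapping from spreadsheet column headers to JSON document fields.
# The real schema expected by the pipeline may differ.
COLUMN_TO_FIELD = {
    "Title": "title",
    "Text": "text",
    "Author": "author",
    "Publisher": "publisher",
    "Publication date": "publish_date",
    "URL": "url",
}


def rows_to_documents(csv_text):
    """Convert CSV rows (one annotated article per row) into JSON-ready dicts."""
    reader = csv.DictReader(io.StringIO(csv_text))
    documents = []
    for row in reader:
        doc = {}
        for column, field in COLUMN_TO_FIELD.items():
            # Empty or missing cells become None so downstream services
            # can tell "not provided" apart from an empty string.
            value = (row.get(column) or "").strip()
            doc[field] = value or None
        documents.append(doc)
    return documents


if __name__ == "__main__":
    sample = (
        "Title,Text,Author,Publisher,Publication date,URL\n"
        "A title,Some text,Jane Doe,Example News,2020-01-15,https://example.org/a\n"
    )
    print(json.dumps(rows_to_documents(sample), indent=2))
```

Keeping the column mapping in one dictionary means that a renamed spreadsheet header only needs a change in one place, and the author, publisher and publication date fields discussed above travel with each document.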