fandangOrg / fandango

FAke News discovery and propagation from big Data ANalysis and artificial intelliGence Operations

Annotated data ingestion #98

Open mmagaldi-eng opened 3 years ago

mmagaldi-eng commented 3 years ago

End users need to use annotated data in their pilot use cases. Manually annotated data will therefore first be used to train the NLP models, and will then be ingested and processed to predict all scores.

@macagari will implement a solution to get data from the end-users' spreadsheet and process them as if they were crawled and preprocessed from the web (JSON document compliance).
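A minimal sketch of what such a conversion could look like, assuming the spreadsheet is exported as CSV. The column names, the output keys, and the `;` separator for multiple authors are all hypothetical, since this thread specifies neither the spreadsheet layout nor the JSON schema used for crawled documents. It is not the script mentioned in this issue.

```python
import csv
import io
import json

# Hypothetical mapping from spreadsheet columns to JSON keys.
# The real column names and target schema are not given in this thread.
FIELD_MAP = {
    "Title": "title",
    "Text": "text",
    "URL": "url",
    "Author": "authors",
    "Publisher": "publisher",
    "Label": "annotation",
}


def row_to_document(row):
    """Turn one spreadsheet row into a dict shaped like a preprocessed article."""
    doc = {}
    for column, key in FIELD_MAP.items():
        # Missing or empty cells become empty strings rather than raising.
        doc[key] = (row.get(column) or "").strip()
    # Assumption: several authors are separated by ';' in a single cell.
    doc["authors"] = [a.strip() for a in doc["authors"].split(";") if a.strip()]
    return doc


def spreadsheet_to_documents(csv_text):
    """Parse CSV text with a header row and return one document per data row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row_to_document(row) for row in reader]


if __name__ == "__main__":
    sample = (
        "Title,Text,URL,Author,Publisher,Label\n"
        "Example headline,Example body,http://example.org/a,Ann; Bob,Example Press,fake\n"
    )
    for document in spreadsheet_to_documents(sample):
        print(json.dumps(document))
```

Keeping the column mapping in one dictionary means that a later addition to the annotations file, such as a publication date column, would only require one more entry.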

macagari commented 3 years ago

The script is ready!

mmagaldi-eng commented 3 years ago

Let's have a look on Monday, just to check which services are involved and to make sure that we have considered all data and features.

pstalidis commented 3 years ago

It has now been over a month since the script was ready. What is the status of this?

mmagaldi-eng commented 3 years ago

As far as I know, there are two open issues: 1) the creation of authors (@tavitto16, please update us on this), and 2) the extraction of the publication date (since we're not using crawlers). @vittorianovancini is working to extract this data and provide it in a new field of the annotations file.

mmagaldi-eng commented 3 years ago

@tavitto16, why can't you create an author instance if Camila passes you the author's and publisher's names?

What else do you need?