The script could iterate all pubs in the bucket_stage, and output key phrases in the separate JSON files in file abstract_keyphrases and a report listing the name of pubs without abstract. The key phrase result is very promising. Please tell me if this is what you want or where should I improve it.
For next steps, I will look into how I could get these missing abstracts from APIs and update them in the bucket_stage to have a more coverage of abstract.
Hi @ceteri ,
The script could iterate all pubs in the
bucket_stage
, and output key phrases in the separate JSON files in fileabstract_keyphrases
and a report listing the name of pubs without abstract. The key phrase result is very promising. Please tell me if this is what you want or where should I improve it.For next steps, I will look into how I could get these missing abstracts from APIs and update them in the
bucket_stage
to have a more coverage of abstract.