abstract_retriever.py - Githubissues

vanessaliup commented 3 years ago

Hi David, In this file 'abstract_retriever.py', for this part blob = { 'id': data['id'], # str 'title': data['title'], # str 'abstract': self._sentencize(data["abstract"]), # List[str] 'journal': data['journal'], # str 'url': new_url, 'authors': data['authors'], # List[str] } out.append(blob)

I wonder if there is a data field as 'content'? I tried to extract the contents from those documents in this way but it generated some errors. If I want to extract the contents from the documents, what should I do? Thank you!

dwadden commented 3 years ago

Just to clarify - it sounds like the code works fine at present, but you're interested in extracting additional data? Is that correct?

If that's the case, I unfortunately can't offer support. The abstract_retriever relies on http://covidex.ai/ to do the queries. You'll have to inspect the Covidex docs for more information on their API.

But if there's a bug in the SciFact code, let me know and I can try to help.

dwadden commented 3 years ago

Closing due to lack of activity.

allenai / scifact

abstract_retriever.py #18