lifan0127 / ai-research-assistant

Aria is Your AI Research Assistant Powered by GPT Large Language Models
GNU Affero General Public License v3.0
794 stars 33 forks source link

"Paper doesn't provide abstract or additional information", is PDF not considered in GPT4 call? #69

Open artofbeinghuman opened 6 months ago

artofbeinghuman commented 6 months ago

Hi,

I used ARIA so far to get an insight into my libraries' papers. Now I have a paper and its PDF in Zotero but by chance no abstract in the abstract property field and it throws this error/answer: "The paper with item ID 802 does not provide an abstract or additional information that would allow for a summary of its key arguments."

I thought ARIA sends the attached PDF to GPT4 for analysis, so not having the abstract shouldn't be a dealbreaker.

Would you please explain, what is happening under the hood? Are the pdfs taken into account, and if so, why this answer?

Thank you! Marvin

lifan0127 commented 6 months ago

Hi @artofbeinghuman the PDF full text is no longer automatically fed to GPT-4 at the moment, because it consumes a lot of tokens ($), significantly increases the processing time and doesn't always work as expected, which upset some earlier users.

I still very much like to support this important feature, just that I haven't figured out what I think is an acceptable user experience. Your suggestions are welcome.

Right now, if you can locate the relevant section within a PDF to summarize, you can feed it to GPT-4 vision: https://github.com/lifan0127/ai-research-assistant#visual-analysis-gpt-4-vision

richardkaplan commented 4 months ago

Could inclusion of PDFs be a user option? There are certainly situations where it is desirable.

lifan0127 commented 4 months ago

Hi @richardkaplan, now that certain GPT models can handle much longer context window (32K or 128K) than one year ago, we are going to enable PDF fulltext option in the next major release (March or April).

richardkaplan commented 4 months ago

Thanks - much appreciated