Hi, your blog (in the section titled Long Multimodal Input - Paper Reading) shows an example of Aria answering a question based on a PDF file. https://www.rhymes.ai/blog-details/aria-first-open-multimodal-native-moe-model However, when I feed the same PDF file into Aria running locally it outputs "PIL.UnidentifiedImageError: cannot identify image file 'test.pdf'" Does Aria natively support PDF files as input? Am I just not using the right Python library (PIL)? Thanks.
No, the raw inputs for Aria model support image and text only. For the PDF files, you have to preprocess it with PyMUPDF library. Here are some more details about how to process PDF files. @andy8025
Hi, your blog (in the section titled Long Multimodal Input - Paper Reading) shows an example of Aria answering a question based on a PDF file. https://www.rhymes.ai/blog-details/aria-first-open-multimodal-native-moe-model However, when I feed the same PDF file into Aria running locally it outputs "PIL.UnidentifiedImageError: cannot identify image file 'test.pdf'" Does Aria natively support PDF files as input? Am I just not using the right Python library (PIL)? Thanks.