Description:
In the context of using gcsfs with llama-index for reading files from a GCP bucket, an error occurs for PDF files indicating that the 'GCSFile' object has no attribute 'name'. This issue does not occur when accessing DOCX files. It suggests that there may be an inconsistency or a bug in how gcsfs handles file metadata, specifically for PDF files.
Please show the full traceback, so we can see what llama_index is trying to do. You may also want to cross-post on their tracker. No GCSFile has a .name, but they do have a .path .
Description: In the context of using
gcsfs
withllama-index
for reading files from a GCP bucket, an error occurs for PDF files indicating that the'GCSFile' object has no attribute 'name'
. This issue does not occur when accessing DOCX files. It suggests that there may be an inconsistency or a bug in howgcsfs
handles file metadata, specifically for PDF files.Version:
gcsfs
: (specify the version you're using, e.g.,2023.6.0
)Steps to Reproduce:
GCSFileSystem
fromgcsfs
to access files in a GCP bucket..pdf
and.docx
files.name
attribute in the case of PDF files.Relevant Logs/Tracebacks: