airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
https://airbyte.com
Other
15.51k stars 4k forks source link

[source-sharepoint] connector fetches documents only from the main site #37003

Open akozichev opened 5 months ago

akozichev commented 5 months ago

Connector Name

source-microsoft-sharepoint

Connector Version

0.2.2

What step the error happened?

During the sync

Relevant information

Sharepoint portals are structured in a way so they can host multiple site. Currently connector only recognises files hosted in the main site and everything else is ignored. The change is required so connector would iterate through all site when combining the files list.

Relevant log output

No response

Contribute

isaac47 commented 1 month ago

Does the connector also retrieve the permission levels of documents? I'm developing a tool and need to inherit document permission levels to manage user rights effectively.

nataliekwong commented 1 week ago

Grooming notes:

Example default site: https://{{company}}.sharepoint.com/Shared%20Documents/{{file name}} Example custom site: https://{{company}}.sharepoint.com/sites/{{folder_name}}/Shared%20Documents/{{file name}}

fyi @natikgadzhi I added this to the community board, but let me know if the team is not working on file sources

natikgadzhi commented 1 week ago

Yeah, file sources are a bit out of scope for now, but the common bottleneck usually is design partners to verify test builds with. We'll get to this.

natikgadzhi commented 1 week ago

We also potentially have design partners willing to test with us.