dlt-hub / verified-sources

Contribute to dlt verified sources 🔥
https://dlthub.com/docs/walkthroughs/add-a-verified-source
Apache License 2.0
50 stars 40 forks source link

Add pages resource to notion #413

Open adrianbr opened 3 months ago

adrianbr commented 3 months ago

Source name

Notion

Describe the data you'd like to see

I want to see page contents. Multiple of our users mentioned that's what they need for ML cases.

See filter here, https://github.com/dlt-hub/verified-sources/blob/master/sources/notion/__init__.py#L33

Please group all pages under a single resource, and add the page(document id as a column, so we have a simple access point for the pages. Do not create a table per page as we do with databases (there it makes sense to have separation)

Are you a dlt user?

Yes, I'm already a dlt user.

Do you ready to contribute this extension?

No.

dlt destination

No response

Additional information

No response

adrianbr commented 1 month ago

Rahul reports we only get metadata, could you please have another look? We want the page text contents, so like if this is documentation someone wants to ingest it to vector dbs and "talk to it"

IlyaFaer commented 1 month ago

@adrianbr, @rudolfix, ping