crawlab-team / crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks.
https://www.crawlab.cn
BSD 3-Clause "New" or "Revised" License
11.27k stars 1.77k forks

Data Sources #1445

Open Vr1llon opened 7 months ago

Vr1llon commented 7 months ago

Is your feature request related to a problem? Please describe.

I am inserting data into a relational database (PostgreSQL in my case) using data sources. I had to dig into the Crawlab SDK source code to figure out what exactly the Crawlab item pipeline does. For example: Does it create the table or not? Does it insert the _tid and _pid or not? Which table does Crawlab look for before insertion? And what defines the column names? I had to do this because the data sources documentation has no information on this topic.
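To make these questions concrete, here is a minimal sketch of one way an item pipeline could write to a relational table. It is not Crawlab's actual implementation. It uses `sqlite3` from the Python standard library as a stand-in for PostgreSQL, and the function name `save_item_sketch`, the table name `results`, and every behaviour marked as an assumption in the comments are hypothetical. Each assumption corresponds to one point the documentation could state explicitly.

```python
import sqlite3


def save_item_sketch(conn, table, item, task_id):
    """Insert one scraped item into an existing table (hypothetical behaviour)."""
    # Assumption: the task id is stored alongside the item in a `_tid` column.
    row = dict(item, _tid=task_id)
    # Assumption: column names are taken directly from the item's keys.
    columns = list(row)
    col_sql = ", ".join(f'"{c}"' for c in columns)
    placeholders = ", ".join("?" for _ in columns)
    # Assumption: the table is NOT created automatically and must already exist.
    # Table and column names are treated as trusted here; values are bound safely.
    conn.execute(
        f'INSERT INTO "{table}" ({col_sql}) VALUES ({placeholders})',
        [row[c] for c in columns],
    )
    conn.commit()


# Usage: create the table by hand, then save one item into it.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE results (title TEXT, url TEXT, _tid TEXT)")
save_item_sketch(
    conn,
    "results",
    {"title": "Example", "url": "https://example.com"},
    task_id="task-1",
)
rows = conn.execute("SELECT title, url, _tid FROM results").fetchall()
print(rows)
```

If the real pipeline differs on any of these points (for example, by creating the table itself or by also writing `_pid`), that is exactly the kind of detail the data sources documentation would need to spell out.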

Describe the solution you'd like

Additional information in the data sources documentation.

Describe alternatives you've considered

Other suggestions

Additional note

Thank you for the awesome software 🙏🏻

llmontoryxd commented 6 months ago

I'm also trying to use Crawlab, but I have no idea how to use it with my PostgreSQL database.