akherlan / momoshop

Historical pricing data for local brand :indonesia: clothing, fashion & beauty products
0 stars 0 forks source link

Automating dataset extraction (pipeline) and proper storage #16

Open akherlan opened 11 months ago

akherlan commented 11 months ago

Currently, data will be collecting and combining in pulldata branch and still run manually every certain period of time or when needed. It might be run by event triggered action in CI/CD for a while during development. But, must be needed more proper workflow (and tech stack?) and proper storage. Not parquet files dataset. The size is getting bigger and bigger.

Task:

akherlan commented 10 months ago

Using Supabase for data pull. Planned workflow:

scraper > transformer > supabase